Results-driven Data Engineer with over 5 years of experience designing and optimizing data processing solutions on Microsoft Azure. Proven track record of automating data pipelines and enhancing ETL workflows using Bash, Perl, and Python. Experienced in implementing DevOps practices for CI/CD and in using Azure Synapse Studio and SQL Server development to streamline operations. Adept at user-friendly UI/UX design for data visualization with Adobe XD and Figma, and skilled in real-time data streaming with Apache Kafka.
Highly productive Data Engineer with a background in designing, testing, and maintaining data management systems. Skilled in big data processing frameworks such as Hadoop and Apache Spark, database management with SQL, database design, data mining, and data visualization with tools such as Tableau. Applies machine learning to improve business decision-making; previous work optimized data retrieval processes and improved system efficiency. Excels at problem-solving, collaboration, and adaptability, bringing analytical thinking, creativity, and a passion for continuous learning to high-impact data solutions in fast-paced, diverse environments.
Overview
7 years of professional experience
1 Certification
Work History
Data Engineer
UnitedHealth Group
07.2023 - Current
Designed, constructed, and maintained scalable data pipelines for data ingestion, cleaning, and processing using Python and SQL.
Conducted data analysis using SQL and Python to derive insights and support decision-making processes.
Monitored data systems performance, identifying bottlenecks and implementing solutions to maintain system efficiency.
Managed version control and deployment of data applications using Git, Docker, and Jenkins.
Established and enforced data governance policies and procedures to comply with regulatory requirements and ensure data privacy.
Implemented and optimized big data storage solutions, including Hadoop and NoSQL databases, to improve data accessibility and efficiency.
Provided technical mentorship to junior data engineers, guiding them on best practices and project execution.
Streamlined data flow from diverse sources using ETL tools such as Talend, Informatica, and Airflow.
Implemented data visualization tools like Tableau and Power BI to create dashboards and reports for business stakeholders.
Analyzed user requirements and designed and developed ETL processes to load enterprise data into the data warehouse.
Configured and maintained cloud-based data infrastructure on platforms like AWS, Azure, and Google Cloud to enhance data storage and computation capabilities.
Developed Python scripts to extract data from web service APIs and load it into databases.
Designed data warehousing solutions, applying dimensional modeling techniques for optimized data retrieval.
Developed and deployed machine learning models for predictive analytics, utilizing Spark and TensorFlow.
Collaborated with cross-functional teams to gather requirements and translate business needs into technical specifications for data solutions.
Participated in agile development processes, contributing to sprint planning, stand-ups, and reviews to ensure timely delivery of data projects.
Conducted system analysis and testing to identify and resolve technical issues or inefficiencies.
Data Engineer
MGM Resorts International
01.2023 - 05.2023
Designed, constructed, and maintained scalable data pipelines for data ingestion, cleaning, and processing using Python and SQL.
Managed version control and deployment of data applications using Git, Docker, and Jenkins.
Established and enforced data governance policies and procedures to comply with regulatory requirements and ensure data privacy.
Conducted rigorous testing and validation of data pipelines to ensure accuracy and completeness of data.
Streamlined data flow from diverse sources using ETL tools such as Talend, Informatica, and Airflow.
Implemented data visualization tools like Tableau and Power BI to create dashboards and reports for business stakeholders.
Developed and implemented data models, database designs, and data access and table maintenance code.
Analyzed user requirements and designed and developed ETL processes to load enterprise data into the data warehouse.
Configured and maintained cloud-based data infrastructure on platforms like AWS, Azure, and Google Cloud to enhance data storage and computation capabilities.
Optimized SQL queries and database schemas for performance improvements in data retrieval operations.
Developed Python scripts to extract data from web service APIs and load it into databases.
Data Engineer
Citi Bank
06.2018 - 01.2022
Designed and managed ETL workflows using Talend and SSIS, optimizing SQL Server performance for financial data processing and aligning with machine learning-based solutions for enhanced data analysis.
Automated data integration from multiple sources, including Azure Data Lake and Fabric, ensuring consistent and timely reporting of e-commerce transaction data.
Developed and optimized data pipelines in PySpark for large-scale financial transaction processing, ensuring efficient data transformation and integration within the banking system.
Implemented Databricks solutions for processing banking data, enhancing data quality and analytics workflows.
Designed and maintained database schemas to support high-performance querying and transaction processing.
Architected and deployed secure cloud-based infrastructure for banking applications, utilizing AWS services (EC2, S3) and implementing best practices for network security and data protection.
Led the implementation of CI/CD pipelines using Jenkins and AWS tools to automate testing and deployment of banking applications, reducing release cycle times and increasing code quality.
Optimized T-SQL stored procedures, improving query execution by 25%, enhancing the performance of financial reports, and leveraging ELK Stack for efficient log analysis and performance monitoring.
Ensured data governance compliance for sensitive Personally Identifiable Information (PII), incorporating machine learning models to enhance security and adhere to regulatory standards.
Developed and deployed real-time API integrations for financial data analysis, supporting SQL database integration and leveraging AI/ML-based solutions for deeper insights.
Collaborated with business teams to define and track key performance indicators (KPIs) using Power BI and Tableau.
Orchestrated data workflows using Apache NiFi and Airflow, improving pipeline reliability for critical financial systems and integrating Python ML frameworks for predictive analysis.
Engineered real-time data ingestion and analysis solutions using Azure Event Hubs and Stream Analytics.
Senior Provider Relations Advocate, Account Manager at UnitedHealth Care, UnitedHealth Group
Clinical Transformation Manager at UnitedHealth Group, UnitedHealth Care Division