Seasoned Data Engineer with 3+ years of experience excelling in optimizing data storage mechanisms and contributing across diverse Data Engineering domains, including Data Pipeline Design, Analysis, Integration, and Governance.
Proficient in data modeling, schema design, and performance optimization, leveraging cloud platforms (AWS, Azure) to deliver scalable and resilient data solutions.
Demonstrated expertise in SQL, encompassing Joins, Aggregation, Windowing functions, and relational databases (Oracle, MySQL, PostgreSQL) for modeling, querying, and administration tasks.
Proficient in Python and PySpark for automating data processing workflows and enhancing efficiency in data engineering processes.
Strong analytical, problem-solving, communication, and interpersonal skills, ensuring successful project execution and fostering effective collaboration in cross-functional team environments.
Overview
4
4
years of professional experience
1
1
Certification
Work History
Data Engineer
Molina HealthCare
01.2024 - Current
Spearheaded the strategic migration of healthcare data and applications to AWS cloud platforms, achieving enhanced performance, substantial cost savings, and robust disaster recovery and high availability solutions
Orchestrated the design and implementation of a hybrid cloud system utilizing AWS and Azure, enhancing data processing efficiency by 35% and ensuring flawless data flow between AWS Glue and Azure Synapse Analytics for processing the data
Achieved enhanced data accessibility and security by integrating AWS CloudFront for content delivery, reducing latency and improving user experience, alongside the implementation of AWS IAM policies and security best practices to prevent unauthorized access
Utilized AWS CloudWatch for continuous monitoring, identifying and addressing performance issues to maintain a notable 20% increase in resource uptime
Designed and deployed custom Kafka Connect connectors to integrate with non-relational databases, cloud storage solutions, and third-party APIs, expanding data integration capabilities beyond traditional RDBMS systems
Developed customized Spark transformations and actions in Python, augmenting Spark's capabilities to manage intricate data processing tasks, resulting in a 50% increase in data throughput and significantly improved operational efficiency
Crafted advanced SQL stored procedures for OLTP database solutions on Microsoft SQL Server, ensuring optimized data processing and management efficiency
Established and enforced rigorous data governance practices to maintain data integrity and compliance, adhering to industry standards and significantly improving overall data management protocols.
Data Analyst
Trigent Software
04.2020 - 07.2022
Architected, developed, and implemented a robust big data platform, delivering strategic solutions to tackle key business challenges and enhance data analysis capabilities
Extensively leveraged Azure Databricks and Data Factory as ETL platforms, ensuring efficient and effective data processing and transformation
Automated data extraction jobs from multiple data sources, including Oracle, SQL Server, and MySQL, seamlessly pushing result sets to cloud storage solutions like Azure Blob Storage
Leveraged data-driven decision-making by 35% through developing interactive reports and dashboards in Power BI to visualize KPIs and trends and improved report generation speed by 25% by optimizing Power BI reports for performance
Achieved a 30% increase in data analysis efficiency by effectively manipulating and analyzing large datasets using functions like VLOOKUP, INDEX/MATCH, and SUMIFS
Implemented robust data encryption and access controls to safeguard sensitive risk management data, leveraging Azure Key Vault and Snowflake's advanced security features
Integrated SSIS with SQL Server ecosystem components, such as SQL Server Database Engine, SQL Server Analysis Services (SSAS), and SQL Server Reporting Services (SSRS), creating end-to-end data solutions to support business intelligence and reporting needs
Developed and maintained data pipelines for ingesting, processing, and analysing semi-structured and unstructured data in NoSQL databases, ensuring data consistency and reliability
Demonstrated advanced proficiency in Spark Core, SQL, and Spark Streaming for comprehensive and efficient data processing
Collaborated within Agile frameworks, actively engaging in sprint planning, daily stand-ups, and retrospectives to ensure timely project delivery, continuous improvement, and alignment with business objectives.
Jr Software Engineer Intern
Fly High Solutions
12.2019 - 03.2020
Executed the Person Verification project, strategically promoting Angus Ltd products through thematic campaigns; achieved a 30% rise in conversion rates and enhanced brand visibility by 20%
Assisted in developing front-end and back-end modules using Python with Django Framework, and supported in designing MySQL-based data management systems
Supported integration efforts with Angus Ltd's CRM system using RESTful APIs and JSON for enhanced data interoperability
Collaborated on exploring machine learning techniques for enhancing data analysis and prediction accuracy within the project scope
Actively engaged in learning and professional development to enhance skills in SQL technologies, database management systems, and data engineering methodologies.
Education
Master of Science - Computer Science with Data Science
University of Missouri Kansas
Kansas City, Missouri
Bachelor of Technology - Computer Science and Engineering
Sri Venkateshwara University
Skills
Big Data Tools: Hadoop Ecosystem: Map Reduce, Spark, Airflow, HBase, Hive, Pig, Kafka, Hadoop
Programming Languages: Python, R, Java, SQL, and Scala
Methodologies: System Development Life Cycle (SDLC), Agile
Cloud Platform: AWS, Azure
Data Visualization Tools: Power BI, Tableau, MS Excel