Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

SREESANTHREDDY SAMALA

Kansas City,MO

Summary

Seasoned Data Engineer with 3+ years of experience excelling in optimizing data storage mechanisms and contributing across diverse Data Engineering domains, including Data Pipeline Design, Analysis, Integration, and Governance. Proficient in data modeling, schema design, and performance optimization, leveraging cloud platforms (AWS, Azure) to deliver scalable and resilient data solutions. Demonstrated expertise in SQL, encompassing Joins, Aggregation, Windowing functions, and relational databases (Oracle, MySQL, PostgreSQL) for modeling, querying, and administration tasks. Proficient in Python and PySpark for automating data processing workflows and enhancing efficiency in data engineering processes. Strong analytical, problem-solving, communication, and interpersonal skills, ensuring successful project execution and fostering effective collaboration in cross-functional team environments.

Overview

4
4
years of professional experience
1
1
Certification

Work History

Data Engineer

Molina HealthCare
01.2024 - Current
  • Spearheaded the strategic migration of healthcare data and applications to AWS cloud platforms, achieving enhanced performance, substantial cost savings, and robust disaster recovery and high availability solutions
  • Orchestrated the design and implementation of a hybrid cloud system utilizing AWS and Azure, enhancing data processing efficiency by 35% and ensuring flawless data flow between AWS Glue and Azure Synapse Analytics for processing the data
  • Achieved enhanced data accessibility and security by integrating AWS CloudFront for content delivery, reducing latency and improving user experience, alongside the implementation of AWS IAM policies and security best practices to prevent unauthorized access
  • Utilized AWS CloudWatch for continuous monitoring, identifying and addressing performance issues to maintain a notable 20% increase in resource uptime
  • Designed and deployed custom Kafka Connect connectors to integrate with non-relational databases, cloud storage solutions, and third-party APIs, expanding data integration capabilities beyond traditional RDBMS systems
  • Developed customized Spark transformations and actions in Python, augmenting Spark's capabilities to manage intricate data processing tasks, resulting in a 50% increase in data throughput and significantly improved operational efficiency
  • Crafted advanced SQL stored procedures for OLTP database solutions on Microsoft SQL Server, ensuring optimized data processing and management efficiency
  • Established and enforced rigorous data governance practices to maintain data integrity and compliance, adhering to industry standards and significantly improving overall data management protocols.

Data Analyst

Trigent Software
04.2020 - 07.2022
  • Architected, developed, and implemented a robust big data platform, delivering strategic solutions to tackle key business challenges and enhance data analysis capabilities
  • Extensively leveraged Azure Databricks and Data Factory as ETL platforms, ensuring efficient and effective data processing and transformation
  • Automated data extraction jobs from multiple data sources, including Oracle, SQL Server, and MySQL, seamlessly pushing result sets to cloud storage solutions like Azure Blob Storage
  • Leveraged data-driven decision-making by 35% through developing interactive reports and dashboards in Power BI to visualize KPIs and trends and improved report generation speed by 25% by optimizing Power BI reports for performance
  • Achieved a 30% increase in data analysis efficiency by effectively manipulating and analyzing large datasets using functions like VLOOKUP, INDEX/MATCH, and SUMIFS
  • Implemented robust data encryption and access controls to safeguard sensitive risk management data, leveraging Azure Key Vault and Snowflake's advanced security features
  • Integrated SSIS with SQL Server ecosystem components, such as SQL Server Database Engine, SQL Server Analysis Services (SSAS), and SQL Server Reporting Services (SSRS), creating end-to-end data solutions to support business intelligence and reporting needs
  • Developed and maintained data pipelines for ingesting, processing, and analysing semi-structured and unstructured data in NoSQL databases, ensuring data consistency and reliability
  • Demonstrated advanced proficiency in Spark Core, SQL, and Spark Streaming for comprehensive and efficient data processing
  • Collaborated within Agile frameworks, actively engaging in sprint planning, daily stand-ups, and retrospectives to ensure timely project delivery, continuous improvement, and alignment with business objectives.

Jr Software Engineer Intern

Fly High Solutions
12.2019 - 03.2020
  • Executed the Person Verification project, strategically promoting Angus Ltd products through thematic campaigns; achieved a 30% rise in conversion rates and enhanced brand visibility by 20%
  • Assisted in developing front-end and back-end modules using Python with Django Framework, and supported in designing MySQL-based data management systems
  • Supported integration efforts with Angus Ltd's CRM system using RESTful APIs and JSON for enhanced data interoperability
  • Collaborated on exploring machine learning techniques for enhancing data analysis and prediction accuracy within the project scope
  • Actively engaged in learning and professional development to enhance skills in SQL technologies, database management systems, and data engineering methodologies.

Education

Master of Science - Computer Science with Data Science

University of Missouri Kansas
Kansas City, Missouri

Bachelor of Technology - Computer Science and Engineering

Sri Venkateshwara University

Skills

  • Big Data Tools: Hadoop Ecosystem: Map Reduce, Spark, Airflow, HBase, Hive, Pig, Kafka, Hadoop
  • Programming Languages: Python, R, Java, SQL, and Scala
  • Methodologies: System Development Life Cycle (SDLC), Agile
  • Cloud Platform: AWS, Azure
  • Data Visualization Tools: Power BI, Tableau, MS Excel
  • Packages: NumPy, Pandas, Matplotlib, SciPy, dplyr
  • Databases: MySQL, SQL Server, Azure SQL DB, Dynamo DB, Oracle 12c/11g
  • ETL/Data warehouse Tools: Informatica PowerCenter, SSIS, Apache NiFi, Talend

Certification

Microsoft Certified: Azure Data Engineer Associate. Certified in AWS, Azure, GCP Databricks Platform Architect, Databricks Lakehouse Fundamentals, Agile Project Management (by Atlassian, LinkedIn), Data Analysis (Microsoft/LinkedIn), GitHub Professional (by GitHub), Simplifying Data Pipelines with Apache Kafka (by IBM).

Timeline

Data Engineer

Molina HealthCare
01.2024 - Current

Data Analyst

Trigent Software
04.2020 - 07.2022

Jr Software Engineer Intern

Fly High Solutions
12.2019 - 03.2020

Master of Science - Computer Science with Data Science

University of Missouri Kansas

Bachelor of Technology - Computer Science and Engineering

Sri Venkateshwara University
SREESANTHREDDY SAMALA