
Srinivasu Dama

Frisco, TX

Summary

Transitioning from a data-centric environment with a focus on developing efficient data solutions and optimizing workflows. Skilled in data architecture, database management, SQL, and Python, with a track record of enhancing data-driven decision-making processes. Seeking to apply these transferable skills in a new field, bringing a consultative approach to solving complex problems and improving operational efficiency.

Overview

8 years of professional experience

Work History

Sr. Data Engineer

CITI
08.2023 - Current
  • Designed and implemented real-time ETL pipelines using Apache Kafka, Spark Streaming, and NiFi to process and analyze high-velocity financial data
  • Built and maintained data lakes and warehouses on AWS using S3, Redshift, and Snowflake for scalable data storage and analytics
  • Optimized Spark jobs by using efficient file formats such as ORC, Parquet, and Avro to improve data processing performance
  • Deployed scalable data ingestion frameworks for structured and unstructured data from multiple sources, including relational and NoSQL databases
  • Developed and automated workflows using Apache Airflow and Control-M for data scheduling and orchestration
  • Ensured high availability and scalability by containerizing data pipelines with Docker and orchestrating deployments with Kubernetes
  • Conducted real-time anomaly detection and financial fraud analysis using Spark MLlib and Python libraries (pandas, scikit-learn)
  • Collaborated with compliance teams to ensure data security, privacy, and adherence to industry regulations like SOX and PCI DSS
  • Created interactive dashboards with Tableau and Amazon QuickSight for financial risk insights and monitoring
  • Provided technical documentation and training to data engineering and business intelligence teams for effective system utilization

Big Data Engineer

Florida Blue
02.2022 - 07.2023
  • Built scalable ETL pipelines using Apache NiFi, Spark, and HiveQL to ingest, transform, and process large volumes of structured and unstructured healthcare claims data
  • Designed and implemented a centralized data lake on AWS using S3, Glue, and Redshift for secure and efficient data storage and querying
  • Utilized Snowflake for advanced data warehousing and analytics, enabling faster query execution and integration with reporting tools
  • Enhanced fraud detection systems by implementing real-time anomaly detection frameworks with Kafka and Spark Streaming
  • Conducted data cleansing, transformation, and enrichment to ensure high data quality and integrity
  • Automated workflows and job scheduling using Apache Airflow and Control-M to streamline daily operations
  • Collaborated with compliance teams to ensure adherence to healthcare data privacy regulations, including HIPAA
  • Developed dashboards and reports using Tableau and Amazon QuickSight to provide actionable insights for stakeholders
  • Performed root cause analysis and performance optimization for Spark jobs to improve processing times and resource utilization
  • Documented technical workflows and conducted knowledge-sharing sessions to support the data engineering team

Data Engineer

Mastercard
04.2019 - 12.2021
  • Built and optimized data pipelines using Apache Spark, Kafka, and NiFi for real-time ingestion and transformation of transaction data
  • Designed and implemented a secure, scalable data lake architecture on AWS using S3, Redshift, and Glue for data storage and analytics
  • Developed ETL processes to transform raw transaction data into actionable insights, ensuring high accuracy and reliability
  • Integrated Snowflake for advanced analytics and faster query execution, enabling complex financial reporting and risk analysis
  • Conducted data modeling and warehousing to support Mastercard's data analytics initiatives, focusing on scalability and performance optimization
  • Enhanced fraud detection systems using machine learning models implemented in Python and integrated with Spark MLlib
  • Automated workflows and job scheduling using Apache Airflow and Control-M for efficient pipeline orchestration
  • Monitored and optimized Spark job performance through memory tuning and efficient file formats such as ORC, Parquet, and Avro
  • Ensured compliance with PCI DSS standards and implemented data security measures, including encryption and role-based access controls
  • Developed interactive dashboards and visualizations using Tableau and Amazon QuickSight for real-time transaction monitoring and business insights

Data Engineer

Cognizant
06.2017 - 03.2019
  • Developed and maintained ETL pipelines using Apache Spark, Sqoop, and Hive for data ingestion and processing across various retail systems
  • Built a centralized data lake on Hadoop and AWS S3 to store and manage structured and unstructured retail data
  • Optimized Spark jobs using memory tuning and efficient file formats such as Parquet, ORC, and Avro for faster query execution
  • Integrated streaming data from multiple sources using Apache Kafka and processed it in real time with Spark Streaming
  • Conducted data modeling and implemented data warehousing solutions using Hive and Snowflake to support advanced analytics and reporting
  • Automated workflows and job scheduling using Apache Oozie and Control-M, reducing manual intervention and ensuring smooth pipeline operations
  • Collaborated with data science teams to support advanced analytics use cases, including customer segmentation and demand forecasting
  • Ensured data security and compliance by implementing encryption and access control policies
  • Created interactive dashboards and reports using Tableau to visualize sales performance and inventory trends for stakeholders
  • Provided end-user training and documentation to ensure effective use of the analytics platform

Skills

  • Git version control
  • ETL development
  • Big data processing
  • Python programming

Timeline

Sr. Data Engineer

CITI
08.2023 - Current

Big Data Engineer

Florida Blue
02.2022 - 07.2023

Data Engineer

Mastercard
04.2019 - 12.2021

Data Engineer

Cognizant
06.2017 - 03.2019