Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

MEDHA VIJAYVARGIA

Jacksonville,FL

Summary

Results-driven Senior Data Engineer with almost 13 years of expertise in designing and implementing scalable data infrastructures. Certified in Google Cloud and AWS, I excel in leveraging AWS, Azure, and GCP to uncover insights from vast data ecosystems. Proficient in Python and SQL, I transform complex data into actionable insights using advanced Machine Learning and AI techniques. Skilled in creating impactful visualizations with various reporting tools, I ensure robust pipeline construction, precise modeling, and stringent compliance.

Overview

14
14
years of professional experience
1
1
Certification

Work History

Senior Data Engineer

Contract Jobs - Jacksonville, FL
Jacksonville, FL
04.2023 - Current
  • Led successful data migration projects, reducing data redundancy by 50% and enhancing data availability by 20%
  • Enhanced Big Data workflows through AWS S3, Lambda, Step Functions, and EMR cluster integration, resulting in a 50% decrease in data transfer time and 30% lower processing costs
  • Coordinated teams and stakeholders, improving project timelines by 15% and reducing conflicts by 25%
  • Received an average stakeholder satisfaction rating of 9.5/10
  • Streamlined data processing with a 5-step automated workflow, reducing manual intervention by 80% and improving system performance by 25%
  • Improved data accessibility and integration efficiency by 40% by implementing processes for data ingestion and integration using GCP and AWS, enhancing the company's data infrastructure
  • Ensured data accuracy and consistency with a 99.9% success rate by developing and managing ETL pipelines to ingest data from various sources
  • Streamlined Snowflake queries and warehouse configurations, improving overall query performance by 40%
  • Designed Snowflake schemas, cutting storage costs by 30% with SQL and AWS
  • Improved data processing efficiency by 30% and reliability by 25% by utilizing Apache Airflow for orchestrating complex workflows
  • Developed and deployed 10+ AI/ML models using frameworks like TensorFlow, PyTorch, and scikit-learn, resulting in a 30% reduction in processing time
  • Clearly communicated complex technical concepts to non-technical stakeholders and team members, improving cross-departmental understanding by 30%.

Senior Data Engineer

Johnson & Johnson | Icube Consulting Service
Jacksonville, FL
04.2023 - Current
  • Led successful data migration projects, reducing data redundancy by 50% and enhancing data availability by 20%
  • Enhanced Big Data workflows through AWS S3, Lambda, Step Functions, and EMR cluster integration, resulting in a 50% decrease in data transfer time and 30% lower processing costs
  • Coordinated teams and stakeholders, improving project timelines by 15% and reducing conflicts by 25%
  • Received an average stakeholder satisfaction rating of 9.5/10
  • Streamlined data processing with a 5-step automated workflow, reducing manual intervention by 80% and improving system performance by 25%
  • Improved data accessibility and integration efficiency by 40% by implementing processes for data ingestion and integration using GCP and AWS, enhancing the company's data infrastructure
  • Ensured data accuracy and consistency with a 99.9% success rate by developing and managing ETL pipelines to ingest data from various sources
  • Streamlined Snowflake queries and warehouse configurations, improving overall query performance by 40%
  • Designed Snowflake schemas, cutting storage costs by 30% with SQL and AWS
  • Improved data processing efficiency by 30% and reliability by 25% by utilizing Apache Airflow for orchestrating complex workflows
  • Developed and deployed 10+ AI/ML models using frameworks like TensorFlow, PyTorch, and scikit-learn, resulting in a 30% reduction in processing time
  • Clearly communicated complex technical concepts to non-technical stakeholders and team members, improving cross-departmental understanding by 30%.

Senior Data Engineer

American Tire Distributor | Miracle System Software
Jacksonville, FL
11.2021 - 03.2023
  • Orchestrated data quality initiatives by implementing Grafana checks using the Kibana dashboard, leading to a seamless and successful tire company launch with a 20% reduction in data discrepancies
  • Refined data quality by meticulously inserting data using Python, resulting in 15% increase in data accuracy and facilitating seamless analysis in GCP's BigQuery environment
  • Utilized Pub/Sub for real-time data streaming, enabling instant data updates and achieving 30% faster analytics response times
  • Implemented Snowflake data sharing, reducing data duplication by 50% with AWS Glue and Python
  • Crafted a scalable GCP data lake, resulting in a 30% reduction in monthly storage expenses
  • Managed petabytes of structured JSON data with precision
  • Designed GCP data architecture, processing terabytes of tire sales data and uncovering insights across 1012 distributors, leading to 20% faster data retrieval
  • Enabled data-driven decision-making, increasing reporting efficiency by 35% by designing and optimizing the data architecture to support analytics and reporting requirements
  • Upgraded data processing performance through query optimization and efficient resource utilization, reducing processing time by 30%
  • Boosted project delivery time by 20% by working closely with cross-functional teams to understand data requirements and deliver solutions that meet business needs
  • Executed numerical computations on 1 million+ data points with NumPy and Panda speeding up calculations by 70%
  • Increased automation and system reliability by 30% by automating data workflows and implementing monitoring solutions to ensure data quality
  • Architected automated data pipelines with Apache Beam and Dataflow, cutting manual intervention and processing time by 70%
  • Resolved issues with a 95% success rate, reducing project delays by 20%
  • Identified and resolved issues with a 95% success rate, implementing solutions that lowered project delays by 20%.

Freelance Data Engineer

Dish Network | Tenxpert Inc.
Jacksonville, FL
01.2020 - 11.2021
  • Implemented real-time data streaming solutions with Pub/Sub and Dataflow, leveraging Python for data processing and achieving a 25% reduction in data latency, processing over 100 million events per day
  • Created and managed data warehouses on BigQuery, integrating complex SQL queries with Python for advanced analytics, resulting in a 35% increase in query performance
  • Optimized Snowflake SQL queries, reducing execution time by 60%.

Data Engineer

Barclays Bank | Accenture Ltd
Pune, Maharashtra
05.2017 - 02.2019
  • Enhanced data processing efficiency by 40% and system scalability by utilizing Python for AWS Lambda functions and automation scripts
  • Developed 20+ ETL pipelines in Snowflake, ensuring 99.9% uptime with AWS Lambda and Python
  • Built and maintained relationships with over 50 clients and team members, resolving conflicts with a 90% satisfaction rate and introducing innovative approaches that enhanced project efficiency by 15%.

Data Engineer

Fiserv Ltd
Pune, Maharashtra
03.2018 - 01.2019
  • Developed Apache Spark data pipelines, utilizing AWS S3's JSON raw bucket, intelligently converting to Parquet to reduce processing time by 70%
  • Seamlessly integrated data frames using boto3, achieving a 95% boost in data processing speed and enriched insights
  • Monitored and improved EMR performance through AWS CloudWatch, introducing custom logs for memory issue detection
  • This customization resulted in 40% faster issue resolution and fine-tuned resource allocation efficiency
  • Adapted to new technologies and project requirements, managing up to 5 concurrent projects and meeting 100% of deadlines
  • Led the adoption of scalable data pipelines using Apache Spark on AWS, processing an average of 100 million JSON records per day and converting them into Parquet files and used data frames resulting in 50% reduction in data processing time
  • Employed AWS Redshift for data warehousing, leading to a 30% optimize query performance and reducing data retrieval time by 50%, resulting in accelerated business analytics and tableau reporting processes.

Data Engineer

Applied Optimization Pvt Ltd
Pune, Maharashtra
02.2015 - 07.2016
  • Enabled real-time data streaming, improving data flow by 30%, and lessened data transfer time by 50% by implementing AWS services including Kafka and AWS DataSync for efficient data transfer
  • Streamlined infrastructure management and improved deployment consistency by 25% by utilizing Terraform for infrastructure as code.

Data Engineer

Sigma Infotech
Bhopal, Madhya Pradesh
08.2010 - 07.2014
  • Improved data processing efficiency by 50% using AWS Glue and Apache Spark
  • Reduced processing time by 70% using Apache Spark data pipelines
  • Implemented vectorized operations in NumPy, Panda boosting processing speed by 60% with Python and AWS.

Education

Master of Science in Computer Science -

University of Central Missouri
Warrensburg, MO
01.2020

Bachelor of Engineering -

Chhattisgarh Swami Vivekanand Technical University
Raipur, Chhattisgarh
01.2010

Skills

Programming Languages: Python, SQL, Java, Scala, R, and C#

Cloud Technologies: AWS: EC2, S3, Lambda, Glue, Data Pipeline, Kinesis;

Google Cloud Platform: BigQuery, Bigtable, Pub/Sub

Databases and Data Warehousing: Postgres, RDS, Redshift, Snowflake, DynamoDB, DBT

Big Data Technologies: Hadoop, Apache Spark (EMR), Apache Kafka, Apache HBase, Apache Hive Data: Data Analytics, Data Visualization, Data Modeling, Data Processing, Data Pipeline, Data Management

Machine Learning Algorithms: Regression, Classification, Clustering, Decision Trees, Random Forests, Neural Networks, and Vertex AI

Data Visualization: Tableau, Power BI, Matplotlib, SSRS, Crystal Reporting

Orchestration and ETL tools: Apache Airflow, Docker and Kubernetes, Talend, SSIS

Data Security and Governance: HIPAA Compliance

Project Management: Service Now, Jira Management, Kibana, Incident Management, Agile Methodologies Leadership: Problem-solving, Collaborative, Innovative

Certification

  • Google Certified Professional Data Engineer, 06/01/22
  • AWS Certified Solutions Architect, 07/01/24

Timeline

Senior Data Engineer

Contract Jobs - Jacksonville, FL
04.2023 - Current

Senior Data Engineer

Johnson & Johnson | Icube Consulting Service
04.2023 - Current

Senior Data Engineer

American Tire Distributor | Miracle System Software
11.2021 - 03.2023

Freelance Data Engineer

Dish Network | Tenxpert Inc.
01.2020 - 11.2021

Data Engineer

Fiserv Ltd
03.2018 - 01.2019

Data Engineer

Barclays Bank | Accenture Ltd
05.2017 - 02.2019

Data Engineer

Applied Optimization Pvt Ltd
02.2015 - 07.2016

Data Engineer

Sigma Infotech
08.2010 - 07.2014

Master of Science in Computer Science -

University of Central Missouri

Bachelor of Engineering -

Chhattisgarh Swami Vivekanand Technical University
MEDHA VIJAYVARGIA