Summary
Overview
Work History
Education
Skills
Websites
References
Timeline
Generic
Hemanthkumar Vinnakota

Hemanthkumar Vinnakota

Mckinney,TX

Summary

With a solid foundation as a Data Analyst, my journey has evolved into that of a seasoned Data Engineer. My transition bridges the realms of data analysis and engineering, combining analytical finesse with technical prowess. Proficient in statistical analysis, data modeling, and Python programming, I now focus on architecting and optimizing data pipelines, ensuring the seamless flow of information across organizations. My unique perspective, born from years of dissecting data, emphasizes the strategic value of quality data as a vital asset. I'm poised to construct the very infrastructure that underpins data-driven decision-making, guiding organizations to excel in the data-driven era.

Overview

4
4
years of professional experience

Work History

Data Engineer

NIKE
03.2023 - Current
  • Analyzed business requirements, facilitating the planning and implementation phases of the OLAP project
  • Oversaw data mapping, data movement, interfaces, and analytics, ensuring data quality and alignment with project goals
  • Collaborated with project managers to address data-related issues, integration challenges, and compatibility concerns
  • Created and managed AWS EC2 instances and AWS EMR clusters for development and testing
  • Developed end-to-end Spark applications using Scala for data cleansing, validation, transformation, and summarization
  • Utilized Python and Spark SQL for efficient data processing, optimizing query performance and memory usage
  • Designed and implemented Simple to complex MapReduce Jobs using Hive and Pig
  • Led efforts to review and improve data architecture processes, policies, and technology vision, exploring emerging technologies
  • Leveraged Spark, Scala, and Python for querying and data preparation from diverse big data sources
  • Implemented pre-processing queries in Python for internal Spark jobs
  • Prepared informative Tableau dashboards summarizing Configuration, Quotes, Orders, and e-commerce data
  • Conducted Extract, Transform, and Load (ETL) processes using Spark Data Frames to integrate data from multiple sources (JSON, relational databases, etc.)
  • Generated complex JSON data structures to optimize data storage and access as per client requirements
  • Utilized SQL queries and tools for data analysis, profiling, and querying databases
  • Collaborated in an agile environment, participating in daily SCRUM meetings, sprint planning, showcases, and retrospectives
  • Ensured code quality through development, peer review, and bug fixing.

Data Analyst

CGI
12.2020 - 07.2021
  • Over a span of 2 years, led a sustained project focused on data validation, query optimization, and streamlined data processing within the Snowflake and Talend ecosystem, resulting in consistent up to 40% improvement in query performance
  • Implemented and maintained data validation workflows using Talend and Python scripts, ensuring continuous data consistency and reliability across Snowflake data repositories
  • Leveraged Snowflake's query optimization capabilities to fine-tune SQL queries, delivering sustained query performance enhancements and responsiveness over the duration
  • Integrated data validation and optimized query executions into Talend ETL workflows, automating data quality checks and query efficiency enhancements throughout the project
  • Demonstrated expertise in Snowflake for optimizing data warehousing solutions, including schema design, clustering, and materialized views, contributing to sustained improvements in data retrieval and storage efficiency
  • Consistently utilized Snowflake's integration with Talend for efficient data loading, transformation, and enrichment, maintaining streamlined data workflows
  • Proficiently processed and transformed data using Python, ensuring ongoing data quality and analysis readiness within the Snowflake and Talend ecosystem
  • Maintained consistency and best practices in cross-platform data processing and SQL querying across multiple data sources, leveraging Snowflake, Talend, and Python, ensuring sustained project success.

Education

Master of science - data science

SOUTHERN ARKANSAS UNIVERSITY
MAGNOLIA, ARKANSAS
12.2021

Bachelor of science - COMPUTER engineering

LOVELY PROFESSIONAL UNIVERSITY
PHAGWARA, PUNJAB
05.2020

Skills

  • Technical skills
  • Data Migration
  • Data Modeling
  • Data Security
  • Machine Learning
  • API Development
  • Performance Tuning
  • Scripting Languages
  • NoSQL Databases
  • Big data technologies
  • Data Warehousing
  • SQL transactional replications
  • Risk Analysis
  • Data Analysis
  • Database Design
  • SQL and Databases
  • Business Intelligence
  • Database Administration
  • Relational databases
  • Problem-Solving

References

Will be provided upon request.

Timeline

Data Engineer

NIKE
03.2023 - Current

Data Analyst

CGI
12.2020 - 07.2021

Master of science - data science

SOUTHERN ARKANSAS UNIVERSITY

Bachelor of science - COMPUTER engineering

LOVELY PROFESSIONAL UNIVERSITY
Hemanthkumar Vinnakota