Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Harshitha Tallapally

Cincinnati,OH

Summary

Data Engineer with over 2 years of experience specializing in data engineering tasks and creating efficient data pipelines. Proficient in programming languages such as Python, R, and SQL for data manipulation and analysis. Skilled in using reporting tools like Power BI and Tableau to visualize and present data in a meaningful and interactive manner. Experienced in utilizing libraries including Pandas, NUMPY, SEABORN, GGPLOT, MATPLOTLIB, and PLOTLY, for data manipulation and visualization. Strong understanding of big data processing frameworks such as Data-bricks, Apache Spark, and Hadoop for large scale data processing and analysis. Proficient in working with databases such as MongoDB, MySQL, Oracle, and Google Big Query for designing and optimizing database structures and implementing ETL processes. Knowledgeable in machine learning libraries such as TensorFlow, KERAS, PyTorch, and SCIKIT-learn for building and deploying machine learning models. Familiarity with cloud technologies, specifically AWS, for leveraging cloud-based resources for data storage, processing, and deployment. Experienced in using Python IDEs like Py-Charm and JUPYTER Notebook for development and collaboration purposes. Proficient in statistical analysis tools including SAS, SPSS, and STATA for conducting advanced statistical analyses. Skilled in version control systems like Git and SVN for collaboration, code management, and tracking changes. Familiarity with ETL tools such as SSIS and Alteryx for streamlining data integration, transformation, and loading processes. Responsive expert experienced in monitoring database performance, troubleshooting issues and optimizing database environment. Possesses strong analytical skills, excellent problem-solving abilities, and deep understanding of database technologies and systems. Equally confident working independently and collaboratively as needed and utilizing excellent communication skills. Organized and dependable candidate successful at managing multiple priorities with a positive attitude. Willingness to take on added responsibilities to meet team goals. Organized and dependable candidate successful at managing multiple priorities with a positive attitude. Willingness to take on added responsibilities to meet team goals. Detail-oriented team player with strong organizational skills. Ability to handle multiple projects simultaneously with a high degree of accuracy.

Overview

2
2
years of professional experience
1
1
Certification

Work History

Data Engineer

Analytics India Pvt Ltd
08.2021 - 07.2022
  • Collaborated with cross-functional teams to design and develop scalable data pipelines and ETL workflows, ensuring efficient data extraction, transformation, and loading processes
  • Implemented robust data integration solutions using Python, SQL, and ETL tools such as SSIS and Alteryx, resulting in improved data accuracy and reduced processing time
  • Worked closely with data scientists and analysts to understand data requirements and provide optimized data structures for analytics and reporting purposes
  • Developed and maintained data models, schemas, and database architecture using technologies like MongoDB, MySQL, Oracle, and Google Big Query
  • Implemented and optimized data quality checks, data validation rules, and error handling mechanisms to ensure data integrity and reliability
  • Utilized big data processing frameworks like Data bricks, Apache Spark, and Hadoop to handle large-scale data processing and analysis
  • Collaborated with infrastructure team to design and optimize data storage solutions, leveraging cloud technologies like AWS and Google Cloud Platform
  • Conducted performance tuning and optimization of data workflows and database queries, resulting in improved efficiency and reduced latency
  • Implemented and maintained version control systems such as Git and SVN for efficient code management and collaboration
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability
  • Generated detailed studies on potential third-party data handling solutions, verifying compliance with internal needs and stakeholder requirements
  • Analyzed complex data and identified anomalies, trends, and risks to provide useful insights to improve internal controls
  • Collaborated with system architects, design analysts and others to understand business and industry requirements
  • Environment: Python, SQL, SSIS, Alteryx, GIT, MongoDB, MYSQL, Oracle, Google Big query

Data Engineer

Arbot Analytics India Pvt - Ltd, solution's Pvt Ltd
08.2020 - 07.2021
  • Utilized Python and SQL for data manipulation, implementing data transformations, data cleaning, and data integration tasks
  • Conducted statistical analysis on datasets, identifying patterns, correlations, and outliers to provide valuable insights for decision-making
  • Assisted in implementation and fine-tuning of machine learning models using popular libraries such as SCIKIT learn, TensorFlow, and PyTorch
  • Contributed to data exploration and preprocessing tasks, including data cleaning, feature selection, and normalization, to ensure high-quality data inputs for analysis and modeling
  • Applied principles of linear algebra and calculus to perform mathematical operations and transformations on data matrices, facilitating advanced data analysis
  • Collaborated with cross-functional teams to understand business requirements and provide data-driven solutions to address specific challenges
  • Participated in NLP tasks, including text preprocessing, text representation (TF-IDF, word embeddings), sentiment analysis, and text classification using libraries such as NLTK and SPACY
  • Designed compliance frameworks for multi-site data warehousing efforts to verify conformity with state and federal data security guidelines
  • Analyzed complex data and identified anomalies, trends, and risks to provide useful insights to improve internal controls
  • Developed, implemented and maintained data analytics protocols, standards, and documentation
  • Designed advanced analytics ranging from descriptive to predictive models to machine learning techniques
  • Reviewed project requests describing database user needs to estimate time and cost required to accomplish projects
  • Established and secured enterprise-wide data analytics structures

Education

Master of Science - Information Technology

UNIVERSITY OF CINCINNATI
Cincinnati, OH
12.2023

Bachelor of Science - Mechanical Engineering

VIGNAN INSTITUTE OF TECHNOLOGY AND SCIENCE
INDIA
08.2021

Skills

  • TECHNICAL SKILLS:
  • Programming Languages: Python, R, SQL
  • Cloud Technologies: AWS
  • Reporting Tools: Power BI, Tableau
  • IDEs: Python IDEs (eg, PY-CHARM, JUYPTER Notebook Libraries: Pandas, NUMPY, BEAUTIFULSOUP, SEABOR, GGPLOT, MATPLOTLIB, PLOTY
  • Machine Learning Libraries: TensorFlow, KERAS, PYTORCH, SCIKIT-learn
  • STATISTICAL ANALYSIS TOOLS: SAS, SPSS, STATA
  • Big Data Processing Framework: Data-bricks, Apache Spark, Hadoop
  • Database Tools: MongoDB, MySQL
  • Oracle, Google Big Query
  • Version Control Systems: Git, SVN
  • ETL Tools: SSIS, Alteryx

Certification

AWS Academy Graduate- Aws Academy Cloud Foundations

Power BI

Advanced NOSQL for Data Science

Career Essentials in Generative AI by Microsoft and LinkedIn


Timeline

Data Engineer

Analytics India Pvt Ltd
08.2021 - 07.2022

Data Engineer

Arbot Analytics India Pvt - Ltd, solution's Pvt Ltd
08.2020 - 07.2021

Master of Science - Information Technology

UNIVERSITY OF CINCINNATI

Bachelor of Science - Mechanical Engineering

VIGNAN INSTITUTE OF TECHNOLOGY AND SCIENCE
Harshitha Tallapally