Summary
Overview
Work History
Education
Skills
Academic Projects
Certifications
Timeline
Generic

Vishnu Vardhan Pachunoori

Plano,TX

Summary

Data Engineer with 1.5 years of experience as a data engineer planning, constructing, and optimising infrastructure and data pipelines. knowledgeable about a wide range of data technologies and programming languages, with practical experience in database administration, ETL development, and data modelling. I'm eager to use my knowledge and experience to support creative ideas and promote corporate success.

Overview

3
3
years of professional experience

Work History

Big Data Developer Trainee

Solwin Technoloies
Plano, TX
01.2024 - Current
  • Participated in an intensive training program focused on big data technologies, including Hadoop, Spark, Kafka, Hive, and HBase, gaining hands-on experience with each.
  • Developed and implemented data processing pipelines using Python, leveraging distributed computing frameworks like Spark to analyze large-scale datasets efficiently.
  • Collaborated with experienced professionals to understand business requirements and translate them into technical solutions.

Data Engineer

SMG Infotech
Hyderabad, Telangana
01.2021 - 07.2022
  • Designed and implemented a distributed data processing system using Apache Hadoop, increasing data processing speed by 50%.
  • Optimized SQL queries and database performance, reducing query execution time by 40%.
  • Developed and maintained data models and schemas for efficient storage and retrieval of structured and unstructured data.
  • Designed and implemented data warehousing solutions on cloud platforms, enabling flexible and cost-effective storage and analysis of large datasets.
  • Collaborated with software engineers to integrate data processing pipelines within a cloud infrastructure resulting in a 40% reduction in infrastructure costs.

Education

Master of Science - Computer Science

University of Central Missouri
Warrensburg, MO
05-2024

Bachelor of Science - Information Technology

Gokaraju Rangaraju Institute of Engineering & Tech
Hyderabad
05-2022

Skills

  • SQL and Database Management
  • Python
  • Data Warehousing
  • ETL (Extract, Transform, Load) Tools
  • Hadoop
  • Apache Spark
  • Database Management
  • Machine Learning Algorithms
  • AWS, Google Cloud Platforms
  • Mathematical Logic
  • Critical thinking
  • Analytical Skills

Academic Projects

  • Quantifying COVID-19 Content: We collect the news content from various online sources in the form of comments. This model processes the data from an imported dataset (comments) and results in a report analytics. We pass this to a model which classifies the data as pro vax and anti vax.

overall, this approach shows that a machine-learning algorithm, the LDA algorithm, plausible topics within collections of posts from online communities surrounding the vaccine and COVID-19 debate. In addition to being able to handle large quantities of data, its results emerge quickly using statistical grouping techniques, instead of having to rely on potentially biased, slow and costly human labeling.

  • Android Malware Detection: •Android platform due to open sourcecharacteristic and Google backing has the largest global market share. It has drawn the attention of cyber criminals operating particularly through wide distribution of malicious applications. This paper proposes an effectual machine-learning based approach for Android Malware Detection making use of evolutionary Genetic algorithm for discriminatory feature selection. Selected features from Genetic algorithm are used to train machine learning classifiers and their capability in identification of Malware before and after feature selection is compared. The experimentation results validate that Genetic algorithm gives most optimized feature subset helping in reduction of feature dimension to less than half of the original feature-set. Classification accuracy of more than 94% is maintained post feature selection for the machine learning based classifiers, while working on much reduced feature dimension, thereby, having a positive impact on computational complexity of learning classifiers.

Certifications

  • Certification in The Joy of Computing Using Python in NPTEL(National Programme on Technology Enhanced Learning)
  • Online Certification in DevNet Associate(Cisco Networking Academy)

Timeline

Big Data Developer Trainee

Solwin Technoloies
01.2024 - Current

Data Engineer

SMG Infotech
01.2021 - 07.2022

Master of Science - Computer Science

University of Central Missouri

Bachelor of Science - Information Technology

Gokaraju Rangaraju Institute of Engineering & Tech
Vishnu Vardhan Pachunoori