Vinay Muthyam

Peoria, IL

Summary

Results-driven Data Analyst with extensive expertise in data analysis and engineering, dedicated to optimizing business processes and ensuring robust data governance. Proven track record of promoting sustainable community practices while maintaining compliance with technical and privacy standards. Recognized for a strong work ethic, adaptability, and exceptional interpersonal skills; collaborates effectively in team environments and masters new technologies quickly.

Work History

  • Worked on the application's complete Big Data flow in Python, from upstream data ingestion into HDFS through processing and analysis of the data in HDFS
  • Developed Python scripts for a regular expression (regex) project in the Hadoop/Hive environment
  • Developed Spark code using Scala and Spark-SQL/Streaming for faster testing and processing of data
  • Used Spark-Streaming APIs to perform necessary transformations and actions on the data obtained from Kafka
  • Used Spark SQL to load JSON data, create a SchemaRDD, and load it into Hive tables; handled structured data using Spark SQL
  • Used the AWS Glue catalog with a crawler to get data from S3 and perform SQL query operations
  • Worked with Airflow 1.8 (Python 2) and Airflow 1.9 (Python 3) for orchestration; familiar with building custom Airflow operators and orchestrating workflows with dependencies across multiple clouds
  • Used Flume interceptors to filter incoming data so that only the data needed for analytics was retained
  • Tested Apache Airflow for building high-performance batch and interactive data processing applications
  • Built strong relationships with cross-functional teams and external stakeholders, facilitating active collaboration and contributing to the success of data migration projects
  • Implemented real-time analytics on Cassandra data using the Thrift API
  • Designed column families in Cassandra, ingested data from RDBMS, performed transformations, and exported the data to Cassandra
  • Loaded data from the UNIX file system to HDFS using shell scripts
  • Environment: Python, Spark, Hadoop (HDFS, MapReduce), Hive, VMware, Cassandra, Sqoop, Airflow, Spring, Oozie, AWS services (EC2, S3), Unix shell scripting

Education

Master of Science - Computer Science

Campbellsville University
Louisville, KY
08-2024

Bachelor of Science - Electronics and Communication Engineering (ECE)

K L E F Deemed To Be University
Guntur, India
03-2022

Skills

  • Python
  • Java
  • C Programming
  • scikit-learn
  • Jupyter Notebook
  • Weka
  • GitHub
  • SQL
  • Spark
  • MySQL
  • RDBMS
  • Azure DevOps
  • Snowflake
  • Tableau
  • Excel
  • Data Architecture
  • ETL
  • PowerPoint
  • Data security
  • Data extraction
  • Data mining
  • Microsoft Office
  • Microsoft Excel
  • Organization
  • Communication
  • Active listening
  • Multitasking Abilities

Environment

  • Python
  • Spark
  • Hadoop (HDFS, MapReduce)
  • VMware
  • Sqoop
  • Airflow
  • Spring
  • AWS services (EC2, S3)
  • Unix Shell Scripting

Certification

  • Machine Learning
  • Introduction to Artificial Intelligence (AI) (IBM)
  • IT Fundamentals for Everyone (IBM)
  • Basic Spanish Vocabulary
  • Information Theory

Languages

English
Full Professional
Spanish
Limited
French
Limited
