Summary
Overview
Work History
Education
Skills
Certification
Interests
Timeline
Generic

Bharath Kumar Suroju

Data Engineer
Denton,TX

Summary

Enthusiastic Data Engineer and Data Analyst eager to contribute to team success through hard work, attention to detail and excellent organizational skills. Clear understanding of Big Data environments and Data analytics . Motivated to learn, grow and excel in Data Industry.

Overview

1
1
Certification
1
1
year of post-secondary education
4
4
years of professional experience

Work History

Data Engineer

Vista Applied Solutions Group Inc.
Plano, Texas
06.2020 - Current
  • Collaborated with Data Engineering team on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Involved in complete Big Data flow of the application starting from data ingestion from upstream to HDFS, processing and analyzing the data in HDFS.
  • Involved in running all the hive scripts through hive, Hive on Spark and some through Spark SQL using Python and Scala.
  • Extracted the data from RDBMS (Oracle) to HDFS using Sqoop and used HBase for lookups.
  • Used git to check-in, and checkout code changes and used Jenkins to build the code in the edge node.
  • Worked on EMR to process data in hql from S3 storage.

Research Assistant

University Of North Texas
Denton, Texas
08.2019 - 05.2020
  • Researched information on Veris community data of Verizon Security Research & Cyber Intelligence Center under Dr.Hsia-Ching (Carrie) Chang and on Behavior analytic citation data under Nicole Bank.
  • Developed Veris community database on AWS RDS (Postgres) and performed exploratory data analysis, data wrangling, analytics to identify the hidden trends, patterns, insights using Python packages like Pandas, Numpy, Matplotlib and Seaborn.
  • With respect to Behavior analytic citation data, I extracted references from all the volumes of 9 flagship journals identified by our professor which are exclusively publishing behavior analytic research and supported by many behavior analytic organizations.
  • The citation data was extracted from various webpages through beautiful soup and done python based data manipulation, text analytics through NLP packages like NLTK, Spacy, Text blob.

Data Engineer

Infosys Ltd.
Hyderabad, Telangana
11.2015 - 12.2018
  • Developed Spark core and Spark SQL scripts for faster data processing.
  • · Used Hive to analyze the partitioned and bucketed data and compute various metrics for reporting.
  • · Implemented, tuned, and tested the model on AWS Lambda with the best performing algorithm and parameters.
  • Worked with NoSQL databases like MongoDB in making MongoDB tables to load expansive arrangements of semi structured data.
  • Integrated Kafka-Spark streaming for high efficiency throughput and reliability
  • Developed various applications in Python for reducing MIPS for Zos system.
  • Assessment of the inventory using Micro Focus Enterprise Analyzer, this was done to analyze the application inventory and suggest the modernization approach suitable whether to Re-host, Rewrite, Retire or Renew.

Education

Master of Science - Data Science

University Of North Texas
Denton, TX
01.2019 - 05.2020

Bachelor of Engineering Technology -

Jawaharlal Nehru Technological University

Skills

Programming SkillsPython, PySpark, Scala, Core Java, Linux Shell Scripting, HTML

undefined

Certification

AWS Certified Solutions Architect – Associate

Interests

Social activities, Volunteering/Fund raiser for the Friends Foundation Orphanage, Nature lover, Travelling

Timeline

Data Engineer

Vista Applied Solutions Group Inc.
06.2020 - Current

Research Assistant

University Of North Texas
08.2019 - 05.2020

Master of Science - Data Science

University Of North Texas
01.2019 - 05.2020

Data Engineer

Infosys Ltd.
11.2015 - 12.2018

Bachelor of Engineering Technology -

Jawaharlal Nehru Technological University
Bharath Kumar SurojuData Engineer