Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Sameer Devulapalli

Summary

Data Engineer with a track record of developing scalable ETL processes and optimizing machine learning models through Python and Azure Data Factory. Proven expertise in troubleshooting and enhancing data pipeline performance, delivering impactful insights through strong analytical skills. Committed to leveraging data to drive decision-making and improve operational efficiencies.

Overview

5
5
years of professional experience
1
1
Certification

Work History

Data Engineer Intern

SmartBots AI
Dallas, TX
10.2023 - Current
  • Developed scalable ETL processes using Azure Data Factory to ingest, cleanse, and transform large datasets from multiple sources, including customer interaction logs, CRM systems, and billing data.
  • Assisted in preprocessing and feature engineering of raw datasets using Python and PySpark, contributing to the development of machine learning churn prediction models.
  • Acted as the first line of support for troubleshooting issues related to data ingestion, API integrations, and pipeline performance, ensuring minimal downtime and quick resolution.
  • Created batch pipelines in Azure Data Factory (ADF) by configuring linked services, integration runtime, to extract, transform, and load data from different sources into Azure Data Lake.
  • Assisted in creating real-time streaming pipelines using Azure IoT Hub, Azure Event Hub, and Spark structured streaming.
  • Collaborated with data scientists to deploy and automate the retraining of the churn prediction model using Azure ML, improving predictive accuracy.
  • Implemented and fine-tuned logging applications (e.g., Azure Monitor) to aggregate customer interaction logs and pipeline errors, enabling proactive detection and resolution of system vulnerabilities and performance bottlenecks.
  • Implemented diagnostic tools and automated scripts to identify and troubleshoot data discrepancies, ensuring timely resolution of issues impacting customer experience.

Data Engineer/ Business Analyst Intern

PALNIES
Hyderabad, India
12.2019 - 06.2024
  • Assisted in designing and implementing data pipelines for collecting and analyzing customer support data, including web and chat interactions.
  • Assisted in analyzing data flow disruptions and system performance using diagnostic tools, ensuring the operation of automated data pipelines, and delivering accurate insights to business teams.
  • Worked closely with clients and business units to gather requirements, troubleshoot issues with data pipelines, and ensure timely support for system issues.
  • Implemented automated security audits and system health checks across servers, detecting unauthorized access or vulnerabilities in real time, ensuring the reliability and performance of customer data pipelines.
  • Used JavaScript and SQL queries to troubleshoot data issues, assisting in resolving complex technical problems related to customer support systems

Education

Master of Science - Management

New Jersey Institute of Technology
Newark, NJ

Skills

  • Programming languages: Python, Java, and JavaScript
  • Databases and Query Languages: SQL, SQL Server, and Azure SQL Database
  • Cloud Technologies: Microsoft Azure (Azure Data Factory, Azure Synapse Analytics, Azure Data Lake, Azure Monitor, Azure Databricks)
  • Big Data Technologies: Apache Spark, Apache Airflow
  • Data Warehousing: SQL Server, Azure Synapse Analytics
  • Problem-Solving and Troubleshooting: Debugging tools, Diagnostic scripting (Python, JavaScript)

Certification

  • Agile Organization
  • Data – Driven Decisions with Power BI
  • Scrum Master Certification: Scrum Methodologies
  • Introduction to Machine Learning

Timeline

Data Engineer Intern

SmartBots AI
10.2023 - Current

Data Engineer/ Business Analyst Intern

PALNIES
12.2019 - 06.2024

Master of Science - Management

New Jersey Institute of Technology
Sameer Devulapalli