Summary
Overview
Work History
Education
Skills
Timeline
Generic

Jayateja V

Dallas,TX

Summary

Accomplished Data Engineer with 4 years of experience building scalable data pipelines, ETL workflows, and real-time streaming systems across AWS and Azure. Proficient in Apache Spark, Kafka, Airflow, and cloud-native services (AWS Glue, Azure Data Factory). Expertise in Redshift, Synapse, and Power BI for data warehousing and BI. Experienced with HIPAA/GDPR-compliant architectures and cross-functional collaboration to deliver data-driven insights.

Overview

5
5
years of professional experience

Work History

Data Engineer

Amdocs Inc
10.2024 - Current
  • Built scalable, serverless pipelines using AWS Glue, Lambda, and Kinesis to process telecom billing data in near-real-time, reducing latency by 40%
  • Designed partitioned Redshift schemas and integrated API Gateway endpoints, improving downstream analytics performance by 35%
  • Developed validation layers in Python, cutting billing discrepancies by 30% and ensuring regulatory compliance with telecom standards
  • Automated ingestion-to-delivery pipeline for CRM and CDR sources, reducing manual interventions by 90%
  • Established CloudWatch alerts and S3 backup workflows, achieving >99.9% data reliability and meeting retention policies
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • .Fine-tuned query performance and optimized database structures for faster, more accurate data retrieval and reporting.

Data Engineer

Dish Networks
01.2024 - 09.2024
  • Led the end-to-end migration of legacy systems to Azure Data Lake, transferring over 20TB of structured and unstructured data with zero downtime
  • Automated complex ETL pipelines using Azure Data Factory, reducing manual data processing time by 60%
  • Architected a centralized data lake and integrated Synapse and Databricks, accelerating analytics delivery by 45%
  • Built real-time ingestion pipelines with Event Hubs and Stream Analytics, enabling marketing and operational dashboards with
  • Designed and deployed Power BI dashboards with Dataflows and Synapse backend, improving business reporting turnaround time by 50%
  • Enforced GDPR-compliant access control via Azure AD RBAC, enhancing security and auditability across all data assets

Data Engineer

Ulab Systems
03.2023 - 12.2023
  • Spearheaded the design of fault-tolerant ETL pipelines using Apache Airflow and Python to ingest and normalize data from EHR, lab systems, and internal tools—reducing pipeline failures by 40%
    • Architected a real-time data ingestion framework using Kafka, enabling immediate capture of patient events and improving system responsiveness by 25%
    • Developed secure, HIPAA-compliant data models in PostgreSQL, incorporating encryption, masking, and access controls, supporting audits with zero compliance issues
    • Enabled Change Data Capture (CDC) from clinical databases, reducing full ETL runtime by 35% and ensuring timely delivery of refreshed datasets
    • Built Python-based APIs and microservices to expose curated datasets for reporting and dashboards, improving analyst productivity and reducing reporting lag from 2 hours to under 15 minutes
    • Integrated Elastic Stack and Airflow alerting for end-to-end observability, increasing pipeline uptime to 99.8% and reducing incident resolution time by 60%

Data Analyst

Propack Industries
10.2020 - 08.2022
  • Developed automated Tableau dashboards and Excel reports for claim anomaly detection, reducing fraudulent claims by 20%
  • Automated daily metric reporting from Oracle SQL, improving operational reporting efficiency by 40%
  • Collaborated with dev teams to design JDBC-integrated local databases, reducing front-end response time by 30%
  • Implemented machine learning models (Random Forest, Logistic Regression) to predict fraudulent insurance claims with >85% accuracy
  • Contributed to EDI workflow optimizations, preventing claim duplication errors and saving $35M in failed transactions
  • Authored technical documentation and process manuals, accelerating onboarding and knowledge transfer for support teams

Education

M.S. - Advanced Data Analytics

University of North Texas
Dallas, TX
12.2023

B.Tech - Mechanical Engineering

B.V. Raju Institute of Technology
03-2021

Skills

    Programming Languages:
    Python, SQL, Shell Scripting, R

    Cloud Platforms:
    AWS (Glue, Lambda, Redshift, S3, Kinesis, API Gateway),
    Azure (Data Factory, Synapse Analytics, Databricks, Data Lake Storage, Event Hubs, Stream Analytics, Active Directory, Security Center)

    Big Data & ETL Frameworks:
    Apache Spark (PySpark), Apache Kafka, Apache Airflow, AWS Glue, Azure Data Factory, Change Data Capture (CDC), WTX

    Data Warehousing:
    Amazon Redshift, Azure Synapse, PostgreSQL, SQL Server, Oracle SQL

    Streaming & Real-Time Processing:
    Apache Kafka, Amazon Kinesis, Azure Event Hubs, Azure Stream Analytics

    APIs & Microservices:
    REST APIs, Python-based Microservices, AWS API Gateway

    DevOps & Monitoring:
    Git, Linux, Azure Monitor, Airflow Monitoring, Elastic Stack

    Security & Compliance:
    HIPAA, GDPR, RBAC, Data Encryption, Data Masking, Access Control, Azure Security Center

    Business Intelligence & Visualization:
    Power BI, Tableau, Excel

    Machine Learning & Analytics:
    Spark MLlib, Scikit-learn (Random Forest, Logistic Regression), Exploratory Data Analysis (EDA), Anomaly Detection

Timeline

Data Engineer

Amdocs Inc
10.2024 - Current

Data Engineer

Dish Networks
01.2024 - 09.2024

Data Engineer

Ulab Systems
03.2023 - 12.2023

Data Analyst

Propack Industries
10.2020 - 08.2022

M.S. - Advanced Data Analytics

University of North Texas

B.Tech - Mechanical Engineering

B.V. Raju Institute of Technology
Jayateja V