Summary
Overview
Work History
Education
Skills
Timeline
Generic

Surya Tej Katamreddy

Austin,USA

Summary

MLOps Engineer with 11+ years of experience in data engineering and machine learning operations. Skilled in building scalable MLOps solutions on cloud platforms, creating automated CI/CD pipelines, managing model lifecycle with MLflow, and ensuring robust data pipelines for seamless model training and deployment. Strong background in Databricks, Azure, and AWS, with expertise in Spark, Kafka, and real-time data processing.

Overview

14
14
years of professional experience

Work History

Data Engineer, MLOps Implementation

Optum
11.2020 - 12.2023
  • Implemented MLOps workflows with Azure Data Factory and Databricks, establishing automated data processing and model training
  • Integrated MLflow and Model Registry, streamlining model tracking, experiment management, and inference for production models
  • Set up CI/CD in Azure DevOps for automatic model updates, container deployments, and API configurations for machine learning services
  • Supported low-latency data processing for models through real-time data ingestion using Kafka and Spark

MLOps Engineer

CCS Medical
01.2023 - 05.2023
  • Orchestrated end-to-end MLOps pipelines using Databricks, enabling data flow, feature engineering, model training, and deployment
  • Automated model deployment with CI/CD pipelines in Azure DevOps, enhancing model reproducibility and tracking with MLflow
  • Developed REST API endpoints for model inference, allowing real-time predictions and external application integration
  • Built robust data pipelines with Azure Data Lake and Spark Streaming for real-time data ingestion and transformation

Senior Data Engineer

HealthPartners
05.2018 - 10.2020
  • Developed data pipelines in AWS using Glue, Lambda, and Redshift for machine learning models
  • Enhanced model performance through optimized feature engineering with Spark and Redshift
  • Implemented CI/CD on AWS for model deployments and used CloudWatch for monitoring data pipeline health and model performance

ETL Developer

TIAA-CREF
01.2015 - 09.2016
  • Migrated MapReduce jobs to Spark to improve data processing efficiency
  • Integrated Kafka with Spark Streaming for real-time data ingestion and transformation
  • Built ETL solutions with SSIS, Spark, and Hadoop, ensuring efficient data movement to support analytics and reporting

ETL Developer

Trianz
04.2014 - 01.2015
  • Created end-to-end ETL workflows using Talend and SQL for data migration and transformation
  • Designed and executed ETL solutions from various sources to data marts, utilizing Talend and complex SQL transformations

ETL Developer

DXC Technology
06.2010 - 04.2014
  • Developed SSIS packages for ETL processes and data transformations, ensuring smooth data flow across systems
  • Enhanced SSIS performance through package optimization, error handling, and incremental load strategies

Education

Master’s - Computer And Information Systems

SMUMN
08-2016

Bachelor’s - Computer Science

SVEC
03-2010

Skills

  • Python
  • SQL
  • Scala
  • Bash
  • Databricks
  • MLflow
  • Azure Machine Learning
  • CI/CD
  • Azure DevOps
  • GitHub Actions
  • Azure Data Lake
  • Azure Data Factory
  • Azure Synapse
  • Spark
  • Hadoop
  • AWS
  • EMR
  • Redshift
  • Apache Kafka
  • Spark Streaming
  • Azure Logic Apps
  • Sqoop
  • Scikit-learn
  • TensorFlow
  • PyTorch
  • Model Registry
  • Relational databases
  • Advanced analytics
  • SQL transactional replications
  • Python Programming
  • Data Modeling

Timeline

MLOps Engineer

CCS Medical
01.2023 - 05.2023

Data Engineer, MLOps Implementation

Optum
11.2020 - 12.2023

Senior Data Engineer

HealthPartners
05.2018 - 10.2020

ETL Developer

TIAA-CREF
01.2015 - 09.2016

ETL Developer

Trianz
04.2014 - 01.2015

ETL Developer

DXC Technology
06.2010 - 04.2014

Master’s - Computer And Information Systems

SMUMN

Bachelor’s - Computer Science

SVEC
Surya Tej Katamreddy