Results-driven Software Development Engineer with a Master’s in Data Science and hands-on experience building scalable data pipelines and analytical systems. Proficient in Python, SQL, cloud-native architectures, and DevOps practices.
Overview
2
2
years of professional experience
Work History
Software Development Intern
Shanti Switchgears Private Limited
Hyderabad, India
06.2022 - 05.2023
Engineered automation scripts in Python to streamline switchgear configuration data collection, improving data reliability.
Integrated data from SCADA systems into SQL Server databases, enabling unified monitoring of electrical parameters.
Developed dashboard prototypes using Power BI to visualize switchgear performance metrics for operations teams.
Software Development Intern
Trenzet Infra
Hyderabad, India
06.2021 - 05.2022
Developed Python scripts to automate data ingestion and preprocessing workflows, reducing manual effort and enhancing pipeline efficiency.
Built RESTful APIs with Flask to expose processed data for internal analytics tools without manual intervention.
Collaborated with the team to design and implement database schemas in MySQL for efficient data storage and retrieval.
Education
MS - Data Science
University of Maryland, Baltimore County
Baltimore
05.2024
B.Tech - Computer Science
GITAM University
Hyderabad
04.2022
Skills
Python
Java
R
SQL
Flask
Scikit-learn
TensorFlow
Transfer learning
Ensemble methods
AWS
Azure
Snowflake
BigQuery
Git
Docker
Jenkins
CI/CD
Agile
Scrum
Power BI
Tableau
Matplotlib
Projects
Disease Prediction Web App
Architected the backend in Django, defining RESTful endpoints for data ingestion, model training, and inference.
Implemented data preprocessing pipelines to clean and normalize patient records, handling missing values and outliers.
Integrated feature engineering modules (e.g. polynomial features, interaction terms) to improve model expressiveness.
Leveraged XGBoost’s built-in cross-validation and early stopping to optimize hyperparameters and prevent overfitting.
Containerized the application with Docker and deployed to AWS ECS, enabling scalable, fault-tolerant service.
Added role-based access control and encrypted sensitive data in transit and at rest to ensure HIPAA compliance.
Cricket T20 Analytics
Designed a scraper using BeautifulSoup and Selenium to collect ball-by-ball data, player stats, and match conditions from ESPN Cricinfo.
Performed k-means clustering on batting and bowling styles to segment players into strategic archetypes.
Implemented time-series analysis (rolling averages, exponential smoothing) to track form and momentum shifts.
Built interactive Power BI reports featuring slicers for pitch type, venue, and opposition, supporting scenario planning.
Developed a recommendation engine using collaborative filtering to suggest optimal batting orders based on opponent weaknesses.
Scheduled nightly data refreshes via Azure Data Factory to keep dashboards up to date.
Satellite Image Classification
Curated and labeled a custom dataset of multi-spectral satellite images, including augmentation (rotations, flips, color jitter).
Fine-tuned pre-trained VGG19 and InceptionV3 models, freezing early layers and retraining the top classifier layers.
Compared performance of transfer learning vs. training from scratch, documenting trade-offs in accuracy and training time.
Packaged the inference pipeline as a Flask microservice with GPU support, achieving sub-second response times.
Implemented logging and model versioning with MLflow to track experiments and facilitate rollbacks.
Visualized class activation maps (CAMs) to interpret model focus areas and validate decision reasoning.