Summary
Overview
Work History
Education
Skills
Projects
Timeline
Generic

Ajay Sagar Bokka

Seattle,WA

Summary

I'm a Software Development Engineer with over 2.5 years of experience, contributing to AWS CloudTrail Insights by delivering scalable and efficient solutions. I focus on automation, system optimization, and operational excellence, ensuring reliability and performance at scale. I’ve worked on enhancing system efficiency, automating workflows, and improving service resilience to reduce manual effort, and streamline operations. Passionate about solving complex engineering challenges, improving system architecture, and driving impactful innovations in fast-paced environments.

Overview

3
3
years of professional experience

Work History

Software Development Engineer

AWS - CloudTrail
Seattle, WA
10.2022 - Current
  • Built and launched the CloudTrail Insights feature in 5+ AWS regions, performing rigorous testing and validation, leading to a 60% increase in customer adoption.
  • Designed and implemented real-time event processing pipelines using Apache Flink, Amazon Kinesis, and DynamoDB, ensuring scalable, fault-tolerant, and deterministic data processing.
  • Automated on-call operational tasks, reducing manual interventions, and cutting engineering effort by 40%, leading to a 15% decrease in incident response time.
  • Addressed critical memory usage spikes in high-volume regions, which led to a daytime severity level 2 incident, posing a risk of memory exceeding 100%, potentially dropping attribution data, and impacting insights delivery. I took immediate action by increasing the severity of the ticket to accelerate resolution. Collaborated with the team to reassess shard allocation, considering prior increases in high-volume regions, and successfully stabilized memory usage to prevent data loss.
  • Upgraded all components to Java 17, significantly enhancing efficiency and optimizing memory usage across regions. This migration improved HeapSpace and MemoryUsage, ensuring better resource utilization, and system stability. Successfully delivered a seamless transition with enhanced performance.
  • Automated the handling of production resources during large-scale events by onboarding multiple Python and Bash scripts, ensuring system safety, and reducing manual intervention effort by 40%.
  • Enhanced Model Configuration with density information to improve insight compaction in the SSF algorithm for ACR Dense. Modified key components to pass DatasetType information, and refine the compaction logic, ensuring more accurate insight generation and processing.
  • Acted as Scrum Master, leading sprint planning, coordinating with multiple teams, and addressing on-call incidents to maintain service reliability.

Account-Level Backfilling System Enhancement.

  • Implemented message handling components to differentiate between API-level and account-level backfill requests, ensuring a smooth transition during the system upgrade.
  • Engineered test cases and performed load testing to validate system behavior under various load conditions.
  • Integrated monitoring solutions to track key metrics, including: message processing rates, backfill completion times, error rates, and system health indicators.
  • Extended the existing codebase to support new message formats while maintaining backward compatibility.

Real-Time Event Processing with Apache Flink and Amazon Kinesis.

  • Developed a real-time data pipeline using Apache Flink to process millions of events from Amazon Kinesis.
  • Implemented event-time processing with tumbling windows for correct ordering and late-event handling.
  • Applied distributed staggering to prevent simultaneous window closures, reducing peak computational load.
  • Used Amazon DynamoDB for storage, and optimized batch writes for cost efficiency.
  • Ensured fault-tolerant processing using Flink checkpointing and recovery mechanisms.

Region Expansion:

  • Manually built and launched the CloudTrail Insight feature in 5+ AWS Regions.
  • Performed infrastructure deployments and built components with modified system configuration as per the region's guidelines.
  • Validated the Insights Workflow and public APIs using Bash scripts.
  • Automated the region expansion workflow to reduce engineers' effort by 40%.

Education

Master of Science - Business Analytics

Univeristy of Texas At Arlington
Arlington, TX
08-2022

Bachelor of Science - B.Tech in Electrical And Electronics Engineering

Mahatma Gandhi Institute of Technology
Hyderabad, India
09-2020

Skills

  • Programming Languages: Python (Pandas, Scikit-learn, TensorFlow, Keras), Java, Ruby, Nodejs, TypeScript, YAML, R, MATLAB
  • Frameworks & Tools: Spring Boot, JUnit, Mockito, Jenkins, GitHub, Docker, Kubernetes
  • Databases: MySQL, SQLite, MongoDB, DynamoDB
  • Cloud Platforms: Hands-on experience with AWS (S3, Lambda, SQS, EC2, CloudWatch, Kinesis, ECS, ElastiCache); Familiar with GCP and Azure
  • DevOps & CI/CD: Jenkins, GitHub Actions, Bamboo, Linux, Docker, Kubernetes
  • Big Data & Stream Processing: Apache Flink (real-time processing), Familiar with Kafka
  • Development Practices: Test-driven development (TDD), Agile methodologies (Scrum), Code reviews, Functional and performance testing
  • Software Engineering: Object-Oriented Programming (OOP), Debugging, Microservices, Distributed systems
  • Data Visualization & Analytics: Tableau, Power BI, Talend, Microsoft Excel, SAP, SAS
  • Machine Learning & Data Science: Model development, Data modeling, Preprocessing

Projects

Regression Data Challenge in Kaggle     

  • To predict the target variable that captures the medical condition of a patient.
  • Implemented regression techniques such as decision trees, random forest and external gradient boosting and got better accuracy with random forest technique.

Marketing Campaign-Insights and Solutions

  • Finding the factors effecting the most the previous campaign and finding the week spots from the data.
  • Visualizing the data to depict the average customer choices and finding there attributes.
  • Providing insights from the obtained results and finding new spots to focus.

Web Scrapping and Data Visualization Using Python

  • Extracting details of different sub levels from the website.
  • Merging the data ,creating dictionaries and visualizing the data are performed.
  • Created a database sheet using SQL in python.

Image Detection from Fashion MNIST- Dataset Using Python and Machine Learning Models

  • Training and testing the data and performing dimension reductions and preprocessing.
  • Performing different machine learning models to analyze.
  • Use Own images to predict with the use of the best models.
  • Modules Used : Scikit-learn, matlplotlib, pandas, numpy, Pillow, CV .

Covid19 Data Analysis using Python

  • Extracting COVID19 datasets, preparing it for analysis and aggregating rows.
  • Merging datasets and finding correlations among the data
  • Visualizing the analysis results using Seaborn

Timeline

Software Development Engineer

AWS - CloudTrail
10.2022 - Current

Master of Science - Business Analytics

Univeristy of Texas At Arlington

Bachelor of Science - B.Tech in Electrical And Electronics Engineering

Mahatma Gandhi Institute of Technology
Ajay Sagar Bokka