Summary
Overview
Work History
Education
Skills
Certification
Websites
Timeline
Generic
PRAVEEN JUTUR

PRAVEEN JUTUR

Frederick,MD

Summary

Seasoned Site Reliability Engineer with expertise in AWS, Azure, and GCP architecture. Enhances cloud security and observability through CI/CD automation and container orchestration. Experienced in developing automated ML pipelines and optimizing cloud infrastructure to ensure real-time observability and security compliance.

Overview

19
19
years of professional experience
4
4
Certifications

Work History

Site Reliability Engineer | DevSecOps Engineer

Amtrak
Washington, District of Columbia
08.2025 - Current
  • Engineered automated ML training pipelines using Python and AWS Step Functions to orchestrate data preprocessing and model training on AWS SageMaker, replacing manual script execution.
  • Designed and implemented end-to-end database activity automation using AWS Lambda, Step Functions, and Python scripts, streamlining maintenance processes.
  • Leading the deployment and orchestration of Apache Airflow on AWS (MWAA) to centralize the scheduling and execution of complex ETL pipelines, ensuring secure and scalable data processing.
  • Leveraged Amazon Bedrock to integrate generative AI capabilities into an internal application, utilizing foundation models to automate content summarization and generation.
  • Built observability solutions using AWS CloudWatch, Splunk, and OpenTelemetry; developed custom Python telemetry packages for application tracing.
  • Architected and deployed production-grade observability solutions across multi-cluster Kubernetes (EKS) environments, integrating OpenTelemetry collectors and daemonsets to ensure 100% visibility into containerized workloads and ephemeral infrastructure.
  • Automated lifecycle management of containerized applications using Helm and GitLab CI, enabling zero-downtime releases for mission-critical services through Canary and Rolling deployment strategies.
  • Applied comprehensive technical understanding of cloud-native ecosystems to design fault-tolerant, scalable EKS architectures.

Cloud Infrastructure Engineer | SRE

Morgan Stanley
Baltimore, Maryland
06.2021 - 09.2025
  • Developed high-performance inference services by integrating EKS-hosted Python microservices with AWS SageMaker endpoints to provide real-time security analytics and fraud detection.
  • Automated the deployment of GenAI-ready serverless infrastructure using AWS CloudFormation, defining reusable templates for Lambda, API Gateway, and granular IAM policies to ensure reproducible, least-privilege environments for AI-driven operational tools.
  • Engineered real-time observability solutions using Python and AWS CloudWatch to monitor SageMaker endpoint performance and detect statistical data drift in production banking transactions.
  • Standardized model serving and observability by developing custom Python middleware to collect and route OpenTelemetry metrics from SageMaker endpoints to centralized security monitors.
  • Developed automated model validation gatekeepers within CI/CD pipelines to programmatically verify SageMaker artifact integrity and performance before promotion to production EKS environments.
  • Implemented and audited IAM policies to enforce least-privilege access, ensuring service account permissions met strict compliance and security standards.
  • Maintained operational excellence during critical production failures by adapting rapidly and prioritizing effective resolution strategies.
  • Spearheaded risk and issue management by architecting self-healing infrastructure that mitigated manual incident interventions.
  • Developed and maintained a serverless application backend using AWS Lambda and API Gateway, processing thousands of requests daily with high availability and low latency.
  • Developed and maintained Infrastructure as Code (IaC) solutions using AWS CloudFormation and Terraform to provision and manage CodePipeline resources, ensuring pipeline consistency and scalability across microservices stacks
  • Automated provisioning and configuration of AWS infrastructure using Terraform, including EC2, VPC, IAM, RDS, and S3 resources, enabling consistent, repeatable, and version-controlled deployments across multiple environments.
  • Architected a robust alerting system in Python that leverages Elasticsearch aggregations to identify anomalies in transaction patterns and trigger automated model investigations.
  • Integrated Terraform with CI/CD pipelines (GitLab CI/Jenkins) to support automated infrastructure deployments and drift detection, improving deployment velocity and reducing manual overhead.
  • Engineered high-availability, active-active infrastructure across multi-region Kubernetes clusters, utilizing Multi-AZ deployments and automated failover mechanisms to support mission-critical workloads.
  • Standardized enterprise observability by deploying OpenTelemetry collectors as a vendor-agnostic data layer, enabling seamless trace correlation and metric ingestion across distributed architectures.
  • Led application deployment strategies using Helm, GitOps workflows, and declarative manifests; orchestrated zero-downtime rolling and canary deployments to minimize service disruption and enable rapid, reliable delivery of new features
  • Integrated ServiceNow Change Management into GitLab CI and Jenkins pipelines to enforce automated governance, ensuring production deployments met audit-readiness standards.
  • Automated API Gateway deployment and version management using AWS CloudFormation and CI/CD pipelines, ensuring seamless updates and rollbacks for API changes

Devops Architect

Merck Pharmaceuticals
Branchburg Township, New Jersey
11.2019 - 06.2021
  • Provisioned and managed persistent storage for stateful workloads using Persistent Volumes (PV), Persistent Volume Claims (PVC), dynamic storage classes, and cloud-native provisioners, ensuring data durability and performance for databases and critical services
  • Defined and enforced Kubernetes resource quotas and limits at the namespace level using YAML-based configurations, actively monitored resource consumption, and iteratively tuned quotas to optimize utilization and prevent resource starvation or overprovisioning
  • Established comprehensive monitoring and alerting for clusters and workloads using Prometheus and Grafana, integrating with cloud-native logging and alerting systems to enable real-time visibility, proactive incident response, and SLO/SLA compliance
  • Configured private API Gateway endpoints with VPC integration and custom domain names for secure internal access to backend services.
  • Built Splunk and Datadog dashboards and alerts for full-stack observability.
  • Integrated Lambda with DynamoDB Streams for real-time event processing and automated remediation workflows.
  • Configured Route Tables in Amazon VPC and set up Application Load Balancer (ALB)/Network Load Balancer (NLB) with Target Groups for external traffic distribution. Leveraged Auto Scaling Groups and Multi-AZ deployments to ensure high availability and fault tolerance for applications
  • Designed cross-account ECR pipeline using Terraform null resources, securing Docker deployments with vulnerability scanning that blocked 150+ critical CVEs during image promotionReduced DynamoDB costs by 25% by optimizing partition keys, enabling on-demand capacity mode, and implementing TTL for data lifecycle management.
  • Automated DynamoDB backups & compliance using Point-in-Time Recovery (PITR) and AWS Backup, meeting strict RPO/RTO requirements.
  • Integrated DynamoDB with Lambda for real-time processing (e.g., user activity tracking), leveraging DynamoDB Streams and EventBridge

DevOps Engineer

Bloomberg LP
New York, New York
04.2017 - 11.2019
  • Developed Ansible playbooks in YAML to automate provisioning of infrastructure and Kubernetes clusters.
  • Created Kubernetes resources (deployments/pod/services) using YAML files.
  • Provisioned new instances in AWS, including S3 Storage Services and AWS EC2.
  • Created AWS orchestration scripts and designed new back-end services, expanded AWS infrastructure, and mentored team members.
  • Set up application monitoring using Prometheus and federated Thanos.
  • Worked on CI/CD Pipeline (Groovy) using Git, Bit Bucket, Jenkins and Kubernetes.
  • Managed Helm charts and shared applications as Kubernetes charts; monitored applications with Prometheus and created reproducible infrastructure as code using Ansible and YAML.
  • Created robust CI/CD pipelines using Jenkins and integrated cloud orchestration scripts.

DevOps Engineer

Coca-Cola
Atlanta, Georgia
11.2013 - 02.2017
  • Developed POC for on-premise to AWS migration utilizing EC2, S3, ELB, Auto-Scaling, VPC, Route 53, Cloudwatch.
  • Created the automated build and deployment process for application and led up to building a continuous integration system for all our products.
  • Instrumental in developing Jenkins build pipeline jobs using groovy for Node.js, and Java-based applications.
  • Participated in weekly meetings with cross-functional teams to address blockers, discuss feature enhancements, and onboard new projects.
  • Led AWS migration POC and built Jenkins pipeline automation using Groovy for Java/Node.js apps.

Software Engineer

BMW Manufacturing Corporation
Greenville, South Carolina
10.2011 - 10.2013
  • Led design and implementation of WM and PP solutions, enhancing small parts warehouse operations.
  • Developed WRICEFs to interface with external and boundary systems, supporting Just In Time (JIT) delivery.
  • Created equipment availability report to enable service centers to respond effectively to customer requests.
  • Configured and customized SAP MM functionalities including Master Data Management, Purchasing Management,
  • Inventory Management, and Material Valuation, resulting in improved operational efficiency and accuracy

Software Engineer

Infosys Technologies
Mysore, India
06.2007 - 01.2010
  • Implemented BAPI_PO_CHANGE to enhance functionality of purchase order processing, facilitating smoother transactions.
  • Modified automatic PO close program to list and close purchase orders meeting specified goods receipt percentage, improving order management efficiency.
  • Remediated Order Maintenance Audit report for compatibility with upgraded version, ensuring accurate data reporting.
  • Developed interfacing programs to copy VAS master data from source client to a target client

Education

MBA - Management Information Systems

University of Arkansas
Little Rock, AR
05-2011

Skills

  • Cloud infrastructure engineering
  • Cloud Security & Compliance
  • CI/CD automation
  • Containerization & Orchestration
  • Cloud migration
  • Infrastructure automation
  • MLOps
  • Observability & Monitoring
  • AWS Azure GCP architecture
  • Programming & Scripting: Python, Bash/Shell Scripting

Certification

AWS Certified Security – Specialty

Timeline

Site Reliability Engineer | DevSecOps Engineer

Amtrak
08.2025 - Current

Cloud Infrastructure Engineer | SRE

Morgan Stanley
06.2021 - 09.2025

Devops Architect

Merck Pharmaceuticals
11.2019 - 06.2021

DevOps Engineer

Bloomberg LP
04.2017 - 11.2019

DevOps Engineer

Coca-Cola
11.2013 - 02.2017

Software Engineer

BMW Manufacturing Corporation
10.2011 - 10.2013

Software Engineer

Infosys Technologies
06.2007 - 01.2010

MBA - Management Information Systems

University of Arkansas
PRAVEEN JUTUR