Summary
Overview
Work History
Education
Skills
Timeline
Generic

SAI TEJA

Richmond,VA

Summary

Senior DevOps/Cloud Engineer with 10+ years of experience in cloud infrastructure (AWS, Azure, GCP), CI/CD automation, and Kubernetes.
Expert in Terraform, Jenkins, Python scripting, and BI platform operations (Tableau, Looker, QlikSense).
Proven track record in securing, scaling, and optimizing enterprise cloud and BI environments.

Overview

10
10
years of professional experience

Work History

SRE/DevOps/ BI Operations Engineer

Expedia Group
10.2020 - Current
  • Monitored and managed Tableau extract refreshes and Looker PDT schedules to provide timely and accurate delivery of dashboards to business stakeholders.
  • Troubleshoot Looker dashboards by identifying broken LookML references, permission issues, or stale PDTs, working with data teams to resolve root causes.
  • Provided day-to-day support for Tableau and Looker users, including access requests, report performance issues, and dashboard version control.
  • Performed weekly Qlik Sense platform maintenance, including reload monitoring, license audits, resource usage tracking, and log review, ensuring 99.9% platform uptime.
  • Documented recurring BI issues and resolutions, contributing to the BI support knowledge base and improving team efficiency.
  • Creating automation with Python for auto-discovering AWS Public Endpoints and scanning them using Rapid7 InsightVM.
  • AWS Administration and Infra automation using Terraform and CloudFormation.
  • Using Terraforms to set up the AWS infrastructures such as launching the Ec2 instances, S3 bucket objects, VPC, and Subnets.
  • Using Lambda for various automation in AWS.
  • Securing AWS infrastructure by following methodologies and best practices.
  • Created system alerts using various DataDog tools and alerted application teams based on the escalation matrix.
  • Centralized Security Logging with Elasticsearch and Maintaining 45 AWS.
  • Implemented Scout2 across accounts for AWS Misconfiguration.
  • Using Kubernetes to manage containerized applications using its nodes, Config Maps, Namespaces, Service Meshes, selectors, Services, and deployed application containers as Pods.
  • Managed Docker orchestration and containerization using Kubernetes, scaling and managing Docker Containers.
  • Setting up the DataDog monitoring across different servers and AWS services.
  • Created DataDog dashboards for various applications and monitored real-time and historical metrics.
  • Did set Zabbix and Grafana for the monitoring and Dashboards related to Server.
  • Create multiple GitLab pipelines with automation with Terraform and for Different Java Applications.
  • Helped Developers to Dockerize the application.
  • Log correlation and analysis for Security Incidents.
  • Configured multiple security tools for security: Zap/Sslyze/Nmap/etc.
  • Configured the GuardDuty, AWS Config, and Security Hub across the accounts.
  • Enabled WAF for multiple Load Balancers and Cloudfront.
  • Used Ansible to automate configuring the servers and different server installations.
  • Configured Cloudhealth for Security as well as Cost Estimation purposes.
  • Working on Kubernetes Setup and Security.
  • Environment: Integration, Jenkins, Git, SVN, AWS services, Cloud Security, Rapid7 InsightVM, Apache Tomcat, Linux, SQL, Docker, JIRA, Puppet, Python, Shell scripting, Lambda, CloudWatch, Zabbix, Ansible, Grafana, Mongo, etc.

Sr. AZURE DevOps Engineer

Comcast
04.2018 - 09.2020
  • Working on Azure VPN-Point to Site, Virtual networks, Azure Custom security, End,point security, and firewall.
  • Creating the build and release pipelines using Azure DevOps for .Net core and Java-based projects and experience working with Octopus Deploy for automatic workflow.
  • Working on nested template design to create Azure Resources for each environment by creating an automated build and release pipeline.
  • Experience with Azure Site Recovery, Operations Management Suite, Power shell Scripts, and ARM templates.
  • Experience in managing hosting plans for Azure Infrastructure, implementing, and deploying workloads on Azure Virtual Machines (VMs).
  • Monitoring Kubernetes clusters by deploying sidecar Prometheus exporters as a data aggregator and Grafana as a data visualization platform.
  • Deployed AZURE Kubernetes Service (AKS) in Azure by using template-driven deployment options such as Resource Manager Templates and Terraform Deployed Azure Big Data Cluster on ARO (Azure Red Hat Openshift) with profiles, Created patch files to add Active directory user to ARO.
  • Written Python scripts for deploying Big Data Cluster using custom profiles on Kubernetes and OpenShift environments.
  • Developing microservices onboarding tools leveraging Python and Azure DevOps allowing for easy creation and maintenance of build jobs and Kubernetes deploy and services.
  • Worked on Serverless services, created and configured HTTP Triggers in the Azure Functions with application insights for monitoring and performing load testing on the applications using Azure DevOps Services.
  • Installed and Configured Pivotal Cloud Foundry (PCF) Application Manager, Configured LDAP for authorization, and configured Log generator for logs in PCF (Splunk).
  • Converted existing Terraform modules that had version conflicts to utilize ARM during Terraform deployments to enable more control or missing capabilities.
  • Working with Terraform Templates to automate the Azure Iaas virtual machines using Terraform modules and deployed virtual machine scale sets in a production environment.
  • Controlled and automated application deployments and updates and orchestrated deployments using Kubernetes.
  • Deployed workload and configuration builds using Docker, Kubernetes, and Azure CLI.
  • Worked with Dynatrace Guardian to obtain, debug, and update custom monitors and plug-ins.
  • Orchestration improvements to Dynatrace deployment to reduce upgrade time.
  • Integrated Dynatrace with Active Directory, Email Servers, and Event Management.
  • Working on Helm charts to configure CI/CD pipeline and install relevant plugins.
  • Created and managed a Docker deployment pipeline for custom application images in the cloud using Jenkins.
  • Managing local deployments in Kubernetes creating local Namespace and deploying application containers.
  • Maintained automation configuration management tools like Chef and continuous integration/deployment and continuous monitoring solutions.
  • Managed Chef Playbooks with Chef Roles. Used file module in Chef Playbook to copy and remove files on Nodes.
  • Responsible for installing Jenkins master and slave nodes and configuring Jenkins builds for continuous integration and delivery.
  • Created Jenkins pipelines for several Downstream/Upstream job configurations based on dependencies from other applications & based on Release methodologies.
  • Developing custom solutions in C# and PowerShell to validate the availability, consistency, and compliance of environments.
  • Working with source control management tool Gitlab and creating GIT repositories with specified branching strategies.
  • DevOps role converting existing Azure infrastructure to server-less Architecture deployed via Terraform.
  • Developed new Splunk apps to monitor the application log volume (Event count), Indexing volume, missing events, and missing hosts/source/source type from Splunk monitoring.
  • Have experience in Splunk operational intelligence tool, create complex searches, dashboards and alerts.
  • Written Power Shell scripts for archiving and moving of older log files to Azure Storage and automation scripts using Python boto3.
  • Implementing Database Deployment using CI/CD process on Azure SQL Database.
  • Deployed a LAMP server from the command line and migrated the MySQL database and PHP code from Windows Server to CentOS (Red Hat).
  • Environment: Azure, ARM Templets, Azure Virtual Machines, Azure DevOps (VSTS), Pivotal Cloud Foundry, Terraform, Kubernetes, OpenShift, Grafana, Dynatrace, Helm Charts, Docker, Chef, Jenkins, GIT, Splunk, Power Shell Script, C#, .Net, Oracle, Linux/Unix, Red Hat, Centos.

DevOps/AWS Engineer

RMS
01.2017 - 04.2018
  • Build and configure virtual data centers in the AWS to support Data Warehouse including VPC, Public and Private Subnets, Security Groups, Route Tables, and Elastic Load Balancer.
  • Written Templates for AWS infrastructure as a code using Terraform to build staging and production environments.
  • Worked on the Design and implementation of automation in AWS data center infrastructure using Cloud Formation.
  • Migrated MSSQL Server database from Rackspace to AWS and provided day-to-day support.
  • Defined several modules like compute, Network, Operations, Users, etc. for defining reusable Terraform template files.
  • Used Kubernetes to deploy scale, load balance, scale and manage Docker containers with multiple namespace versions.
  • Written Ansible Playbooks in YAML to create users, install applications like Apache, and Postgres SQL, and integrated Jenkins to provide automation, continuous integration, and managed nodes over SSH using Ansible.
  • Worked on Ansible to configure the servers, deploy software, and orchestrate continuous deployments or zero downtime rolling updates.
  • Worked on AWS for deploying EC2 instances consisting of various flavors viz., Oracle Linux, RHEL, CentOS, Ubuntu, and Solaris in both Linux and Windows.
  • Wrote Terraform scripts to automate AWS services which include Web servers, ELB, Cloud Front distribution, Database, EC2, database security groups, and S3 bucket.
  • Worked on setting up the GCP Firewall rules to allow or deny traffic to and from the VM's instances based on specified configuration.
  • Responsible for daily administration tasks in AWS and GCP cloud environments and Setup Alerting and monitoring using stack driver in GCP.
  • Used Ansible Tower to enhance the scheduling of the multiple configurations and to scale the run-time of the clusters.
  • Virtualized the servers using Docker for the test environments and dev-environments needs and configuration automation using Docker containers.
  • Developed container support for cloud environments and Docker. Deployed the applications in Docker containers and managed the containers at the enterprise level.
  • Experience testing Kubernetes clusters using super healthy to make sure pods are highly available, component health, Excessive Pod Restarts along with security considerations.
  • Performed High availability & scalability testing across multiple Kuber clusters under simulated heavy load.
  • Creating new build jobs, Integration testing jobs, and deploying jobs in Jenkins to automate the process.
  • Configured Jenkins CI tool to automate the regression testing by including test cases using Selenium Plugin in Jenkins.
  • Developed Maven Scripts to automate the compilation, deployment, and testing of Web and J2EE applications.
  • Worked on Branches, tagging, merges, cherry-picks, user permissions, release process in GIT.
  • Monitoring API’s and Cassandra’s performance using New Relic, Cloud Watch and Log stash.
  • Helped release the Docker based New Relic browser testing solution.
  • Migrated Oracle Databases from on-premises to AWS RDS - Oracle, Postgres, MySQL (views, stored procedures, functions, materialized views).
  • Experience in writing the infrastructure automation scripts in Python and Ruby Source Code Management.
  • Experience of working with the release and deployment of large-scale Java/J2EE Web applications.
  • Working on NoSQL databases including HBase, Mongo DB, and Cassandra, Configuring Apache Mesos and Marathon using cloud formation templates on Ubuntu.
  • Environment: Amazon wed services (AWS EC2/AMIs/VPC/S3/IAM/glacier), Open Stack, RHEL, Centos, Ubuntu, Solaris, Route Tables, Terraform, Kubernetes, Ansible, YAML, Apache, Postgres, CI, SSH, Jenkins, Maven, My SQL Server, Cassandra, New Relic, Java/J2EE.

Sr. Site Reliability Engineer

Capital One
12.2016 - 12.2017
  • Created tagging standards for proper identification and ownership of EC2 instances and other AWS resources.
  • Deployed Extra Hop Monitoring tool on AWS to identify workloads to migrate to AWS and to optimize the performance of all Dependent Infrastructure Services.
  • Used AWS Beanstalk for deploying and scaling web applications and services developed with Java.
  • Experience in writing test cases, and modules and integrating with Bamboo.
  • Administered and Implemented CI tool Bamboo for automated builds and migrated all projects from perforce to sub-version.
  • Performed all the build and release management activities for this project by administrating and integrating Source Code control systems like SVN.
  • Used Jenkins and pipelines to drive all microservices builds out to the Docker registry and then deployed to Kubernetes, Created Pods and managed using Kubernetes.
  • Configure XL Deploy and Maven for the Deployment of packages to the environment.
  • Installed and configured performance monitoring tools like Nagios and alert solutions for server disk management.
  • Monitored all the hosts and their services in the Nagios monitoring tool daily.
  • Written multiple Playbooks in Ansible, and implemented environments, and roles in Ansible for better environment and security management.
  • Demonstrated on Ansible along with Ansible Tower can be used to automate different software development processes all over the team organization.
  • Created BASH shell scripts to automate Cron jobs and system maintenance and scheduled Cron jobs for job automation.
  • Environment: AWS (EC2/Route53/AMIS/VPC/S3/RDS/DynamoDB/SNS/SQS), GCP, Ansible, Python, Bamboo, SVN RedHat Linux (4.x/5.x/ 6.x), Maven, Nagios, Power Shell, Bash Shell, Corn Jobs, SQL, Linux, Solaris (9/10/11).

Build Release Engineer

Sorenson Media
09.2015 - 11.2016
  • Install Jenkins on a Linux machine and create a master-slave configuration to implement multiple parallel builds through a build farm.
  • Created Chef Cookbooks to deploy new software and plugins as well as manage deployments to the production Jenkins server.
  • Created cookbooks, and playbooks in Chef and ANSIBLE for the administration of both Linux and Windows servers.
  • Worked on ANSIBLE playbooks to create the images in our private cloud.
  • Extensively worked on Jenkins by installing, configuring, and maintaining Continuous Integration (CI) and End-to-End automation for all builds and deployments.
  • Worked in all areas of Jenkins setting up CI for new branches, build automation, plugin management and securing Jenkins, and setting up master/agent configurations.
  • Installed and configured Jenkins, installed plug-in, configured security, and created a master and slaves for implementation of multiple parallel builds.
  • Implemented and Maintained the Branching and Build/Release strategies utilizing GIT source code management.
  • Used build tools like ANT for the building of deployable artifacts such as war & ear from source code.
  • Wrote Python scripts with Cloud Formation templates to automate the installation of Auto scaling, EC2, VPC, and other services.
  • Wrote scripts in Shell and YAML to automate daily tasks and deployments to eliminate the manual process.
  • Converted Java projects into Maven projects by creating POM files and ensured all the dependencies were built.
  • Deployed SQL scripts in Oracle, and Abinitio tags in various test environments.
  • Environment: Linux (RedHat 7.x, 6.x, 5.x), Chef, Jenkins, CI/CD, GIT, ANT, Nagios, Python, YAML, Java, Maven, .Net, SQL Script.

Education

MBA - Information Technology

Campbellsville University
Campbellsville, KY
11-2020

Skills

  • DevOps practices
  • Tableau proficiency
  • Data visualization expertise
  • Jenkins pipeline management
  • Infrastructure as code with Terraform

Timeline

SRE/DevOps/ BI Operations Engineer

Expedia Group
10.2020 - Current

Sr. AZURE DevOps Engineer

Comcast
04.2018 - 09.2020

DevOps/AWS Engineer

RMS
01.2017 - 04.2018

Sr. Site Reliability Engineer

Capital One
12.2016 - 12.2017

Build Release Engineer

Sorenson Media
09.2015 - 11.2016

MBA - Information Technology

Campbellsville University
SAI TEJA