Summary
Overview
Work History
Education
Skills
Timeline
Generic

DEEPIKA

Sr. Cloud/DevOps Engineer

Summary

Around 9 years of IT expertise in DevOps, CI/CD, Terraform, Build Automation Strategies, Configuration Management, AWS/Azure/Google Cloud Platforms and Implementations.

  • Implemented DevOps methodologies with end-to-end configuration and SDLC management, automation build and deploy operational enhancements across Dev, QA, and Prod environments, resulting in a 25% increase in deployment efficiency.
  • Automated infrastructure setup, configuration and management using Azure, AWS, and Google Cloud, reducing the manual effort by 40%.
  • Developed and maintained Big Data solutions using Hadoop, Spark, HDFS, Hive, Kafka, Elastic Search, and ZooKeeper, supporting data processing for 100+ TB of data.
  • Implemented automation strategies in infrastructure as code with Terraform, ARM templates, AWS cloud formation templates, Ansible, Docker, PowerShell, Chef and Vagrant.
  • Created and managed Jenkins jobs, plugins, and test case integrations, optimizing CI processes for 15+ projects. Handled IAM for Azure Subscriptions, Azure AD, AD Application Proxy, Azure AD Connect, and Pass-through Authentication, enhancing security compliance by 35%.
  • Automated CI/CD pipelines for Azure cloud-based data migration using Azure DevOps(VSTS), PowerShell, and GIT, increasing deployment frequency by 40%.
  • Worked on container-based technologies like Docker, Kubernetes and OpenShift for creating new projects.
  • Managed container-based deployments using Docker, Kubernetes, and Docker Swarm, AWS EKS, reducing deployment errors by 20%.
  • Deployed and managed Kubernetes clusters using KOPS, Helm charts, and local deployments, ensuring high availability for 50+ microservices.
  • Leveraged on Google Cloud Platform (GCP) services like Compute engine, Cloud storage, Cloud load balancing, Cloud SQL, Stack driver monitoring and Cloud deployment manager.
  • Configured and migrated database servers with Azure SQL, MySQL, Oracle DB, MongoDB, PostgreSQL, DynamoDB, and Cassandra, ensuring data integrity and security.
  • Managed build and release of cloud-based products on Linux and Windows environments using PowerShell, TFS, and Python scripting.
  • Wrote Ansible playbooks and managed AWS nodes, provisioning development and QA servers, increasing scripting efficiency by 20%.
  • Implemented monitoring tools like Splunk, Nagios, Prometheus, Grafana, Cloud watch, App Dynamics, New Relic and Stack Driver. Proficient in Networking and configuring TCP/IP, DNS, NFS, NIS, NIS+, SAMBA, LDAP, SSH, SSL, SFTP, SMTP, SNMP servers.

Overview

9
9
years of professional experience

Work History

Sr. Cloud/DevOps Engineer

TikTok
05.2022 - Current
  • Provided on-call support to resolve the production incidents and ensure services remain reliable, highly available, and scalable
  • Managed Tier-1 online incident response, resolving and preventing production issues
  • Collaborated with teams to define and implement Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Service Level Agreements (SLAs)
  • Monitored and reported on service level objectives for application services
  • Implemented an automated alerting system reducing false positives by 70%, decreasing detection time for major incidents by 50% and ensuring timely response to critical alerts
  • Worked on a project that resulted in the reduction of resource consumption by 30% through container optimization and scalability enhancements
  • Enhanced production environment monitoring to expedite issue resolution
  • Focused on system capacity and online stability to improve processing efficiency
  • Installed and upgraded Kubernetes clusters, implemented autoscaling for nodes and creating cluster IP, NodePort and LoadBalancer, enhancing system scalability
  • Utilized horizontal pod autoscaling and troubleshooting issues both at application and cluster level
  • Developed and maintained automated procedures to maximize system efficiency
  • Implemented Elastic stack of Elasticsearch, Kibana, APM Server on Kubernetes environment using helm to monitor and perform a log analysis
  • Implemented monitoring tools and metrics to track system health and performance, which reduced system incident detection time by 50%
  • Wrote Golang code to pull data from Kinesis and load data to Prometheus and used by Grafana to visualize metrics
  • Implemented robust monitoring solutions using Grafana ensuring swift resolutions and reducing the incident response times by 40%
  • Created Grafana dashboards to monitor deployment health, service status and API metrics
  • Handled incident management, problem-solving, post-mortem analysis, security, and compliance
  • Optimized and standardized alarm systems to enhance on-call experience
  • Managed system backups and disaster recovery for high-availability, achieving a 99.5% effective recovery rate from incident
  • Developed an extensive disaster recovery plan, reducing the potential data loss scenarios by 75% through strategic backup procedures and failover mechanisms
  • Developed runbooks and Standard Operating Procedures (SOPs) for faster issue resolution
  • Optimized user experience through data monitoring, problem aggregation, and circulation capabilities
  • Assisted with testing and validating production applications
  • Hosted the scrum meetings for the team.

GCP Cloud Engineer

EQUIFAX
08.2021 - 04.2022
  • Designed and deployed Terraform in the cloud deployment manager to spin up resources like cloud virtual networks, Compute Engines in public and private subnets along with Auto Scaler in Google Cloud Platform (GCP)
  • Worked on configuration and deployment of the VM instances on GCP environments and Data centers
  • Managed the Infrastructure on Google cloud Platform (GCP) using various GCP services such as Compute Engine, Cloud Functions, Cloud DNS, Cloud Storage and SaaS, PaaS and IaaS, Big Query, Pub/Sub
  • Created Virtual Machines in Google Cloud Platform (GCP), installed the Docker and Docker swarm
  • Built and maintained the Docker container clusters managed by Kubernetes Linux, Bash, GIT, Docker, on Google Cloud Platform (GCP) and utilized Kubernetes and docker for the runtime environment of the CI/CD system to build, test and deploy
  • Worked on maintaining the user accounts (IAM), Cloud SQL, Cloud DNS, VPC, RDB and Cloud Datastore services in Google Cloud platform (GCP)
  • Developed a highly scalable data model and data warehouse using Snowflake, resulting in a 40% improvement in data processing speed and a 25% reduction in storage costs
  • Migrated on-premises ETLs to Google Cloud platform (GCP) by utilizing cloud native tools such as Big Query, Cloud data Proc, Google Cloud Storage, Composer
  • Optimized ETL processes for loading data into the Snowflake, reducing 50% data loading time and improving 15% of the overall data quality
  • Created Jenkins pipelines to build and deploy microservices onto Kubernetes cluster of Pods and Docker containers serving as Master/Slave nodes servers end users
  • Streamlined data processes by creating automated Python scripts, reducing the manual intervention by 50%
  • Created GCP IAM roles, policies, and service accounts
  • Handled monitoring and management of instance groups with the Google Cloud Monitoring and worked on application monitoring with AppDynamics on Google Cloud Platform (GCP)
  • Created and implementing application performance monitoring, and logging strategies using AppDynamics.

Cloud Automation Engineer

AT&T
05.2020 - 07.2021
  • Migration of On-Premises data services to Azure with Infrastructure as a Service using Terraform
  • Configuration and management of build and deployment pipelines with YAML (pipeline as a code)
  • Provisioned and managed Azure Paas resources i.e., Key vault, Function Apps, AKS, Blob Storage, Application Gateway using automation scripts
  • Automated the Azure Iaas virtual machines and deployed the virtual machine scale sets using the terraform modules in pre-production/production environments
  • Deployed and configured VMs availability sets for resiliency for the IaaC based solutions using the Azure Resource Manager
  • Configured the Azure services such as Azure Cloud services, Azure storage, Azure Active Directory, Azure Blob Storage, Azure Key vault, Azure virtual networks, Subnets, Network security groups, Azure VMs, Azure VMSS, Azure Site Recovery, Azure Functions, Azure Monitor, ensuring 99.9% uptime
  • Implemented data solutions for integration of Azure storages, processing and visualization with monitoring tools
  • Managed and monitored Azure Data Lakes (ADLS) and Azure Data Analytics integrating with Azure resources
  • Worked in Integrating an application with Azure AD, Design/Implement a multi-site or hybrid network, set up Site to Site & Point to Site VPN between on-prem and Azure Networks, Design/Implement Azure Site Recovery/Azure Backup, Implement Azure RMS and EMS
  • Utilized azure provider in terraform for deploying custom configured Azure service architecture environment, which is of services like Subnets, VM’s, Load Balancers, Security Groups, CDN, Monitor, App Service, Functions, Blobs, SQL Databases, Virtual Disks, DNS, Notification Hubs, Queues, Key Vault, Search, AD
  • Developed terraform modules to automate provisioning azure VM’s for application deployment and state files are stored within Azure Blob Storage
  • Automated azure resources in multi-platform environment such as Linux and Windows
  • Deployment of Azure Functions with application insights for monitoring the applications
  • Configured and managed RBAC for users, roles and resources over Azure Active Directory
  • Deployment of Look Ahead environment and testing and perform Smoke testing/Sanity testing of Pre-Prod deployed Environments
  • Automated deployment processes with python, deploying .Net applications on servers with CI/CD pipelines in Azure, leveraging pipeline as code with YAML and PowerShell
  • Automated CI/CD pipeline for the Azure cloud-based analytical data migration using Azure DevOps, PowerShell and GIT as versioning and controlling tool
  • Utilized Azure DevOps (VSTS) pipelines to build and deploy microservices, create pods, config maps and deployments of helm charts into the Kubernetes cluster
  • Deployed and configured the Oracle DB, MySQL DB, Mongo DB, PostgreSQL DB through the automation scripts using the CI/CD pipelines
  • Installed and configured the Azure Site Recovery and Azure Backup and enable the Azure Virtual machine backup from the vault.

DevOps Engineer

United Health Group
09.2016 - 06.2018
  • Worked on managing the Infrastructure on AWS and Azure cloud platforms
  • Designed and deployed a multitude application using AWS stack (Including EC2, Route53, S3, RDS, DynamoDB, SNS, SQS, IAM) & used MySQL, DynamoDB and Elastic Cache to perform basic database administration
  • Managed Amazon Web Services infrastructure with automation and configuration management tool Ansible
  • Created IAM policies for roles and users supporting remote applications across the globe, transfer data from Data Centers to cloud using AWS import/export service
  • Created and implementing application performance monitoring, and logging strategies using New Relic
  • Designed AWS Cloud Formation templates to create custom sized VPC, subnets, NAT to ensure successful deployment of Web applications and database templates
  • Developed AWS Cloud formation templates for provisioning AWS services like VPC, subnets, NAT, S3
  • Developed and launched a data analytics platform on Snowflake Cloud Data Warehouse, which managed over 10 TB of data
  • Automated Jenkins by setting up commit builds to check for compilation failures of checked-in source code by the developers to accelerate CI
  • Deployed java and .net application by developing build scripts in Maven and Gradle to build war, jar build artifacts
  • Experience in using the container based Virtualized deployments using Docker, working with Docker images, Docker hub and Docker registries and creating Docker containers from existing Linux servers
  • Worked on creating Dockers images with Docker files out of source code and pull the Docker images and run-on Test, Stage, and Production environments
  • Worked on deploying AWS instances with Security groups, route tables with whitelisting and blacklisting the traffic, ELB, fault tolerant and high available systems, auto scaling for cost cutting
  • Configured and Managed Apache web server and Bastian host by redirecting the public traffic
  • Worked with Jenkins for continuous integration and deployment integrating GIT as a plugin for source code repository for code checkout process, adding storage to the cluster disks and increasing/ decreasing the file system in RHEL
  • Developed Groovy scripts for importing credentials/settings into the Jenkins environment using an initialization script and build pipelines
  • Created and maintained ETL pipelines that processed over 1 million records daily, ensuring high data accuracy and accessibility
  • Worked on OpenShift platform in managing Docker containers and Kubernetes Clusters
  • Configured and integrated the servers for different environments to automatically provision and configuration management of Linux instances using Ansible, improving efficiency by 30%
  • Build and Configured OpenShift Infrastructure to create Kubernetes clusters, nodes and pods using Ansible playbooks
  • Automated scalable server provisioning and configuration management using Ansible playbooks
  • Developed automation scripts in Python, Shell for administration tasks like file system management, process management, backup and restore by creating Cron jobs.

Linux system administrator

Newgen Software Technologies Ltd
12.2015 - 05.2016
  • Installation, integration and management of data backup/recovery solutions
  • Management and configuration of VMWare machines running Oracle/Sun Solaris, Red Hat Enterprise Linux and Oracle Linux server
  • Installed and configured servers on VMware ESX for various applications via kickstart, PXE
  • Administration of UNIX servers like AIX and Sun Solaris in both test and production environment and applied patches
  • Created and modified user, groups with sudo permission
  • Extensively used Splunk for log analyzing and monitoring network infrastructure using Nagios Generated custom plugins for automating the build activities in QA, Staging and Production environments
  • Scheduled and managed cronjobs, batch processing and job scheduling using crontab, wrote shell scripts to automate System Process and troubleshooted with the help of netstat, ping, NS lookup and traceroute tools
  • Installed and configured LAMP stack (Linux, Apache, MySQL, and PHP) for various new and existing applications
  • Troubleshooting Linux network, security related issues, capturing packets using tools such as IP tables, firewall, TCP wrappers, NMAP
  • Maintained Samba File Server for user authentication, syslog domain, and file sharing in Linux/Unix
  • Managed TCP/IP packets & DHCP servers, resolved TCP/IP network access problems for the clients and worked with various TCP/IP implementations like NFSv4, NIS, DNS and DHCP.

Education

Master’s in Computer Science -

Louisiana State University
Shreveport, LA
01.2019

Bachelor of Technology in Electronics and Communication -

Uttarakhand Technical University
01.2014

Skills

Infrastructure Automation: Terraform, Ansible, AWS

CloudFormation, ARM, Chef, Salt Stack, Puppet

CI/CD: Azure DevOps, Jenkins, Maven, SonarQube, Packer, Bamboo, JIRA

Cloud Platforms: Azure, AWS, OpenStack, GCP, PCF

Big Data/Hadoop Technologies: Apache Hadoop, Spark, HDFS, Hive, Cassandra, Elastic search, Kafka, Zookeeper

Version Control Tools: Git, GitHub/Bitbucket, Subversion

Microservices: Docker, Kubernetes, AWS ECS, DTR, ECR

Operating Systems: Windows, Linux, CentOS, Ubuntu

Application Servers and Web Servers: Apache Tomcat, JBOSS, Web Logic, Web Sphere, Nginx, Apache HTTP, SQL Server

Languages: C, C, Java, Go, SQL, ASPNET

Logging and Alerting: ELK, Splunk, Cloud Watch, Nagios, Prometheus, Grafana, App Dynamics, SNS

Databases: Oracle DB, Mongo DB, PostgreSQL, MySQL

Scripting: Python, Shell, PowerShell, Ruby, Selenium, Groovy, JavaScript

Timeline

Sr. Cloud/DevOps Engineer

TikTok
05.2022 - Current

GCP Cloud Engineer

EQUIFAX
08.2021 - 04.2022

Cloud Automation Engineer

AT&T
05.2020 - 07.2021

DevOps Engineer

United Health Group
09.2016 - 06.2018

Linux system administrator

Newgen Software Technologies Ltd
12.2015 - 05.2016

Master’s in Computer Science -

Louisiana State University

Bachelor of Technology in Electronics and Communication -

Uttarakhand Technical University
DEEPIKA