Surya Kranthi Siyadri

Charlotte, United States

Summary

Experienced and dedicated Cloud Engineer with a proven track record in optimizing development and operational processes to enhance efficiency and productivity. Possesses a solid background in cloud technologies such as AWS, Azure, and GCP, specializing in designing, implementing, and managing CI/CD pipelines for seamless software delivery. Proficient in containerization using Docker and Kubernetes, with successful implementation of resilient microservices architectures. Skilled in languages and frameworks such as .NET Core, Python, Java, and JavaScript, with an understanding of the intricacies of application development. Recipient of the IBM Service Delivery Excellence award, committed to continuous improvement. Excels in dynamic environments and is dedicated to achieving optimal results through teamwork and innovation.

Overview

9
years of professional experience

Work History

Sr. DevOps Engineer

World Group Inc
06.2023 - Current

• Performed dependency and risk analysis to break down a monorepo into a polyrepo structure to increase scalability, reduce build time, isolate CI/CD pipelines, and enable fine-grained access control.
• Analyzed the existing Google Cloud (GCP) and DevOps setup for the client and provided recommendations in the areas of Security, Automation, Auditing, and Standardization.
• Evaluated and documented the existing CI/CD setup detailing the current build and release processes in Cloud Build and Cloud Composer pipelines.
• Collaborated with frontend developers to seamlessly integrate Flutter components into an existing Angular web application, enhancing UI responsiveness and improving user experience.
• Engineered modular Flutter components to efficiently handle dynamic content rendering, reducing development time by 25%.
• Developed a robust backend architecture for a mobile application using .NET Core and C#, ensuring scalability, reliability, and security.
• Ensured secure user authentication and data management, implementing RESTful APIs to facilitate seamless communication between the mobile app and backend services, resulting in a 50% improvement in response time.
• Established API connections to integrate various Google and external services with Cloud Workflows, enabling seamless communication and data exchange. This integration enhanced workflow capabilities and interoperability.
• Utilized Docker containers to streamline the deployment process of a microservices-based application, reducing deployment time from hours to minutes.
• Remediated integration failures by updating certificates on the GCP resources and setting up monitoring and alerting for certificate expirations.
• Created workflows in Cloud Workflows to extract PDFs from Gmail and send them to the client server through Pub/Sub integration.
• Developed Cloud Functions using Python to streamline data operations and automate the conversion of Excel files to PDF format, meeting end client requirements for data training.
• Orchestrated the creation of development and production environments on GCP, provisioning Compute Engine instances, managed instance groups, VPC networks, and load balancers to ensure robust performance and scalability for critical applications.
• Connected the IBM MQ message queues to the Hyperscience client server for training data models and extracting the required fields in JSON.
• Managed and maintained Google Kubernetes Engine (GKE) clusters, ensuring high availability, scalability, and performance of containerized applications.
• Proficiently orchestrated the deployment of pods within GKE clusters, utilizing YAML manifests and Helm charts to define and configure containerized applications.
• Implemented robust security measures by regularly updating and managing secrets within Kubernetes clusters, safeguarding sensitive information, and ensuring compliance with security best practices.
• Resolved ingress issues efficiently by diagnosing and fixing misconfigurations using Helm charts and Kubernetes resources, optimizing traffic flow, and enhancing application accessibility.
• Integrated Data Studio with BigQuery data warehouse, enabling the creation of interactive dashboards and reports for business stakeholders, improving data-driven decision-making processes.
• Implemented Google Cloud SQL for SQL Server to manage databases for .NET applications, reducing database maintenance time by 60% and improving reliability.
• Integrated Docker within Cloud Build pipelines to facilitate containerized builds and deployments, ensuring consistent environments across development, testing, and production stages, thereby enhancing the portability and reproducibility of software artifacts.
• Implemented CI/CD pipelines using Cloud Build and GitHub Actions, ensuring automated testing and deployment.

Site Reliability Engineer (SRE)

Capital One
05.2021 - 05.2023
  • Worked as a SPOC and contributor for the DataStax Cassandra gear, overseeing data storage, cluster creation, modification, rehydration, and deletion operations
  • Implemented diverse functionalities and addressed reported bugs within the gear as per user requests, ensuring optimal performance and reliability
  • Worked on migrating the DSE-Cassandra gear to a managed pipeline gear, effectively transferring Jenkins control to the centralized team responsible for maintaining and overseeing Jenkins operations across the organization
  • Implemented Jenkins-based infrastructure as code solutions using tools like Ansible and Terraform to automate the provisioning and configuration of AWS resources for CI/CD pipelines
  • Implemented parallel rehydration of the Cassandra datacenters, reducing rehydration time by 50% and sparing teams system downtime
  • Enabled users to execute per datacenter rehydration processes through Jenkins, streamlining database fixes and rehydration operations within individual datacenters seamlessly
  • Added Lambda layers to connect the Lambda function with Vault and read the secrets during gear provisioning
  • Enhanced functionality within Jenkins to automate the installation of Docker on Cassandra instances and orchestrated the deployment and maintenance of AWS RDS instances, optimizing database performance, security, and availability for critical applications
  • Implemented high availability and disaster recovery strategies for AWS RDS databases, including Multi-AZ deployments
  • Worked on various Secure Guardian vulnerabilities such as findings from Qualys Vulnerability Scan, Java version upgrades and Security Group upgrades
  • Implemented encryption-at-rest and encryption-in-transit for sensitive data stored in AWS RDS databases, ensuring compliance with data security and privacy regulations
  • Worked with HashiCorp Vault to add, remove, and list lockboxes and lockbox paths, and to access, read, and write secrets
  • Parameterized the acceptance tests and wrote multiple test scenarios and cases for integration testing to achieve test automation
  • Added monitoring capabilities for the Cassandra resources, allowing users to choose between New Relic and Datadog for importing CloudWatch logs
  • Collaborated with frontend developers to integrate Vue.js components into an existing Angular web application, improving UI responsiveness and enhancing user experience
  • Implemented reusable Vue.js modules for dynamic content rendering, reducing development time by 25%
  • Incorporated Cloud Custodian policies across the LOB that enforced additional compliance on resources and deleted unused resources, reducing costs for the organization
  • Refactored custodian off-hour policies and added multiple notify policies that checked the availability of the required tags on the AWS resources
  • Added IAM permissions to the new S3 buckets to access policy execution and platform run logs for Cloud Custodian
  • Played an integral role in enhancing CloudWaze functionalities, such as configuring Route 53 to fail over traffic between AWS Regions
  • Remediated security vulnerabilities identified within the CloudWaze infrastructure, promptly resolving reported issues while vigilantly monitoring alerting dashboards to ensure proactive risk mitigation
  • Provided guidance and training to teams on optimal utilization of CloudWaze infrastructure during critical disaster recovery events.

Cloud Engineer

Sophos Ltd.
01.2021 - 04.2021
  • Spearheaded the strategic design and execution of GCP cloud environments tailored specifically for reporting databases, meticulously ensuring peak performance, seamless scalability, and robust security measures
  • Leveraged a suite of GCP services including Cloud SQL, Cloud Storage, and Cloud Storage Nearline to orchestrate and implement comprehensive data backup, disaster recovery, and high-availability strategies, fortifying data integrity and ensuring uninterrupted business operations
  • Managed user access and permissions in GCP using IAM, ensuring appropriate access control
  • Utilized service accounts to integrate GCP resources securely, maintaining the integrity of the environment
  • Orchestrated the migration of microservices to an updated platform, enriching functionality and enhancing efficiency, while meticulously upgrading Terraform scripts from version 0.11 to 0.13 to align with evolving infrastructure needs
  • Engineered and crafted new Terraform modules while optimizing existing ones to facilitate the agile deployment of infrastructure as code across diverse environments, encompassing DEV, QA, and PROD, leveraging Grafana and OpsGenie integrations
  • Architected and provisioned resilient Compute Engine Instances using Terraform, innovatively crafting custom plugins to support novel functionality within the Terraform ecosystem
  • Developed Terraform scripts to establish Server-Side Encryption (SSE) and seamlessly integrated Google Cloud Key Management Service (KMS) to encrypt data across GCP workloads, bolstering data security and regulatory compliance
  • Engineered a Python-based Cloud Function to streamline user access to Cloud SQL DB credentials and automate the rotation of secrets containing both master user and app user credentials, enhancing security and operational efficiency
  • Orchestrated the deployment of Grafana containers using Docker images and curated insightful Grafana dashboards, validating infrastructure changes and performance metrics aligned with Terraform deployments
  • Implemented automated infrastructure code validation utilizing Terratest, seamlessly executing Golang based tests within Docker containers orchestrated by Jenkins, ensuring robust code quality and integrity
  • Augmented Terraform automation with custom Bash and Python scripts, leveraging Google Cloud SDK for tasks such as Persistent Disk encryption and Cloud Function scheduling, optimizing automation workflows and enhancing GCP task management
  • Architected and administered Jenkins pipelines to orchestrate infrastructure provisioning from GitHub repositories containing Terraform code, streamlining development workflows and ensuring consistency across deployments
  • Spearheaded the setup of CI/CD pipelines using Jenkins, Maven, and Terraform integrated with GitHub repositories, accelerating software delivery and enhancing development agility
  • Proficiently utilized Atlassian products including Bitbucket, Confluence, JIRA, and SourceTree, providing adept user support and contributing to the seamless collaboration and management of development projects.

Cloud Infrastructure Intern

Geneia
08.2019 - 08.2020
  • Worked with cloud and infrastructure support teams and designed a highly available, secure, multi-zone AWS cloud infrastructure utilizing Chef with AWS CloudFormation
  • Implemented a serverless architecture using API Gateway, Lambda, and DynamoDB and deployed AWS Lambda code from Amazon S3 buckets
  • Created a Lambda deployment function and configured it to receive events from S3 buckets
  • Provisioned Lambda functions to set up Logstash for centralized logging
  • Designed roles and groups using AWS Identity and Access Management (IAM)
  • Created and updated Auto Scaling and CloudWatch monitoring configurations using the AWS CLI
  • Built and configured EC2 instances on AWS cloud platform, configured Elastic Load Balancer for traffic control for the EC2 and S3 buckets
  • Designed and implemented a scalable, RESTful, microservices-based mobile back-end, written in Java using Spring Boot for simplicity and scalability
  • Developed back-end logic with Core Java using technologies including Collection Framework, Multi-Threading, Exception Handling, Generics, and Annotation
  • Enhanced user interfaces to enable input of additional personal information for plan generation using CSS, HTML5, JavaScript, and AngularJS
  • Extensively involved in infrastructure as code, execution plans, resource graph, and change automation using Terraform
  • Implemented AWS CodePipeline and created CloudFormation JSON templates in Terraform
  • Automated the migration of Subversion (SVN) repositories to Git while preserving the commit history and other metadata like branches, tags, and authors
  • Created Groovy DSL scripts in Jenkins to automate most of the build-related tasks
  • Staged and created CI/CD pipelines and merged changes through the SDLC pipeline
  • Managed Java/J2EE enterprise applications in an agile environment, automated solutions using Python, and managed their artifacts in the JFrog repository
  • Used the Maven dependency management system to deploy snapshot and release artifacts to JFrog to share artifacts across projects
  • Deployed and configured Prometheus to monitor Kubernetes nodes with node-exporter and to monitor the Kubernetes API and resources with kube-state-metrics
  • Wrote Chef Cookbooks for various DB configurations to modularize and optimize product configuration, converting production support scripts to Chef Recipes and AWS server provisioning using Chef Recipes
  • Used Chef recipes to set up Continuous Delivery pipeline with Jenkins, SonarQube, and Vagrant
  • Implemented Datadog monitoring and observability solutions to gain real-time insights into application and infrastructure performance
  • Designed and configured custom Datadog dashboards, alerts, and monitors to track key performance indicators (KPIs) and proactively identify and resolve issues.

Research Graduate Intern-Cloud Computing

University of New Hampshire
09.2018 - 08.2019
  • Wrote Azure Resource Manager (ARM) templates to create custom-sized Virtual Networks, Subnets, Virtual Machines, Load Balancers, and Network Security Groups
  • Configured and maintained Azure Functions that trigger when a Jenkins build is initiated and store the output in Azure Blob Storage for team-wide access
  • Used Network Security Groups, Azure Firewall, Azure Bastion, and Route tables to ensure a secure zone for organizations in the Azure public cloud
  • Created NAT Gateways and Proxy instances in Azure and managed route tables, Public IPs, and Network Security Groups
  • Configured Virtual Networks (VNet) with a network of subnets containing servers
  • Wrote Bash and Python scripts integrating the Azure SDK to supplement automation provided by Ansible and Terraform for tasks such as encrypting Azure Managed Disks and scheduling Azure Functions for routine Azure tasks
  • Utilized Azure Data Factory and Apache Airflow to automate data ingestion, transformation, and loading processes, reducing manual effort and improving efficiency
  • Optimized Apache Airflow DAGs (Directed Acyclic Graphs) for performance and reliability, implementing task retries, error handling, and dynamic DAG generation techniques
  • Implemented data backup, disaster recovery, and high-availability strategies, resulting in 99.99% uptime for critical data systems
  • Collaborated with data engineers and modelers to facilitate seamless data integration and flow between on-premises and cloud-based systems
  • Extensively involved in infrastructure as code, execution plans, resource graph, and change automation using Terraform
  • Implemented Azure DevOps Pipelines and created ARM templates in Terraform
  • Automated Azure Monitor dashboards and assisted internal Azure Sentinel users in designing and maintaining production-quality dashboards
  • Implemented container orchestration using Azure Kubernetes Service (AKS), ensuring high availability and scalability of the application
  • Implemented API security measures using Azure Active Directory (AAD) and OAuth2, ensuring data privacy and integrity.

Cloud Engineer

IBM
09.2016 - 08.2018
  • Collaborated with cross-functional teams to execute the migration of on-prem resources, leveraging Red Hat OpenShift on IBM Cloud to modernize and optimize application environments for scalability and efficiency
  • Configured Red Hat OpenShift Point-to-Site VPN, virtual networks, custom security, endpoint security, and firewall, and designed and configured OpenShift virtual networks, subnets, network settings, DHCP address blocks, DNS settings, security policies, and routing
  • Provided high availability for IaaS VMs and PaaS role instances for access from other services in the VNets with OpenShift Internal Load Balancer
  • Worked on OpenShift App Insights, Alerts, and Log Analytics for Monitoring as part of the Operations Management Suite (OMS)
  • Used OMS & Power BI for visualizing activities
  • Created OpenShift Kubernetes Service along with the whole ecosystem, such as resource groups, virtual networks, and storage accounts, using Terraform, and used Terragrunt to reuse the same code across different environments
  • Set up the Red Hat OpenShift build and release pipelines, built the Docker images, and pushed them to the Red Hat OpenShift container registry with appropriate tags
  • Implemented modules using Core Java APIs, Java collection, multi-threading, and object-oriented designs
  • Managed Java/J2EE enterprise applications within an agile environment, leveraging Python for automation and overseeing artifact management in the JFrog repository
  • Deployed web applications into different application servers using Jenkins and implemented Automated Application Deployment using Chef
  • Created and modified build configuration files including pom.xml and converted build.xml into pom.xml to build the applications using Maven
  • Virtualized the servers with Docker for the Dev and Stage environments, and automated configurations using Docker containers
  • Wrote Chef cookbooks and recipes to automate the installation of middleware infrastructure such as Apache Tomcat and JDK, as well as configuration tasks for new environments
  • Developed and customized Splunk dashboards, visualizations, and configurations using advanced Splunk queries to meet specific requirements and enhance data analysis capabilities
  • Monitored Splunk authentication and permission for supporting large-scale Splunk deployments
  • Created and cloned Linux virtual machines and templates using VMware, migrated servers between ESX hosts using vMotion, and performed firmware upgrades, kernel patching, system configuration, and performance tuning on Unix/Linux systems.

Associate DevOps Engineer

Genpact
12.2014 - 09.2016
  • Designed, built, configured, tested, and installed software, managing all aspects of application development environments in AWS, emphasizing Java-based microservices and API development
  • Utilized AWS CloudFormation templates to design custom-sized VPCs, subnets, and NAT gateways, ensuring successful deployment of web applications and databases
  • Integrated automated build with deployment pipeline using Jenkins, improving throughput and efficiency of the build system through innovative automation techniques
  • Automated provisioning of cloud infrastructure using Chef, significantly reducing costs and eliminating unnecessary resources
  • Replaced manual deployment and management processes with Chef and AWS OpsWorks stacks across various product platforms, enhancing scalability and consistency
  • Installed and supported multiple databases and applications, such as Oracle, MySQL, WebLogic, JBoss, and Apache Tomcat, ensuring optimal performance and reliability
  • Developed and organized Shell scripts for automation of complex software systems, streamlining deployment processes
  • Managed RHEL 5, 6, and 7 systems, including installation, testing, tuning, upgrading, patching, and troubleshooting on both P Series and VMware virtualization platforms
  • Scripted automation activities and builds using Perl, Bash, and batch files, improving operational efficiency and reducing manual intervention
  • Monitored system performance, tuned configurations, and managed logs to ensure optimal operation of cloud environments
  • Conducted TCP/IP networking troubleshooting and Linux/Network administration, identifying and resolving issues promptly
  • Collaborated with network and incident analysts to monitor current attack and threat information, proactively identifying potential security threats
  • Implemented Jira with Maven release plugin for bug tracking and defect management, ensuring efficient tracking and resolution of issues
  • Managed server virtualization using VMware ESXi and Oracle Virtual Manager, optimizing resource utilization and scalability
  • Coordinated with application teams to install, configure, and troubleshoot issues with JBoss servers, ensuring smooth operation of Java applications.

Education

Master of Science - Information Technology

University of New Hampshire
Manchester
09.2020

Bachelor of Technology - Electronics & Communication Engineering

The ICFAI University, Tripura
Agartala, Tripura, India
12.2011

Skills

  • Cloud Computing Expertise
  • Microservices Architecture
  • Continuous Integration & Continuous Delivery (CI/CD)
  • Containerization Technologies
  • Scripting Languages
  • .NET Core
  • Java
  • Python
  • Application Programming Interface (API)
  • Ansible
  • Terraform
  • GitHub
  • New Relic
  • Splunk

Timeline

Sr. DevOps Engineer

World Group Inc
06.2023 - Current

Site Reliability Engineer (SRE)

Capital One
05.2021 - 05.2023

Cloud Engineer

Sophos Ltd.
01.2021 - 04.2021

Cloud Infrastructure Intern

Geneia
08.2019 - 08.2020

Research Graduate Intern-Cloud Computing

University of New Hampshire
09.2018 - 08.2019

Cloud Engineer

IBM
09.2016 - 08.2018

Associate DevOps Engineer

Genpact
12.2014 - 09.2016

Master of Science - Information Technology

University of New Hampshire

Bachelor of Technology - Electronics & Communication Engineering

The ICFAI University, Tripura