Creating and maintaining Infrastructure for applications using Terraform Infrastructure as a code
- using custom and community modules for reusable components
- using separate environments into different workspaces
- using remote backend for state storage
Assist/partner with the team members in scripting, setting up, maintaining CICD pipeline and automating the deployments using Jenkins and Terraform Cloud
- using Jenkins pipelines for each terraform repositories to deliver the code changes
Maintain infrastructure code in GitHub for better code management
Help managing and orchestrating containerized applications using Kubernetes clusters
- using EKS (Amazon Elastic Kubernetes Service) to create and manage K8s cluster
- using Helm charts and Helm providers for deploying and configure ArgoCd on the cluster
System and Application Monitoring (APM)using Datadog and AWS CloudWatch.
- monitoring application latency, errors rate, throughput and saturation
- monitoring infrastructure for cpu, memory and disk usage
- additionally we track standard Linux metrics such as inode utilization, to ensure system stability and performance
Collaborated with development teams to optimize code deployment and troubleshoot issues, resulting in improved application performance
Participate in and on call rotation
- we use PagerDuty for on-call rotation each team member 1 week at a time
Configured and deployed servers to support a new cloud-based application, resulting in improved scalability and performance
Developed data pipelines in cloud-native environments such as AWS and Google Cloud Platform
Maintaining the user accounts(IAM),RDS,Route 53 services in AWS
Automate deployment and configuration processes using Infrastructure as Code (IAC) tools such as Terraform and Ansible.
Utilized cloud computing technologies to reduce overall infrastructure costs
Researched and created documentation regarding new implementation procedures,
problem investigations and resolutions
Monitoring the servers using CloudWatch, Cloud Trail
Managed user accounts, passwords, and access privileges to ensure secure access to applications and data
Installed, configured, and maintained enterprise-level Linux and Windows server systems
Performed upgrades on servers, routers, switches, desktops, general office equipment
Amazon Web Services (AWS), Git, GitHub, GitActions
Linux System Administration, Terraform , Ansible Jenkins, Docker, Kubernetes, ArgoCd, Datadog, Bash, Python