Summary
Overview
Work History
Education
Skills
Certification
LinkeIn
Timeline
Generic

RODRIGUE AIME NGONGANG

Adelphi,MD

Summary

Senior Site Reliability Engineer / DevOps/System Engineer / Cloud Engineer / Red hat certified Linux System Administrator / Certified Kubernetes Administrator, I am a motivated IT professional with hands on systems engineering, System configuration, automation/deployment Enthusiastic team player , always looking for innovative and efficient engineering solutions, Energetic self-starter capable of learning quickly with minimal guidance. I am seeking to progress my career in the Information Technology sector where I will use my skills and experience in system maintenance and technical troubleshooting to contribute to an active growth and productivity of the company. I am a permanent resident of the united state of America.

Overview

10
10
years of professional experience
1
1
Certification

Work History

Senior Site Reliability Engineer

TRADESTATION Technology
07.2022 - Current
  • Configured VPC endpoint and endpoint service to connect to an endpoint service in the Ireland region using transit gateway and transit gateway attachment
  • Troubleshooted and resolved issues related to Pod Life-cycle Events Generator (PLEG) in a live production environment
  • Support in the implementation and troubleshooting of crypto related components such as Liquidity provider, Market-Data, Order-Execution, and Drop-Copy feed
  • Utilize AWS Secret Manager for secure storage and management of sensitive information and worked with Redshift, a data warehousing solution, to analyze and process large datasets
  • Created robust set of alerts on Logz.io and Datadog, implemented monitoring and observability using Datadog and alarms ahead on critical system components failure
  • Use Kubernetes to manage container applications using its nodes, Config Maps, selectors, and Services & deploy application containers as Pods and managed Kubernetes charts using Helm, Created reproducible builds of the Kubernetes applications, managed Kubernetes manifest files, and managed Helm package releases
  • Use Terraform and Ansible to manage and provision infrastructure resources in an automated and scalable manner
  • Work with REST APIs for integration and automation of various services
  • Integrated Docker Mend agent for scanning Docker container base images and utilized Hubble tool for troubleshooting and monitoring of network and application performance
  • Implement Python automation scripts to streamline operational tasks
  • Configured and deployed network policies ingresses and http proxies to enforce Kubernetes cluster security and reduce vulnerability
  • Implemented and managed LinkerD service mesh for improved reliability and observability of microservices architecture
  • Created and deployed lambda function for application deployment and automation
  • Developed and implemented robust monitoring and observability solutions to track the SLIs and ensure that they meet the defined SLOs
  • On-call rotational service to capture and mitigate all after hours incidents
  • Conducted post-incident analysis and documenting the RCA to avoid future occurrence

DevOps Engineer / Cloud Engineer

E*TRADE Financial
09.2020 - 07.2022
  • ( remote )
  • Establishing a complete DevOps pipeline (Git – Jenkins – Ansible/Terraform – Docker - Kubernetes)
  • Developed and implemented a monthly cost reduction plan , enforcing CloudHealth recommendation and implementing spot.io instances in all lower environments
  • Implemented cluster services using Docker and Kubernetes to manage local deployments in Kubernetes by building a self-hosted Kubernetes cluster using Terraform and Ansible and deploying application containers
  • Involved in building and maintaining Highly Available secure multi-zone AWS cloud infrastructure utilizing Terraform, Ansible with AWS CloudFormation, and Jenkins for continuous integration
  • Worked in AWS environment, instrumental in utilizing Compute Services (EC2, ELB), Storage Services (S3, Elastic Block Storage), Elastic Beanstalk, VPC, SNS, IAM, and Cloud Watch
  • Used IAM to assign roles, and to create and manage AWS users, groups, and permissions to use AWS resources
  • Created and deployed lambda function for application deployment and automation and reduced cost of computing by 20%
  • Created EBS volumes to store persistent data and mitigate failure using snapshots
  • Performed Data backup of Amazon EBS volumes to S3 by taking point-in-time snapshots
  • Perform periodic maintenance of the Kubernetes cluster by draining the pod for Nodes upgrade while maintaining the cluster highly available
  • Building Aws infrastructure in the AWS cloud using VPC, EC2, IAM, Route53, S3, ELB
  • Automated configuration management and deployments using Ansible playbooks and YAML for resource declaration
  • And creating roles and updating Playbooks to provision servers by using Ansible
  • Automating the deployment of developer codes and applications using Git, Git hub and Jenkins
  • Designed and implemented cloud-based solutions using AWS services
  • Monitored and optimized AWS cloud infrastructure performance
  • Developed and maintain automation scripts to manage cloud infrastructure
  • Designed and implemented security measures to protect cloud infrastructure
  • Configured and maintained AWS services such as EC2, S3, RDS, and VPC
  • Troubleshooted and resolved AWS cloud infrastructure issues
  • Collaborated with other teams to ensure successful implementation of cloud-based solutions
  • Developed and maintained cost optimization strategies for AWS cloud infrastructure.

Sr System Engineer / SRE

Elsevier Technology INC
09.2018 - 08.2020
  • Home based Employee)
  • Manage DevOps CICD Pipeline by using various tools, like Git(for version control systems), Jenkins(for continuous integration), Maven( as a build tools), Ansible(for continues Development and configuration management ), Docker( for containerization), k8s(for container orchestration) Cloud like Aws ( for creating EC2, S3, IAM ,VPC, Security Group etc..)
  • Automate, perform upgrades, and Patching using Ansible
  • Install and configure operating systems, software, and hardware components, and leverage IT staff for routine tasks by clearly documenting design, maintenance, and support procedures
  • System performance tuning and basics in monitoring, analyzing system and application logs, vulnerability assessments
  • Design and implement CI/CD pipelines using tools such as Jenkins and GitLab CI/CD, reducing deployment time by 30%.Automate User/group administration, file/directory security, authentication and access management (SSH, Firewalls) using Ansible
  • Ensured that architecture and deployment models are sufficient to support SLA commitments
  • Conducted high-level root-cause analysis of service interruptions and establish preventive measures
  • Analyzed and interpreted system and application log files
  • Established a CI/CD framework through tight engagement with engineering team
  • Provide regular updates and recommendations to Sr Leadership regarding reliability and ed of the CI/CD environment
  • Configuring SElinux and Tripwire intrusion detection security software
  • Builded and performed Jenkins job for deployment nginx, tomcat, Jenkins
  • Set up Jenkins master, add the necessary plugins, and add more enslaved people to support scalability and agility
  • Collaborated with Development and production teams to ensure works smooth running of the pipeline
  • Monitor systems and diagnose, troubleshoot, and resolve hardware/software/application issues
  • Worked closely with Site reliability engineer team charger with configuring, monitoring, diagnosing, troubleshooting, and resolving incidents tickets on servers hardware issues.

Linux System Engineer/ DevOps

PANI TECH
02.2014 - 09.2018
  • Collaborated with Development and Production teams to ensure smooth running of the pipeline
  • Diagnosed application memory leaks, identify and fix issues related to SElinux, and identify library
  • Worked with Dev team, making modifications on the code using Git VCS to clone, add, commit and push codes from local and master branches to central repositories
  • Wrote Ansible playbooks and bash scripts to automate the deployment and management of software packages, services, firewall rules, file systems, storage systems, job scheduling, security, and systems resource monitoring
  • Worked with various flavor of linux os such as RedHat, Ubuntu, CentOS in large scale environment
  • Enabled branching strategies and managed Git repositories
  • Automated the deployment of developer codes and applications using Git, Git-Hub, and Jenkins
  • Build and managed AWS infrastructure in the AWS cloud EC2 instance, EBS, ELB, S3 Buckets, IAM, Route 53
  • Created various Bash Shell scripts for automation of repetitive and manuals task
  • Performed stress, sanity and penetration test to ensure the smooth running of application
  • Setup Kubernetes Deployment on-premise and on the Cloud ( AWS )
  • Automated the deployment of Docker images to Docker registry using Jenkins
  • Used Kubernetes to orchestrate the deployment, scaling, and management of Docker container
  • Administered Docker CI/CD pipelines to build, test and deploy code/applications
  • Troubleshoot and fixed networking and application performance issues
  • Installed, configured, and administered HAproxy load-balancer Apache web server
  • Automate and standardize server configurations with Ansible Playbooks, shell, or python scripts
  • Configure and maintain Jenkins to implement the CI process and integrate the tool with Maven to schedule the builds
  • Took sole responsibility for maintaining the CI server
  • Monitor the servers and Linux scripts regularly and perform troubleshooting steps like testing and installing the latest software on the server for end-users.

Education

Bachelor of Applied Science - Business Management

UNIVERSITY OF DOUALA
07.2010

GED - Mathematics And Computer Science

MANJO High School CAMEROON
06.2007

Skills

  • Version Control:
  • Git, GitHub, Gitlab
  • Automation/Deployment: Ansible, Terraform, Docker Swarm, CloudFormation, Lambda
  • Containerization: Docker, Kubernetes
  • Ticketing: Jira/Kanban, Remedy, Confluence, ServiceNow
  • Tanium Vulnerability
  • LAMP Stack, NGNIX
  • Coding: Bash, Python (intermediate)
  • Load Balancing: HAPROXY, Teaming, Bonding
  • CloudHealth, VMware
  • APM: New Relic, DataDog
  • Tools: K9s, Postman, MySQL workbench, Hubble, VScode, Linux, Ubuntu, service Mesh
  • Continuous Integration: Jenkins, GitlabCI
  • Continuous Monitoring: Nagios, Splunk, Datadog, New Relic
  • Database: MariaDB, MySQL
  • HTTP, DHCP, DNS protocol
  • Virtualization
  • Networking, TCP/IP, SFTP, SSH, UDP SAMBA, NFS
  • Micro environment monitoring: Prometheus, Grafana, Logzio
  • Cloud environment: AWS
  • BigPanda, ControlM
  • CloudFlare CDNetwork
  • Opsgenie

Certification

RED HAT CERTIFIED SYSTEM ADMINISTRATOR CERTIFIED KUBERNETES ADMINISTRATOR CERTIFIED TERRAFORM ASSOCIATE ( IN PROGRESS )

LinkeIn

https://www.linkedin.com/in/rodrigue-aime-ngongang-220835116

Timeline

Senior Site Reliability Engineer

TRADESTATION Technology
07.2022 - Current

DevOps Engineer / Cloud Engineer

E*TRADE Financial
09.2020 - 07.2022

Sr System Engineer / SRE

Elsevier Technology INC
09.2018 - 08.2020

Linux System Engineer/ DevOps

PANI TECH
02.2014 - 09.2018

Bachelor of Applied Science - Business Management

UNIVERSITY OF DOUALA

GED - Mathematics And Computer Science

MANJO High School CAMEROON
RODRIGUE AIME NGONGANG