Gebah Sandrine

Dallas, Texas

Summary

Innovative DevOps and Site Reliability Engineer with 8 years of experience in deploying, automating, and managing high-performance infrastructure and applications. A strategic thinker with strong problem-solving skills and an unwavering commitment to driving operational excellence.

Overview

8 years of professional experience
1 Certification

Work History

DevOps Engineer/Site Reliability

Presidio.inc
06.2018 - Current
  • Designed and implemented scalable, highly available, and fault-tolerant systems on AWS and Azure
  • Deployed, managed, and scaled containerized applications on Kubernetes and EKS
  • Defined architectural standards and technologies to enhance the eCommerce platform
  • Implemented build and deployment processes with CI/CD tools such as Git, Jenkins, and Maven, increasing efficiency and productivity by 80% and resulting in improved service reliability and 99.9% uptime
  • Developed and enforced service-level objectives (SLOs) and service-level agreements (SLAs) to ensure alignment with business goals and customer expectations
  • Automated system configurations using Ansible and Terraform, ensuring consistency, reliability, and performance across diverse environments
  • Improved incident management processes by 40%, streamlining communication and collaboration between teams and reducing mean time to resolution (MTTR)
  • Monitored and analyzed system performance using tools such as Prometheus, Datadog, Nagios, and Grafana, ensuring proactive issue detection and resolution
  • Led adoption of chaos engineering practices, enhancing resilience and reliability of critical systems and services
  • Designed and automated database management processes, including backup, recovery, and testing
  • Worked closely with development teams to improve build, test, and deployment automation workflows, reducing bottlenecks in delivery pipelines
  • Tested troubleshooting methods and documented resolutions in the knowledge base for support team use
  • Mentored and trained team members in DevOps and SRE best practices, fostering a culture of continuous improvement and innovation
  • Provided 24/7 on-call support for critical systems, ensuring high availability and rapid issue resolution
  • Monitored automated build and continuous integration processes to drive resolution of build and release failures

System Engineer

Central Reach
01.2016 - 05.2018
  • Implemented robust monitoring and logging solutions, enabling proactive identification and resolution of issues
  • Led the migration of on-premises infrastructure to a hybrid cloud environment, achieving a 15% reduction in IT costs
  • Designed and executed disaster recovery and business continuity plans, ensuring resilience and data integrity
  • Collaborated with cross-functional teams to improve application architecture, performance, and scalability
  • Initiated system security procedures to close all unsecured ports
  • Monitored and administered servers, handling day-to-day problems, patches, user administration, hardware failures, log file monitoring, backups, software upgrades, configuration changes, and documentation
  • Provided production support for multiple applications running on Linux and Windows
  • Monitored system performance and troubleshot server issues
  • Implemented automated monitoring and alerting solutions, reducing incident response time by 30%
  • Improved system reliability through monitoring and continuous system improvement, resulting in a 40% reduction in system downtime
  • Developed custom scripts to automate routine tasks, increasing overall productivity
  • Completed software updates and assessed security patches for optimized computer use

Education

Bachelor’s Degree - Business Finance & Management

Cameroon University

Skills

  • Version Control: Git, GitHub, GitLab, Bitbucket, GitHub Actions
  • Monitoring: New Relic, Datadog, ELK Stack, CloudWatch, Prometheus/Grafana, Splunk
  • CI Tools: Jenkins, Azure DevOps, AWS CodePipeline
  • CD Tools: Kubernetes, Docker, GitLab CI/CD, AWS CodeDeploy, Ansible, Argo CD
  • Build Tools: Maven, Gradle
  • Scripting Languages: PowerShell, Python, Bash
  • Quality Assurance and Testing: Selenium, JUnit, pytest, SonarQube
  • AWS CodePipeline: CodeBuild, CodeCommit, CodeDeploy
  • Configuration Management: Ansible, Terraform
  • Operating Systems: Linux, Windows
  • SaaS: Azure DevOps
  • Container Orchestration: Kubernetes, OpenShift, Docker, AKS, RKE, EKS
  • Networking and Security: TCP/IP, VPN, Firewalls, Cloud Security
  • Cloud: AWS, GCP, Azure
  • Databases: MSSQL, Postgres

Certification

  • AWS Certified Solutions Architect
  • Certified Kubernetes Administrator (CKA)
  • HashiCorp Certified: Terraform Associate
