Dynamic and results-oriented Ops Engineer with 8 years of experience in IT. Specialized in, implementing, and optimizing CI/CD pipelines, infrastructure automation, and cloud orchestration. A quick learner who can easily fit in a competitive team, who's eager to break barriers and perform at a higher level.
- Providing leadership in responding and resolving major incidents that impact business critical services, applications and infrastructure for Oracle Cloud Infrastructure
- Leveraging a broad technical expertise to convene appropriate SMEs (resolvers) and to direct Major Incident response, with focus on impact mitigation and service restoration
- Working closely with Subject Matter Experts to quickly identify customer impact
- Conducting escalations to service teams, senior management and leaders to ensure appropriate awareness, engagement and focus
- Producing accurate and timely communications tailored to the relevant audience
- Leading and/or participating in Post Incident Review and Problem Management meetings with key stakeholders and service owners to review events and opportunities for ongoing improvement
- Documenting pertinent information relating to Incidents that aids process improvement, identifies deviations and enables the creation of an Incident Knowledge Base
- Monitoring and evaluating high-level service and infrastructure dashboards and takes action to address identified anomalies
- Collating and analyzing incident based data for team metrics and KPIs
- Identifying opportunities and taking ownership for automation and/or continuous improvement of Incident Management process steps and best practices
- Proactively engaging with Service teams to identify and evaluate gaps in operational capabilities and improvements to support Cloud scalability and resiliency
- Representing Incident Management at relevant software team Roadmap planning and backlog reviews, influencing the prioritization of automation and tooling enhancements
- Working as part of the Major Incident Management team to ensure that the performance of the team achieves the defined performance targets and KPIs