Site Reliability Engineer
- Collaborated with development teams to troubleshoot and resolve production issues effectively.
- Developed documentation for operational procedures and system configurations.
- Participated in after-hours on-call support rotation to ensure system availability.
- Performed root cause analysis of production incidents, providing actionable recommendations.
- Implemented monitoring solutions to enhance system reliability and performance.
- Monitored critical infrastructure for performance and reliability issues.
- Assisted in automation script development to streamline processes and reduce manual tasks.
- Maintained infrastructure using cloud technologies and configuration management tools.