Dynamic Site Reliability Engineer with extensive expertise in incident response, cloud infrastructure, and automation for mission-critical systems. Skilled in leveraging AWS technologies (RDS, Aurora, DynamoDB), Terraform, and monitoring tools like Splunk and New Relic to enhance system reliability, scalability, and compliance. Delivered over 350 CORB jobs for large-scale ETL workflows and authored SOPs that reduced new-hire ramp-up time by 66%, while supporting 24/7 on-call rotations to swiftly resolve critical incidents. Proficient in Python and Bash scripting, with a strong track record of cross-team collaboration to develop robust, observable, and efficient systems.