Highly experienced Senior Site Reliability Engineer with over 8+ years of expertise in designing, deploying, and maintaining large-scale, highly available systems. Proven track record of leading cross-functional teams, driving strategic initiatives, and enhancing system reliability and performance. Seeking to leverage deep technical skills and leadership experience in a Principal Site Reliability Engineer role
• Complete migration from Bamboo/Bitbucket/Ansible to Terraform/GitHub.
• Develop and implement highly available and scalable architecture solutions both in the AWS cloud environment as well as on premise data centers.
• Script cron jobs to audit and clean out the underlying application systems. I.E: Log rotations and stale user directory clean-ups.
• Determine KPI's to develop a maturity model of the development teams. This project is written in Node.js and Mongo dB for the backend with React/Redux for the front end
• Create synthetic tests to mock user experience that alert and respond once failing a certain threshold.
• Analyze AppDynamics data to identify underperforming applications
• Developing automation tools and frameworks for 5G/LTE equipment.
• Developing, formulation, executing on/off target test plans for production quality code
• Continuous integration of software developed for subsystems in access stratum software.
• Work with test vendor equipment, to develop on-target tests; for specific scenarios, to ensure test coverage.
• Contribute to the automation efforts of running different tests on modem chipset and collect logs/results for further improvement.
• Developed a script that automatically sends out compiler warning, in each module, to relevant team leads.
• Developing framework to continuously monitor port issues using Python and C.
• Designing and developing a script that can automate the debugging for nearly 2000 end users and provide bug report.
• Developing Automated test scripts in Python to test the validator module on different buildplatforms.
• Implementing Diagnostics- dump Infra to collect physical interfaces debug information.
• Developed E-commerce site with AWS services, which supports multiple business units. Responsibilities included cloud setup and resolving tickets on daily basis within SLA.
• Deployed AWS Solutions using EC2, S3, EBS, ELB, auto scaling, Security groups, Cloud Formation, ECS, IAM, andRoute53.
• Created Virtual Private Cloud (VPC) with subnets and groups for servers and security groups to associate with the networks.
• Developed API for using AWS Lambda to manage the servers and run the code in AWS. Monitoring the server performance, CPU Utilization and disk usage using Cloud Watch and raising alarm in case of emergency.
Certified Oracle Cloud Infrastructure Architect-Associate
Certified Oracle AI Infrastructure Foundations Associate
Certified Oracle Foundations Associate