Raj Simbili

Nashville, TN

Summary

To leverage my passion for continuous learning and adaptability in a challenging role where I can align with organizational goals, swiftly become a reliable subject matter expert, and contribute to the long-term success of the team and organization.

  • As a Site Reliability Engineer (SRE), ensure optimal performance and availability of systems through continuous monitoring and troubleshooting.
  • Collaborate on developing and maintaining robust automation and configuration management tools, and design and implement efficient system architectures and infrastructure. Create and maintain comprehensive system health and performance metrics.
  • Manage and execute yearly disaster recovery plans, and ensure thorough system documentation and knowledge transfer.
  • Develop and maintain sophisticated monitoring and alerting tools, and engage in performance tuning and optimization for system integrity and reliability.
  • IT experience in Healthcare, Telecom, Insurance, ISP sectors, with integrated heterogeneous on-prem environments and hybrid public cloud environments.
  • Take ownership and be accountable, foster a diverse, collaborative and inspiring workplace, work as one team across the firm, stay curious and look to improve together every day.
  • Expertise in SDLC, leading through all stages (requirements, design, build, and support) of IT solutions in on-premise and hybrid / cloud environments.

Overview

25 years of professional experience
1 Certification

Work History

Consulting Application Engineer

HCA
10.2023 - Current
  • As a Site Reliability Engineer (SRE), ensure optimal performance and availability of systems through continuous monitoring and troubleshooting
  • Collaborate on developing and maintaining robust automation and configuration management tools, and design and implement efficient system architectures and infrastructure
  • Create and support comprehensive system health and performance metrics
  • Manage and execute yearly disaster recovery plans, and ensure thorough system documentation and knowledge transfer
  • Develop and maintain sophisticated monitoring and alerting tools, and engage in performance tuning and optimization for system integrity and reliability
  • IT experience in Healthcare, Telecom, Insurance, ISP sectors, with integrated heterogeneous on-prem environments and hybrid public cloud environments
  • Take ownership and be accountable, foster a diverse, collaborative and inspiring workplace, work as one team across the firm, stay curious and look to improve together every day
  • Expertise in SDLC, leading through all stages (requirements, design, build, and support) of IT solutions in hybrid / cloud environments.
  • Managed 20+ on-premise application environments, 14 cloud tenants.

SRE Reliability Engineering - Sr. Systems Engineer

HCA
05.2014 - 10.2023
  • Responsible for building and supporting Infor ERP environments across Dev, QA, Perf, UAT, and Prod
  • Modules involved are Lawson System Foundation, Landmark IPA, EMSS, and ISS-Federation/Sync
  • Evaluate performance of AIX LPARs and Linux virtual guests for capacity planning
  • Troubleshoot environment and platform issues such as slow server response, I/O wait, and runaway processes on servers
  • Support Infor cloud solutions for GHR and Financials; part of this solution involves on-premise setup of the Infor Enterprise Connector and a gateway server hosting EMSS and S3 / IPA PDL.

Unix /Linux Systems Specialist

CHS Corporate
10.2011 - 05.2014
  • As systems specialist, responsible for design, implementation, and support of the infrastructure platform for CERNER, McKesson, and Health Tech
  • Solution comprises AIX Unix and Linux nodes, plus installation and support of QAS, Tivoli, and NimSoft tools
  • Engineer SAN space and network requirements for each project; work with vendors on procurement quotes for new hardware and software, and with the data center team to install the physical hardware and cabling for network, SAN, and console access
  • Work with vendors to prepare infrastructure for code upgrade scenarios.

Unix / Linux Systems Monitoring Implementation

TennCare
07.2011 - 10.2011
  • Design and deploy the HP SiteScope monitoring solution to all production Solaris servers.

Sr. Deployment Engineer / Tech Lead for Systems / Network

Nokia Siemens Network
09.2010 - 04.2011
  • NSN provides infrastructure expansion / implement / support services to major cell phone companies in USA
  • This project in particular deals with subscriber capacity expansion from 40 million to 60 million for one of the leading service providers in the USA
  • As deployment engineer, responsible for the engineering planning folder and the technical and requirements documents
  • Lead order placements for the hardware through Oracle; coordinate racking and hardware installation effort through data center installers
  • Server OS installation, kernel tuning, PAM, LVM, vSphere, NFS, LDAP, HA setup, patch analysis and upgrade, package analysis and upgrade
  • Network setup OAM and CORE, configuring routes, configuring interfaces on switches
  • Evaluate new products including open source and relevant solutions
  • Coordinate with support teams to trace root cause, resolution of application or server level issues...

Sr. Sys Admin RIS– Automation / Network

State Farm
08.2005 - 08.2010
  • RIS-Automation supports HP-OVO, HP-NNM, MS-SCOM, NAGIOS, CA-Ehealth, HP-Service Mgr., HP-Service Desk, the tools tied with custom integration code used to monitor, benchmark, and control very large heterogeneous Prod, Simprod, Dev networks
  • This includes servers (Windows, UNIX, Red Hat, VMware, mainframes) and network devices
  • The entire setup adheres to the ITIL model
  • As Technical Analyst, responsible for hardware design, installing and setting up new servers, integrating with LDAP, volume manager setup, backup, cluster services, monitoring, and troubleshooting
  • Coordinate with SAN team for ILO and LUN, dual channel setup
  • Patching servers, upgrade hardware and software during maintenance window
  • Installation, configuration, and troubleshooting of servers and clients for HP-OVO 8.X and 7.X on HP-UX, IBM AIX, Red Hat Linux 3.X, Solaris 9 and 10, and Windows 2K and 2K3 servers
  • Worked with applications supported by the automation team, this includes configuring, scripting, and troubleshooting
  • The applications supported include Service Management tools that facilitate monitoring, Service Management Process (Incident, Change, Problem, Service Level, and Configuration Management), and network performance monitoring
  • Designing and coding of monitoring templates, Perl scripts, and CGI scripts
  • Support and troubleshoot router / switch issues, coordinating with AT&T for data / VoIP circuit restoration
  • Upgrading IOS and configuration on nodes through an automated process
  • Providing performance / capacity reports through Ehealth
  • Document a skills training plan as a reference for training newcomers to the team.

System Administrator of OFA Production Support Group

Perot Systems Inc.
07.1999 - 01.2000
  • Responsible for installation, maintenance, and administration of E4500, E450, and HP N and K series servers and their related hardware peripherals
  • Installation and upgrade of OS and third-party software; user and file system management; backup and restore using VERITAS NetBackup and Legato NetWorker.

Project Team Member of Data vault/ UNIX Systems Administrator

Sun
03.2000 - 01.2002
  • Exodus Communications is a client of Sun Microsystems; Sun offers professional services for backups and restores for Exodus customers
  • Responsible for installation and administration of Sun and HP servers and workstations functioning as database, compiler, file, NFS, DNS, web, and various other application servers
  • Configuring client machines for Data vault backups and restores, troubleshooting failed backups and restores, coding scripts for MS SQL database hot backups using NetBackup 3.2, and troubleshooting failed hot backups
  • Coordinate with Arcus (data storage company), NCC (Exodus), VERITAS, and Sun Microsystems to resolve customer-specific problems.

Member of 93 PP Move Project Team / Sys Admin

Edward Jones
07.2002 - 02.2003
  • This project aimed to relocate all DCS UNIX Production / Development / Clone machines from the existing data center to a new data center
  • Responsible for identifying each machine's characteristics, the applications running on it, and any dependencies
  • Coordinate with availability team to plan for the move
  • Verify availability of facilities, new IPs, new hardware, and network connectivity
  • Installing Shark racks for E4500 / E4000 and preparing the racks.

Project Event Manager of Server Operations

AT&T
02.2002 - 02.2003
  • This project aimed to maintain the production servers for AT&T Broad Band and World Net, which deliver ISP cable modem services such as email and Internet access to customers
  • Responsible for supporting the production servers hosting SAS, DHCP, DNS, Customer Reports, Imail, Web and News, Instant Messaging and chat, and Web publishing applications
  • Support includes monitoring alarms on all production servers using Remedy User and resolving the alarms based on severity.

Sys Admin for INMS PS Team

AT&T
02.2003 - 07.2005
  • Integrated Network Management System (INMS) provides real-time network monitoring capability for operations across various platforms within AT&T (Dial Platform, Common Backbone, VOIP, and World Net)
  • The systems within INMS capture, filter, threshold, correlate and present all alarms from these platforms
  • As INMS PST member, responsible for developing infrastructure requirements and configuration designs, and translating the requirements into an implementation plan
  • Developing policies and procedures
  • Install and setup new servers, New Accounts, New IP addresses, Trouble Reports, Application Enhancement and Day-to-Day server and user administration
  • Maintain and support 2 mail servers that act as relay servers within the AT&T network
  • Security Policy Definition, Security Audit on Server OS, application through Axent reports, Port Scanner
  • Performance Tuning, Capacity Planning, Defining Backup & Restore Policy.
  • Trained new staff in front-of-house procedures, customer relations, and cleaning.

Education

Master’s in Business Administration - Finance & Systems

SKIM
India
05.1997

B.S. in Engineering - Industrial & Production

SIT
India
05.1994

Skills

  • Infrastructure Automation, Scripting languages proficiency
  • Load Balancing Techniques, API Integration
  • Network Administration, System Administration, Performance Tuning
  • ITIL framework knowledge, Containerization Technologies
  • Virtualization Technologies, Configuration Management
  • Disaster Recovery Planning, Continuous integration tools
  • Problem-Solving, Customer-Oriented, Testing and debugging, API design knowledge
  • Technical consulting, Cloud Computing, Technical Writing, DevOps principles
  • Virtualization, Continuous integration systems
  • Critical Thinking, Organizational Skills

Certification

  • Certified DevOps Practitioner – Digital Badge – GIT, Nexus, Docker, Jenkins, Container Orchestration, Terraform, Prometheus
  • Site Reliability Engineering – Foundations
  • GCP Cloud Fundamentals: Core Infrastructure, Data Engineering, Architecting – Compute Engine, Design & process
  • Infor IPA, S3, BIRST, Cloud GHR, Cloud FSM
  • GCP – Certified Associate Cloud Engineer

Technical Expertise

Cloud Expertise

  • Public Cloud: Google Cloud, Microsoft Azure, Amazon Web Services, DigitalOcean, Linode
  • Virtualization: Docker, Kubernetes, Amazon EC2, Google Compute Engine
  • Infrastructure as Code:
      Provisioning: Terraform
      Configuration Management: Ansible, Puppet, Chef

  • Automation with Python, Perl, Korn, Bourne
  • Monitoring & Observability: Prometheus, Grafana, ELK Stack, GCP Stackdriver
  • Build Automation: CI/CD Pipelines
  • Containers: Docker
  • Container Orchestration: Kubernetes, GKE, EKS
  • Artifact Repo Manager: Sonatype Nexus, JFrog-Artifactory
  • Version control: Git, Jenkins, ADO, GitLab, GitHub, Bitbucket, Subversion (SVN)
  • Build & Package: Gradle, npm, Maven
  • Database System: MySQL, Cloud SQL, Cloud Spanner, Bigtable
  • Web and Application Server: Nginx, Mule ESB, Tomcat, JBoss, WebSphere

OnPrem Expertise

Software & Hardware:

Operating Systems:

IBM AIX 7.X, 6.X, RHEL 6.X, 5.X, HP-UX 10.X, 11.X, Solaris 10, Cisco IOS, Windows 2K8.

Virtualization:

IBM VIOS, Pure Flex, VMware, CISCO UCS B230

Scripting Languages:

Perl, Bourne shell, Korn shell, JavaScript, HTML.

Web Services:

Apache Tomcat, WebSphere, IBM MQ, RabbitMQ, JBoss, Sun ONE.

Database:

Oracle, DB2, MySQL

Servers:

IBM P Series (P770, P740, P710), FSM, Pure Flex P270, HMC, X series, VMware Virtualization, SUN Fire 12K, 15K, 280R, V440, HP-900/800 RP5470, N, K, A, D, J, B series.
