Summary
Overview
Work History
Education
Skills
Accomplishments
Certification
Languages
Training
Work Availability
Timeline
Sakul Koirala

Sakul Koirala

Pflugerville,TX

Summary

Results-oriented Unix/Linux/Middleware-based SRE with over 11 years of experience in server support, cloud computing, and application deployment. Proficient in working with various flavors of Linux, AIX and Windows. Skilled in containerization technologies such as Docker and Kubernetes. Proven track record of managing complex infrastructure and delivering highly available systems. . Offering excellent communication skills and proven to collaborate cross-functionally with IT and project teams to execute wide-scale system upgrades.

Overview

11
11
years of professional experience
1
1
Certification

Work History

Unix/Linux System Engineer/SRE

General Motors
06.2014 - Current
  • Managed Unix/Linux servers in hybrid cloud environment, overseeing virtual machines and physical hosts in multiple data centers.
  • Provided 2nd and 3rd level technical /operational support, managing critical and severity-based incidents, while actively contributing to major project initiatives.
  • Collaborated cross-functionally with developers, product managers, and SREs to ensure uninterrupted system performance and effective incident management
  • Maintained highly available and secure infrastructure by adhering to site reliability principles, leveraging continuous monitoring and alerting, and promoting global/offshore coworker communication and collaboration
  • Possess expertise in networking, Linux internals, web architecture, and related topics, with specific emphasis on Linux filesystems, applications and database support,
  • Wrote and maintained custom scripts to increase system efficiency and performance time.
  • Developed automation tools for application deployment using Ansible and Chef, and created Docker images for middleware applications, deploying them with Kubernetes
  • Proactively monitored systems to minimize incidents, and performed root cause analysis for production issues, resolving critical incidents
  • Established monitoring and alerting systems using Nagios, as well as Oracle Management Cloud and Google Cloud Operations suite
  • Managed and remediated escalated customer issues, collaborating with cross-functional teams, and continuously supported different incidents/changes to deliver satisfactory resolutions, following project management best practices.
  • Worked closely with customers, internal staff and other stakeholders to determine planning, implementation and integration of system-oriented projects.
  • Monitored and tested application performance to identify potential bottlenecks, develop solutions, and collaborate with developers on solution implementation.
  • Installed, configured, tested and maintained operating systems, application software and system management tools.
  • Performed duties in accordance with applicable standards, policies and regulatory guidelines to promote safe working environment.
  • Gained strong leadership skills by managing projects from start to finish
  • Participated in system development life cycle from requirements analysis through system implementation

Unix/Linux System Administrator

IBM
02.2012 - 06.2014
  • Expertly installed and configured Solaris 9/10/11, AIX 5.3 6.1, 7.1and RHEL 4 and 5 to meet project requirements.
  • Created file systems using Veritas Volume Manager, Solaris Volume Manager, and Logical Volume Manager to optimize system performance
  • Managed SAN storage and coordinated with SAN teams to upgrade Emulex cards and firmware, troubleshoot fabric issues, and ensure systems are up to date
  • Reviewed High and Critical Incident Tickets and provided comprehensive Root Cause Analysis (RCA) reports as necessary
  • Monitored system log files for errors, configured cron jobs for backups and process monitoring, and efficiently created and managed user accounts
  • Proactively implemented OS patches and implemented Secure Shell (SSH) for enhanced system security
  • Utilized Solaris Containers (zones), Logical Domains, and Virtual I/O for effective server virtualization.
  • Managed backup and disaster recovery through strict data control and retention policies, personally handling recovery tasks when issues arose
  • Analyzed network traffic and performance metrics to optimize system performance
  • Installed system-wide hardware components, confirming interoperation and compatibility with Linux-based software distros
  • Preserved system documentation accuracy via regular data updates and graphical refreshes
  • Designed disaster recovery systems, enabling continuity in event of power outages
  • Installed and configured network printers and other peripheral devices
  • Implemented, developed and tested installation and update of file servers, print servers and application servers

Education

Bachelor's Degree -

Tribhuwan University, Kathmandu, Nepal
02.2004

Skills

  • Operating systems: Windows, Linux (Red Hat, CentOS, Ubuntu, Debian), AIX, Solaris
  • Hardware : DELL - Poweredge/RX Series, IBM Power Series, HP-Proliant, SPARC, SPARC STATION (10,20), M-Series and T-Series servers, EMC Storage Arrays, SAN, 3par, xio, vmax, vnx, NAS-isilon, Cisco, Brocade, Peripheral devices ( Printers and modems)
  • Cloud computing: Oracle, AWS, Microsoft Azure, Google Cloud Platform
  • Middleware applications: WebSphere, JBoss, Tomcat, Apache, NGINX, Microsoft Viva,
  • Containerization: Docker, Kubernetes
  • Automation tools: Ansible, Chef, Jenkins, Git, Docker
  • Programming Language: Bash, Python, Ruby, Java
  • Monitoring and alerting: Nagios, ELK stack, OracleWatch
  • Cluster: AIX HACMP, Sun Cluster, HP-UX Cluster, VERITAS Cluster Server, Red Hat Cluster Server,
  • Production Work, SDLC, CICD Pipeline
  • Web Applications, Agile Methodology,
  • Training Junior Team Members

Accomplishments

    Implemented robust security measures and disaster recovery plans to protect company data and ensure business continuity. Conducted security audits and vulnerability assessments, implemented encryption and access controls, and developed and tested disaster recovery procedures. As a result, the company achieved compliance with industry regulations and significantly reduced risk of data loss or system downtime in the event of a security breach or disaster.


    Streamlined system administration processes to increase efficiency and reduce downtime. Developed and implemented new system monitoring tools and automated routine tasks, resulting in a 30% reduction in system downtime and freeing up time for more strategic projects.


    Successfully planned and executed a complex system migration project from physical servers to a virtualized environment. Led a cross-functional team to ensure seamless migration of critical applications and data while minimizing disruption to users. Project was completed on time and under budget.


Certification

Oracle Solaris Certified System Administrator

Languages

English
Full Professional
Hindi
Full Professional
Nepali
Native or Bilingual

Training

AWS

Microsoft Azure

K8s

Ansible

Red Hat Linux

AIX HACMP, VIOs

SRE/DEVOPS

Work Availability

monday
tuesday
wednesday
thursday
friday
saturday
sunday
morning
afternoon
evening
swipe to browse

Timeline

Unix/Linux System Engineer/SRE - General Motors
06.2014 - Current
Unix/Linux System Administrator - IBM
02.2012 - 06.2014
Tribhuwan University - Bachelor's Degree,
Sakul Koirala