Summary
Overview
Work History
Education
Skills
Websites
Training
Awards
Accomplishments
Certification
Timeline
Generic

Banani Rath

Senior Engineer
Bellevue,WA

Summary

Innovative and solutions-driven Site Reliability Engineer (SRE) with over 10 years of experience in architectural design and cloud infrastructure. Specializing in building and optimizing scalable and resilient cloud architectures using Azure Well-Architected Framework principles. Proven ability to leverage Generative AI (GenAI) to automate processes, drive operational efficiency, and deliver innovative, scalable solutions. Expertise in Azure services, microservices architecture, and containerization technologies such as Kubernetes and Docker. Adept at improving system performance, ensuring high availability, and collaborating across teams to meet business goals while delivering exceptional customer experiences.

Overview

22
22
years of professional experience
4
4
years of post-secondary education
1
1
Certification

Work History

Site Reliability Engineer

Microsoft
Redmond, WA
09.2018 - Current
  • Designed and Delivered High-Impact Proof of Concepts (POCs): Created and showcased innovative POCs to key partners, successfully driving new business opportunities and expanding collaboration, ultimately strengthening team influence and positioning as a trusted partner in strategic initiatives
  • Architected and optimized large-scale systems using Azure Well-Architected Framework principles, ensuring best practices in reliability, cost optimization, operational excellence, performance efficiency, and security
  • Led the design and implementation of Generative AI (GenAI) models to create predictive analytics tools for performance monitoring, proactively addressing system bottlenecks and improving service reliability by 20%
  • Automated complex deployment processes using Kubernetes and Docker, streamlining continuous integration/continuous deployment (CI/CD) pipelines and reducing operational overhead by 30%
  • Collaborated with development and operations teams to design resilient cloud architectures, ensuring disaster recovery and high availability through the use of Azure services, including Azure Service Fabric, Azure SQL, and Blob Storage
  • Utilized AI-driven automation for self-healing infrastructure, improving mean time to resolution (MTTR) and increasing overall system uptime
  • Designed and implemented end-to-end monitoring solutions using Azure Monitor and Azure App Insights, providing actionable insights into application performance and system health
  • Drove continuous improvement initiatives through root cause analysis (RCA), developing engineered solutions that enhanced system stability and minimized downtime
  • Leveraged Python to automate infrastructure processes, including building custom scripts for configuration management, monitoring, and deployment in cloud environments
  • Designed and implemented Generative AI (GenAI) models using Python libraries to develop scalable AI-driven solutions for system health monitoring and predictive analytics, reducing downtime by 20%
  • Automated deployment pipelines using Python-based tools, significantly improving code deployment cycles and enhancing system efficiency
  • Aircapi Isolation & Release Freshness Improvement: Boosted Aircapi isolation from 48% to 95% by enhancing data collection processes, updating dashboards, educating teams on best practices, and resolving data inconsistencies
  • Led S360 work item management to ensure comprehensive coverage on resource isolation
  • Parity in Code and Release Freshness: Initiated and implemented a comprehensive exploration of code and release parity across platforms (Ev2, Engpipe, 1es, AzureBridge) to ensure up-to-date information and improved data handling
  • Proposed and drove a feature to enhance data freshness and reduce manual handling, fostering efficiency and collaboration
  • TSG Creation and Content Enrichment: Spearheaded the creation and enrichment of troubleshooting guides (TSGs), integrating AGC, EV2 deployment, CSS, and PIR data for enhanced troubleshooting accuracy
  • Built POCs and deployed improvements in production, optimizing content relevancy and effectiveness for DRIs
  • Responsible AI System Assessment: Conducted system-level information assessments for all AI systems, ensuring alignment with Responsible AI standards
  • Completed thorough evaluations to support transparency, accountability, and ethical compliance across AI operations
  • Security Enhancements under SFI: Prioritized security by implementing secure practices, including disabling public access to Azure ML, enforcing VNet integration, transitioning from key-based to Azure Entra authentication, and reducing overly permissive access in Azure DevOps
  • Developed and Deployed CosmosDB Resiliency Scorer: Spearheaded the introduction of an AI/ML-driven resilience scoring model for CosmosDB, positioning it as a critical reliability guardrail across Azure resources
  • Engineered a scalable, real-time prediction endpoint using binary classification to assess resilience across C+E CosmosDB instances, with seamless integration into Azure Functions and pipelines
  • Led the project from hack concept to productionization, completing 70-80% of tasks for handoff, ensuring a strong foundation for future AI advancements in Azure's reliability ecosystem
  • Evaluated new technologies and tools to enhance overall system performance, stability, and security.
  • Developed custom scripts/tools as needed to automate routine tasks, increasing overall team productivity and efficiency.

Site Reliability Engineer in MSDN Group(Microsoft)

PamTen
Redmond, WA
12.2015 - 09.2018
  • Manage on-Premise SQL Servers, Cloud and live site doc.microsoft.com site
  • Provision and manage the SQL Server Environment
  • Setup and Manage Always-on in SQL Server Environment
  • Resolve day-to-day database and application issues
  • Plan, implement all types of Backup (Full, Differential, Transactional) and Restore Operation as per business needs
  • Plan & Migrate SQL server from on-premise to Azure environment
  • Manage databases in azure and fine tune queries in SQL Azure
  • Setup and Manage replication from On premise to SQL Azure
  • Configure and manage daily and weekly maintenance jobs
  • Troubleshoot performance issues in Azure, Open Publishing, ClearDB issues
  • Manage keys and blob storage through storage explorer
  • Support applications hosted in WordPress
  • 24
  • 7 on call support and responsible for handling issues
  • Create Appinsight alert for Azure services
  • Troubleshooting issues with Builds in TFS and cloud
  • Ensure KPIs and SLAs are met as per contract
  • Checking Web Services and fix any site un-availability
  • Participate in MI & BIA and work on RCA and share with stakeholders
  • Certificate management for web servers, resolve webserver issues
  • Debug issue using various troubleshooting tools (Perfmon, Netmon, HTTP Watch, Fiddler, Microsoft service tracer, Log parser)
  • Work on maintenance activities like security patches, upgrade, hotfixes
  • Involved in cross platform calls in resolving Major Incidents
  • Attend daily sync up calls with the offshore team about the project status and hand off
  • Prepare weekly status report and share with management team
  • Co-ordinate with onsite and offshore team for completing pending tasks and for a smooth handoff
  • Apply service packs, Hotfix/monthly deployment on regular basis
  • Generating PowerBI Reports
  • SQL analytics to develop reports for customer facing
  • Generation SSRS and PowerBI reports using SQL Analytics
  • MDS and Azure security pack onboarding
  • Create and deploy worker role through VS, working in DevOps environment

Senior DBA(Microsoft)

PamTen
Redmond, Washington
05.2014 - 05.2015
  • Support a mission critical application in a heterogeneous environment which involves different technologies like SQL 2012, OLAP, SSRS, IIS and VSTF in high availability environment
  • Monitor SQL Server performance for excessive blocking, locking, and inefficient stored procedures and queries
  • Help developers rewrite procedures and modify database schema as necessary to resolve application bottlenecks
  • Installing and configuring .NET applications on IIS 8.0/7.5/6.0 Web servers on Windows 2012/2008 R2/2003 Servers
  • Migrating applications from IIS 6.0 to 7.5 as well as II6.0 to IIS 7.0
  • Deploying and managing applications in Datacenter, Virtual environment and Azure platform as well
  • Working on providing security by configuring SSL certificates as well as authentication techniques
  • Working on creating CSR (Certificate sign in request) and communicate with CA (Certificate authority) to get new certificates and configure them on web servers to provide security to internet facing web applications
  • Performance tuning, Index maintenance, Regular backup
  • Participate and deploy releases/hotfixes (Monthly and Quarterly)
  • Deploy ASP.NET and Web applications in IIS server
  • Environment Setup for new applications, onboarding projects on VSTF
  • Troubleshooting applications and resolve issues on day to day basis
  • Apply Patches and releases in production environment
  • Powershell scripting for install/uninstall IIS and password update
  • Install and Upgrade TFS and Visual studio
  • Resolve OLAP issues, Cube issues, Rebuild cubes and Warehouse from TFS console
  • Configuring SSRS and working with SQL server reporting server on day to day basis
  • Work on SCOM alerts and Issues
  • Certificate import and export and binding for web server, resolve IIS issues
  • Debug issue using various troubleshooting tools (Perfmon, Netmon, HTTP Watch, Fiddler, Microsoft service tracer, Log parser)
  • Work on maintenance activities like security patches, upgrade, hotfixes
  • Involved in cross platform calls in resolving Major Incidents
  • Attend daily sync up calls with the offshore team about the project status and hand off
  • Provide extensive project and developer support
  • Work with ISM/GCO/ServerIM/NETIM/DPSIM team for infrastructure issues
  • Support and participate in the on call rotation for maintenance and troubleshooting of IIS websites and windows servers

Service Engineer (Production & Pre-production Support)

PamTen
Redmond, WA
11.2013 - 05.2014
  • Manage multiple SQL 2012 and 2008 database servers in production and pre-production environments with databases large size DBs
  • The environment is high volume 24x7 OLTP with Microsoft SQL Server Always-On and database Mirroring
  • Monitor and Resolve tickets based on the priority
  • Environment setup/refresh for monthly and quarterly releases
  • Deploy Releases/hotfixes (quarterly, monthly): Deployment to Prod and Preprod environment
  • Configure Always On in all environments as part of environment refresh
  • Apply hotfixes & monthly security patching
  • Powershell scripting for user logins /deployment
  • Work with ISM/GCO/ServerIM team for infrastructure issues
  • Verify backups and error logs on the servers and troubleshoots any failures or alerts
  • Fix any database issues promptly
  • Monitor & Maintaining SQL Alerts and Jobs and act accordingly
  • Maintenance of Database like Indexing, Shrinking the Logs/data files of databases
  • Provide support of maintenance plans (including DBCC), backup, recovery, and refresh of development SQL Server databases and database capacity planning
  • Work with Dev, test and PMs for project related issues and releases
  • Checking Web Services and fix any site un-availability
  • Participate in MI & BIA and work on RCA and share with stakeholders
  • Prepare weekly status report and share with management team
  • Co-ordinate with onsite and offshore team for completing pending tasks and for a smooth handoff

SQL DBA, Siebel Apps DBA, SQL Developer (SSIS)

Patni Computer Systems Limited
, Alaska
08.2007 - 04.2008
  • Maintain SQL Servers and Databases
  • Migrating data from DB2 to SQL Server 2005
  • Develop SSIS packages
  • Performance monitoring, Space Management, Schedule Jobs
  • Create database maintenance plan
  • Shrinking the Logs/data files of databases
  • Index rebuilding on regular basis
  • Creation of logins, manage space, verify backups, error logs on the servers and troubleshoots any failures or alerts
  • Fix any database issues promptly
  • Monitoring & Maintaining SQL Alerts and Jobs
  • Support Web applications
  • Work with Dev, Test, PMs for project related issues and releases
  • Creation of Packages & report testing using SSRS & SSIS
  • Apply Builds and releases
  • Apply hotfixes & monthly security patching
  • Supporting Siebel 8.0 application, ADM Export & Import
  • Prepare document for all KEDB

Database Administrator

Source Pro Technologies Pvt Ltd
06.2006 - 08.2007
  • Company Overview: Source Pro is a sister company of EWIE Co, Inc, US which is in to business of supply chain optimization for metalworking and industrial supply commodities which provides Cost Savings while increasing customer satisfaction
  • Source Pro is completely into IT business/software development of EWIE
  • Participate in planning, preparation and deployment of SQL and Web applications
  • Validate and review phases of deployment activities
  • Troubleshoot SQL Servers, Application related Issues
  • Install & configure SQL Servers
  • Maintaining SQL Server and databases
  • Backup and restore of Databases
  • Creating and implementing disaster recovery plan
  • Solving day to day problems, Trouble shooting
  • Database security, Performance monitoring, Space Management
  • Creating & Scheduling of Jobs
  • Creating database maintenance plan
  • Creation of Packages & report testing using SSRS & SSIS
  • Installation, Configuration & Monitoring MOM
  • Co-coordinating with onsite team on daily basis
  • Document resolution of all issues
  • Source Pro is a sister company of EWIE Co, Inc, US which is in to business of supply chain optimization for metalworking and industrial supply commodities which provides Cost Savings while increasing customer satisfaction
  • Source Pro is completely into IT business/software development of EWIE

Database Administrator

Spectrum Magazines Limited
11.2003 - 01.2006
  • Installation of SQL server
  • Maintain SQL Server and databases
  • Solving day to day problems, Trouble shooting
  • Deploying applications
  • Database security
  • Performance monitoring
  • Creating database maintenance plan
  • Setting up & monitoring replication
  • Perform backup and restore periodically
  • Configure/trouble-shoot SQL Server connectivity

Instructor

SKDAV College
10.2002 - 10.2003

Education

B. Tech - Computer Science & Engineering

Berhampur University
Berhampur, Odisha
01.1998 - 01.2002

Project Management -

Bellevue College
Bellevue, Washington

Skills

Capacity planning

Microservices architecture

Configuration management

Security best practices

Disaster recovery

Database administration

Performance tuning

Software development

Infrastructure automation

Continuous deployment

Application scaling

Data visualization and presentations

Python programming

Training

Project Management from Bellevue College.

Awards

Rewarded as star of the quarter for Valuable contribution By Patni Computers System.

Accomplishments

  • Established standards for AI/ML by leading multiple PoCs and hackathons, introducing AI/ML technologies to the team, and developing tools to strengthen guardrails and enhance operational efficiency
  • Implemented a Machine Learning Resiliency Scorer, achieving 95% resiliency for most of cosmosdb system and significantly enhancing reliability in critical tasks.
  • Achieved parity for 25+ systems across diverse cloud locations, ensuring consistency and alignment across varying dimensions.
  • Achieved 99.99% Resiliency for MicrosoftLearn
  • Achieved parity for 25+ systems across diverse cloud locations, ensuring consistency and alignment across varying dimensions.
  • Successfully migrated docs.microsoft.com to learn.microsoft.com, ensuring a seamless transition with zero downtime.

Certification

MCTS

Timeline

Site Reliability Engineer

Microsoft
09.2018 - Current

Site Reliability Engineer in MSDN Group(Microsoft)

PamTen
12.2015 - 09.2018

Senior DBA(Microsoft)

PamTen
05.2014 - 05.2015

Service Engineer (Production & Pre-production Support)

PamTen
11.2013 - 05.2014

MCTS

09-2007

SQL DBA, Siebel Apps DBA, SQL Developer (SSIS)

Patni Computer Systems Limited
08.2007 - 04.2008

Database Administrator

Source Pro Technologies Pvt Ltd
06.2006 - 08.2007

Database Administrator

Spectrum Magazines Limited
11.2003 - 01.2006

Instructor

SKDAV College
10.2002 - 10.2003

B. Tech - Computer Science & Engineering

Berhampur University
01.1998 - 01.2002

Project Management -

Bellevue College
Banani RathSenior Engineer