I enjoy methodically solving distributed-system issues. For the past 8 years I have been the primary point of contact for Tier 6 customers, handling scalability issues across the Data Platform stack on-premises and on Kubernetes.
Overview
13
years of professional experience
1
Certification
Work History
Senior Staff Premier Support Engineer
Cloudera
Cary, NC
01.2021 - Current
Troubleshoot critical issues on multi-tenant clusters across the storage (HDFS, Apache Ozone), processing (Spark, Hive, Impala, YARN), and security (Ranger, Venafi, Kerberos) layers.
Troubleshoot issues in the containerized ECS platform (Rancher) and Longhorn (a cloud-native distributed block storage system).
Developed a deep-dive runbook for troubleshooting overlay network issues in Rancher clusters and addressing scalability challenges in Apache Ozone, improving troubleshooting efficiency.
Conducted root cause analysis to differentiate customer configuration issues from product defects, documenting bugs with clear reproduction steps to expedite resolution.
Advocated for customers by tracking their feature enhancement requests and bugs.
Collaborated with quality engineering teams to identify scalability bottlenecks and track long-term design fixes, ensuring system reliability and performance.
Worked closely with solution engineers, resident architects, and the product team to keep critical customer-requested features on track for delivery and to ensure customer project pipelines met their deadlines.
Contribute bug fixes and enhancements to the Apache Ozone open-source project (HDDS-12703, HDDS-12404, HDDS-12288, HDDS-12194), improving observability, admin tooling, and operational visibility in production releases.
Staff Premier Support Engineer
Cloudera
Chennai
02.2021 - 09.2021
Served as a dedicated support engineer for the company’s second-largest customer, providing specialized technical assistance and ensuring high service reliability.
Tuned performance for Impala, HDFS, Hive, BDR, Spark, and HBase, improving system efficiency and reliability.
Validated customer cluster configurations and system settings, ensuring alignment with best practices for optimal performance.
Resolved scalability issues on HDFS, Impala, and BDR regularly.
Conducted regular synchronization calls with customers to discuss upgrade plans, address errors, and clarify documentation, utilizing validators to proactively detect and deflect issues.
Engaged in cross-pod collaboration, expanding knowledge across a wide range of components in the Hadoop ecosystem.
Raised feature and bug requests and contributed to documentation in Jira.
Senior Premier Support Engineer
Cloudera
Chennai
02.2017 - 01.2021
Managed general queue and specialized in analytics customer accounts.
Performed root cause analysis and delivered troubleshooting solutions.
Led performance optimization efforts in Impala by fine-tuning customer queries and delivering best-practice recommendations to ensure SLA compliance.
Developed custom scripts to parse query profiles, enhancing performance analysis and uncovering optimization opportunities.
Designated Support Engineer
Cloudera
Chennai
02.2016 - 01.2017
Onboarded customers to cloud infrastructure, facilitating smooth transition and integration.
Collaborated with the Cloudera Director engineering and account teams to identify migration issues and develop tailored solutions.
Provided technical support for data management and analytics platforms.
Troubleshot and resolved customer issues with Cloudera software solutions.
Collaborated with cross-functional teams to enhance product functionality.
Customer Operations Engineer
Cloudera
Chennai
11.2015 - 01.2016
Started as a technical support engineer in the Access and Integration pod.
Focused primarily on issues related to Impala, Oozie, Sqoop, Hive, and Sentry.
Used this phase to study the security layer, including Kerberos, LDAP, and SSL, alongside the above components.
Hadoop Cluster Admin
Tata Consultancy Services
Chennai, Tamil Nadu
01.2013 - 01.2015
Designed and executed the seamless migration of vanilla Apache Hadoop and Cassandra clusters with zero downtime.
Played a key role in performance tuning and capacity planning for both Cassandra and Hadoop clusters.
Developed shell scripts to automate tasks such as purging application logs, backing up NameNode metadata and the MySQL metastore, and removing Cassandra snapshots.
Wrote Flume client code and a custom source component to ingest compressed data from remote machines.
Tested and validated failover and load-balancing of data across Flume agents.
Education
Bachelor of Technology - Information Technology
Pondicherry Engineering College
Pondicherry
01.2013
Skills
Expertise in resolving sub-millisecond latency issues in distributed storage systems such as HDFS and Apache Ozone
Big Data Ecosystem:
Storage: Ozone, Hadoop, HBase
Data Processing: Impala, Hive, Spark, BDR, YARN, Iceberg
Security: Ranger, Kerberos, TLS
Containers & Orchestration: Cloudera Embedded Container Service (Rancher/Longhorn)
Devops Cloud Data Engineer Azure at End Client UK Public Sector (DWP, MoJ, MoD)