Detail-oriented team player with strong organizational skills. Ability to handle multiple projects simultaneously with a high degree of accuracy.
Overview
9
9
years of professional experience
Work History
Senior Software Engineer
Amazon
02.2022 - Current
Part of team that designed and built Big data Infrastructure for Amazon Buy with Prime product.
Built connectors that can collect data from across different domain teams like order, payment, product etc.
Event Driven Architecture for event ingestion, processing and serving to consumers leveraging Amazon Event Bridge, API gateway, SQS, SNS and lambda.
Event Driven Architecture for providing big data compute on demand leveraging Amazon SQS, SNS, EMR and lambda.
Data Mesh implementation for providing a unified data lake with schema evolution using avro and Glue schema registry
Senior BigData Solutions Architect
AWS
06.2020 - 01.2022
Engaged customers with EMR on EKS product demo, understand customer requirements, propose architecture, best practices, support with on-boarding including code migration, performance testing, deployment and job monitoring.
Implementation to integrate spark monitor with EMR On EKS for interactive notebook based workloads which enabled to on-board big customers.
Contributed to design reviews within EMR on EKS product team for multiple features like Custom Images, notebook support and storage options.
Published white paper on EMR on EKS observability and helped multiple customers to setup monitoring and alerting for their spark workloads.
Published the learnings from customer engagements as EMR on EKS best practices .
Evangelized EMR on EKS across bigdata community through deep dives.
Co-developed EMR on EKS Airflow operator .
Load test EMR on EKS and establish benchmarks for large Fin tech customers.
Software Development Manager
Nordstrom
03.2018 - 05.2020
Led a team of 6 developers to build a Hadoop platform on top of Amazon EMR from scratch
Collaborated with other engineering teams, developers, data scientists, business analysts, tech leadership to create a roadmap for the platform and delivered over multiple release cycles
Collaborated with security and privacy teams to identify vulnerabilities, evaluate options, document design approach and drive execution to deliver the features
Created a hybrid architecture to a seamless scale between on-prem and cloud
Constantly achieve AWS cost reduction with clusters on-demand, idle cluster termination, cluster profiles to support workload patterns, custom auto-scaling rules, spot instance usage, RI coverage, Utilization metrics, explore APM tools like new relic and Unravel
REST API for cluster provisioning, slack bot integration, airflow operator for cluster provisioning to make the customer experience better
Infrastructure as code using Terraform
Played the role of a scrum master, maintained the JIRA board. Guided the team to mature DevOps practices like CICD pipeline, immutable deployments, secret management, Bi-weekly release cycle, peer review, and integration testing
Senior Software Engineer
Nordstrom
01.2017 - 02.2018
Lead engineer to design and drive execution of a promotion engine built on top of the Neo4j graph database
Built the team of 5 engineers, mentored them and made them proficient in graph data modeling, databases clustering and helped them to deliver the promotion REST API.
Senior Software Engineer
Nordstrom
01.2016 - 01.2017
Lead engineer to design and drive the build-out of ELK ( Elasticsearch-Logstash-Kibana) stack for our log analytics
Self-supported the open-source version of Elasticsearch
Senior Software Engineer
Nordstrom
12.2014 - 12.2015
Lead engineer to design and drive execution of our Order Routing engine with JBOSS rules engine
Worked with business, legacy order management systems to understand the existing rules and document them
Created a framework to test multiple version of rules in production, so that changes to routing logic can be verified before implementation
Skills
Expertise in both open source Hadoop stack and EMR Data Pipeline design, development and job performance tuning ( Spark and Presto)
Expertise with EMR On EKS ( Spark on kubernetes)
Expertise in batch and streaming data pipelines leveraging Apache Kafka, Apache spark and serverless compute
Data catalog solutions built leveraging Apache Atlas and Apache Ranger
AAA ( Authentication, Authorization and Auditing ) leveraging Apache Knox and Apache Ranger
ELK stack for log analytics and alerting Observability stack leveraging Prometheus and Grafana APM tools like AWS X-Ray and Dynatrace
Promotion and product catalog implementation using neo4j graph database
Order fulfillment routing logic implementation leveraging JBOSS rules engine
AWS cost management optimization
Job orchestration engines like Airflow and Oozie
Designed and built enterprise Big data platform from open source Apache projects like knox, ranger, atlas,NiFi with AWS EMR
Expertise in Point of Sales system including mobile POS
Accomplishments
Published EMR on EKS best practices recommendations and gave sessions for customers.( https://aws.github.io/aws-emr-containers-best-practices/, https://youtube/D3WOEhcev88?si=xgt2iSL0LTHX1pGT)
Won Nordstrom Hackathon for implementing Store Printing As a Service which abstracts the printer model and make from all applications used inside the store.
Helped to design and review code commits to Apache Knox.
Presented at 2019 Starburst Presto conference and presented the multi tenant, on demand scaling of our presto implementation.