Summary
Overview
Work History
Skills
Software Skills
Education
Quote
Timeline
Certification
Languages
Generic

Sudhakar Selvarajan

Phoenix,Arizona

Summary

As a Data Technology and Machine Learning Manager, I focus on harnessing data to drive strategic insights and inform decision-making processes. With expertise in cloud-based data solutions including GCP, Azure, Snowflake, and Databricks, I have been pivotal in maintaining data platforms, building robust data pipelines, and enhancing analytical and machine learning capabilities. By partnering with cross-functional teams, I enable organizations to navigate complex data challenges through the implementation of advanced solutions. Bringing a wealth of experience across diverse industries such as the financial sector, healthcare, and retail, I contribute significant value and a broad perspective to the organizations I work with.

Overview

20
20
years of professional experience
1
1
Certificate

Work History

Manager , Data Technology & Machine Learning

PetSmart
11.2023 - Current
  • Maintain and oversee platform engineering for all data platforms which includes Databricks, Snowflake, Informatica, Cleo, Alteryx and GCP BigData.
  • Manage platforms and finances for projects worth more than $20 million.
  • Lead machine learning engineering - Create and maintain MLOPS architecture, Feature engineering and venturing into LLMs and LLMOPS
  • Lead data modernization projects like Data maturity program which involves decommissioning of workloads running on Informatica power center and netezza and move them to Spark on databricks. Migrating 2000 tables and 800 jobs to databricks.
  • Implement Data Clean rooms and Data sharing using cloud based technologies
  • Lead cross-functional teams to achieve project goals.
  • Reduce operational costs through comprehensive process improvement initiatives and resource management.
  • Lead innovation strategy for data organization.
  • Achieved significant cost savings by renegotiating contracts with key vendors, without compromising service quality.
  • Developed strong company culture focused on employee engagement, collaboration, and continuous learning opportunities - Created center of excellence and office hours for data engineering.
  • Maintain and oversee vendor relations and finances for all the data platforms.

Manager - BigData Engineering & Platforms

PetSmart
04.2022 - 10.2023
  • Strong technical leadership skills with experience in managing customer, vendor and teams .
  • Led a global team of 60 bigdata data engineers .
  • Provided technical and architectural insights to my team to help them create Ingestion, data quality and data comparison frameworks in databricks
  • Streamlined project delivery processes, significantly reducing time to market for new product launches.
  • Organized professional development programs for staff, leading to improved performance and skill sets.
  • Achieved departmental goals by developing and executing strategic plans and performance metrics.
  • Enhanced team productivity by implementing agile methodologies, leading to more efficient project completion.
  • Managed budgets effectively, ensuring optimal financial performance while investing in necessary resources for business growth.
  • Built high-performing teams through effective recruitment, onboarding, and talent development initiatives.

Lead BigData Engineer

PetSmart
04.2021 - 03.2022
  • Led team of 5 Big Data engineers in designing, developing, and maintaining data pipelines.
  • Architected end-to-end data solutions using Databricks.
  • Designed and developed complex ETL pipelines using Apache Spark and Databricks,.
  • Championed best practices for data engineering within organization.
  • Managed performance tuning and optimization efforts for Databricks and Snowflake,.
  • Mentored and provided technical guidance to junior team members.
  • Evaluated and selected appropriate technologies and tools for data processing, storage, and analytics.
  • Collaborated with cross-functional teams to ensure data governance, security, and compliance requirements were met throughout the data engineering lifecycle.
  • Designed and maintained data architecture documentation, data flow diagrams, and technical specifications.
  • Collaborated with infrastructure and operations teams to manage and scale the Databricks and Snowflake environments,.

Senior BigData Engineer

PetSmart
08.2019 - 03.2021
  • Designed, developed, and maintained complex data pipelines using Databricks and Apache Spark.
  • Implemented performance optimizations and tuning techniques on Spark jobs.
  • Led integration of real-time and batch processing pipelines within Databricks.
  • Implemented data quality checks and validation processes.
  • Created process of code reviews in data engineering team.
  • Troubleshoot and resolved complex data issues.

BigData Engineer

Infowave Systems Inc
05.2014 - 07.2019

Client : Dignity Health

  • Led a team to build data ingestion pipelines to migrate data from legacy mainframe application to Hadoop ecosystem.
  • Designed and developed complex, near real time, severe sepsis time zero and sepsis bundle compliance logic in SQOOP, PIG and Python which was displayed in dashboards and apps to help physicians make clinical decisions
  • Built data pipelines to bring in population health data into Hadoop ecosystem, archive the data and send it out to Athena
  • Implemented the machine learning algorithm that was provided by the clinicians, in PIG and Python
  • Designed and developed near real time data pipelines to bring clinical and ADT data, using SQOOP and PIG scripts and sent them over to third party vendor to trigger alerts with respect to patient safety in hospitals

Project Lead , Application Architect

Syntel Inc
09.2004 - 04.2014

Client : American Express

  • Creation of various data models for storing merchant information.
  • Perform strategic data analysis and research to device predictive analysis algorithm.
  • Optimize ETL process to enable faster and efficient transformation and loads.
  • Migration of DB from DB2 to Netezza.
  • Migration of ETL from Datastage 8.1 to 8.7.
  • Created and managed financial capture applications

Skills

  • Data Engineering
  • Machine Learning
  • Gen AI
  • Spark
  • Databricks
  • Snowflake
  • GCP
  • Azure Cloud Eco System
  • Data Clean Rooms
  • Data Sharing
  • Bigdata (HDFS, Hive, HBase etc)
  • Delivery Excellence
  • Architecture
  • Project oversight
  • Technology leadership
  • Building great teams
  • Strategy planning
  • Netezza, SQL server, Oracle
  • ETL tools
  • Mainframes
  • DevOps
  • Agile, Scrum

Software Skills

  • Python
  • SQL
  • Terraform
  • Docker
  • Kubernetes

Education

Bachelor of Engineering - Mechanical Engineering

College Of Engineering , Guindy , Anna University
Chennai
05.2004

Quote

If you really look closely, most overnight successes took a long time.
Steve Jobs

Timeline

Manager , Data Technology & Machine Learning

PetSmart
11.2023 - Current

Manager - BigData Engineering & Platforms

PetSmart
04.2022 - 10.2023

Lead BigData Engineer

PetSmart
04.2021 - 03.2022

Senior BigData Engineer

PetSmart
08.2019 - 03.2021

BigData Engineer

Infowave Systems Inc
05.2014 - 07.2019

Project Lead , Application Architect

Syntel Inc
09.2004 - 04.2014

Bachelor of Engineering - Mechanical Engineering

College Of Engineering , Guindy , Anna University

Certification

  • Databricks Associate Engineer
  • IBM Datastage

Languages

English
Full Professional
Tamil
Native or Bilingual
Hindi
Professional Working
Sudhakar Selvarajan