Summary
Overview
Work History
Education
Skills
TechStack
Accomplishments
Certification
Websites
Timeline
Generic
Priya Sundaram

Priya Sundaram

San Ramon,CA

Summary

Technology and Engineering Leader specializing in data analytics, machine learning, and AI architecture. Proven ability to drive results through effective leadership, project management, and process optimization. Successfully contributed to data-driven projects that achieved significant positive outcomes.

Overview

20
20
years of professional experience
1
1
Certification

Work History

Data Engineering Manager

Brooks Sports Inc
Remote, CA
11.2021 - Current
  • Led a complete migration of data pipelines from a custom ETL framework to Airflow, which reduced technical debt by 90% and improved production troubleshooting time by 80%.
  • Optimized Snowflake performance to achieve a 28% cost savings.
  • Formed and led a team of 2 data engineers, a solution architect, and a DevOps engineer, building the Data Engineering team from the ground up.
  • Established an offshore production support team to provide 24/7 support, working with business teams to set up SLAs for all data pipelines and streamline support and reporting to leadership.
  • Implemented a comprehensive DataOps framework, enhancing data observability and reliability through automated data quality functions, alerting, and monitoring.
  • Developed and deployed an MLOps framework to increase data scientist productivity by 50% through improved tool adoption and processes for model onboarding using AWS Sagemaker AI
  • Oversaw the Platform Data Engineering team, driving the introduction of new tools and technologies, automation, and self-service capabilities.
  • Crafted a strategic 1-3 year roadmap for the Data Engineering organization, aligning future initiatives with business goals.

Lead Data Engineer/Solution Architect

Albertsons Companies
Pleasanton, CA
10.2019 - 11.2021
  • Led implementation of next-generation real-time data warehouse in Snowflake with Kafka and Azure technologies.
  • Managed high-visibility data engineering projects during COVID, enhancing online shopping experience.
  • Achieved 90% reduction in price parity by updating online product catalog via real-time data pipelines.
  • Reviewed technical debt and new capabilities, collaborating with stakeholders on release roadmaps.
  • Collaborated with architects, data modelers, and engineers to deliver robust data solutions.
  • Served as subject matter expert for all data projects within loyalty and fulfillment COE teams.

IT Manager

Cognizant Technology Solutions
Pleasanton, CA
04.2019 - 10.2019
  • Led two Data Engineers in migrating machine learning pipelines from Spark 1.8 to Spark 2.3.
  • Migrated ETL pipelines from Cloudera cluster in Spark 1.6 to EMR using Spark 2.3.
  • Performed enhancements and resolved job failures, testing processes in EMR and scheduling in Airflow.
  • Updated ETL pipelines with new Geo definitions, optimizing query performance using Hive parameters.
  • Redesigned Host Revenue Management pipeline, collaborating with Data Science team to rectify logical errors.
  • Migrated Host Revenue Management pipeline from Hive to Spark 2.3, enhancing operational efficiency.

Lead Data Engineering Consultant

Airbnb
Mountain View, CA
06.2018 - 04.2019
  • Managed Omnichannel - Customer 360 project to deliver comprehensive customer insights.
  • Created Data Lake by extracting data from diverse sources, including flat files and databases.
  • Utilized Apache Hive/Impala for efficient ETL processes on HDFS with dynamic partitioning.
  • Constructed aggregate layers using Hive/Impala queries for weekly and monthly reporting.
  • Designed ETL frameworks with internal and external tables, storing data in parquet format.
  • Optimized performance through data partitioning and physicalization of Hive parameters.
  • Developed process validation scripts to generate exception reports, ensuring data quality.
  • Automated ETL processes with Bash and Python, scheduling jobs in Oozie.

Senior Big Data Engineer

Bank of the West
Mountain View, CA
06.2011 - 06.2018
  • Developed data pipeline for ingesting cyber-attack information using Python and Oozie scheduler for Cybersecurity Analytics Team.
  • Designed and managed external Hive tables for efficient data storage in Parquet, AVRO, and ORC formats.
  • Collaborated on database schema design with Erwin, integrating data from Oracle, SQL Server, and web sources.
  • Automated machine learning model for online account attrition with Data Scientist team using Shellscript.
  • Executed Oozie workflows to schedule Sqoop and Hive jobs for data extraction, transformation, and loading.
  • Formulated complex ETL queries for Analytics team to retrieve data from Oracle and SQL Server into Hive via Sqoop.
  • Acquired expertise in PySpark dataframes for building machine learning models and transformations.
  • Configured MongoDB cluster with three replica sets and sharding as proof of concept for NoSQL database.

Senior Oracle Consultant

Gilead Sciences
Foster City, CA
06.2009 - 11.2011

Senior Oracle Consultant

Cisco International
06.2009 - 11.2009

Senior Applications DBA

Finisar Corporations
Sunnyvale, CA
09.2006 - 06.2009

Oracle Applications DBA

Phoenix Technologies
Milpitas, CA
10.2005 - 09.2006

Education

Standford LEAD - Business & Leadership Program

Stanford Graduate School of Business
Palo Alto, CA
05-2025

B.E. - Electrical & Electronics Engineering

Periyar University
Tamilnadu, India
04-2002

Skills

  • Architecture and engineering management
  • Data strategy development
  • Team development and mentorship
  • Vendor management
  • Project management and agile practices
  • Stakeholder communication strategies
  • Data engineering and platform engineering
  • Generative AI and machine learning
  • Data modeling and visualization
  • Database development and administration
  • Real-time analytics
  • Performance optimization
  • Disaster recovery planning

TechStack

AI / Machine Learning: AWS SageMaker, Prompt Engineering

Big Data Technologies: Hadoop, HDFS, MapReduce, Hive, Pig, Sqoop, Oozie, AWS S3, Spark SQL, Apache Hue, Cloudera Manager, EMR

Programming Languages: Python, Bash Script, SQL, Scala

Databases: Snowflake, Oracle, SQL Server, MongoDB

Operating Systems: Linux, Sun Solaris, AIX, Windows

Scheduling Tools: Airflow, Oozie, Crontab

Accomplishments

  • Above & Beyond award for supporting online shopping experience project during COVID (Albertsons)
  • Innovation award for supporting data streams for ML Engineering projects(Albertsons)
  • Nominated for Top women in Grocer in Technical Category for building real time product catalog update in online website (Albertsons)
  • Intellectual Contribution Award at Stanford LEAD (Power of Story)

Certification

  • Exercising Leadership: Foundation Principles from HarvardX
  • Fundamentals of Project Planning and Management from Coursera
  • Algorithmic Toolbox
  • Python Classes & Inheritance
  • AWS Certified Cloud Practitioner
  • BIGDATA Hadoop Certification
  • Oracle Certified Professional
  • Snowpro Certification

Timeline

Data Engineering Manager

Brooks Sports Inc
11.2021 - Current

Lead Data Engineer/Solution Architect

Albertsons Companies
10.2019 - 11.2021

IT Manager

Cognizant Technology Solutions
04.2019 - 10.2019

Lead Data Engineering Consultant

Airbnb
06.2018 - 04.2019

Senior Big Data Engineer

Bank of the West
06.2011 - 06.2018

Senior Oracle Consultant

Gilead Sciences
06.2009 - 11.2011

Senior Oracle Consultant

Cisco International
06.2009 - 11.2009

Senior Applications DBA

Finisar Corporations
09.2006 - 06.2009

Oracle Applications DBA

Phoenix Technologies
10.2005 - 09.2006

Standford LEAD - Business & Leadership Program

Stanford Graduate School of Business

B.E. - Electrical & Electronics Engineering

Periyar University