Data Engineer with expertise in building data pipelines from the ground up, launching 0-to-1 products, and enabling data-driven decision-making for data scientists. Experience spanning advertising analytics at Meta, legal technology at Google, and enterprise data solutions across cloud platforms. Strong background in SQL, Python, Apache Spark, Databricks, and building scalable data warehouses.
Overview
7
7
years of professional experience
Work History
Data Engineer
Meta
Menlo Park
05.2025 - Current
0-to-1 Product Development & Foundational Data Layers
Built foundational data infrastructure for new advertising products from scratch, enabling analytics capabilities before product launch
Designed and implemented core data models and pipelines that serve as the backbone for product metrics, reporting, and experimentation
Partnered with product and engineering teams to define data requirements, schemas, and key metrics for greenfield initiatives
Established data quality frameworks and monitoring for newly launched products, ensuring reliable insights from day one
Data Pipeline Development & Infrastructure
Developed and maintained scalable data pipelines for revenue attribution analysis across the ads streaming infrastructure
Built pipeline-to-datasource mapping systems for data lineage, cataloging, and dependency tracking
Monitored streaming pipeline health, tracking latency violations and SLA compliance for ad delivery systems
Supported data scientists by ensuring reliable, timely data availability for analytics and modeling
Product-Centric Ads (PCA) Analytics
Built the foundational analytics layer for PCA, a new advertising product enabling dynamic product recommendations
Created and maintained the PCA Longterm Holdout Dashboard, tracking statistical metrics for experiment analysis including conversions, clicks, impressions, ad scores, CTR, CVR, and event-based revenue with confidence intervals
Developed the Product Selection Delivery Funnel Dashboard, providing end-to-end visibility from catalog inclusion to conversion, identifying bottlenecks and drop-off rates across ad formats
Catalog Product Video (CPV) Funnel Analysis
Designed the initial data architecture for CPV funnel tracking as the product launched
Built the CPV Funnel Dashboard, tracking 7-stage user progression to identify revenue attrition points across ad formats (Carousel, Single Media, Collection) and top placements (Facebook Feed, Facebook Reels, Instagram Feed, Instagram Reels, etc.)
Experimentation & Targeting Analytics
Partnered with data scientists to perform A/B experiment analysis for Instagram and Ads experiments, measuring statistical significance and business impact
Created Targeting Unified Monitoring Dashboard for launches and experiments, analyzing revenue exclusion across campaign dimensions
Supported Instagram holdout experiment analysis to measure long-term product impact
Dashboard Quality & Standards
Monitored dashboard quality scores, ensuring compliance with visualization, reliability, performance, and data freshness standards
Established best practices for data engineering dashboard creation
Data Engineer
Slalom
Redwood City, California
05.2021 - 03.2025
0-to-1 Product Launch & Data Strategy
Spearheaded the Patents Data Engineering Team at Google, leading to the development and launch of the innovative product 'ADAPT Legal', which revolutionized the way patent data was analyzed and utilized
Developed and implemented comprehensive data strategies that played a pivotal role in preserving, defending, and augmenting the value of the company's patented technological assets
Created the Patent Blocking Index (PBI) – a novel numeric index that determines the role a patent plays in blocking another patent, calculated by analyzing citations and references between patents
Scalable Data Warehouse & Infrastructure
Built and managed scalable data warehouses using Apache Spark and Databricks, enabling high-performance analytics across enterprise datasets
Took charge of building and managing the data infrastructure, building ETL processes and ensuring data integrity across multiple workflow applications, resulting in significant reduction in data processing times
Implemented data unification targeting multiple sources utilizing REST APIs
Utilized Presto-powered pipelines in DataSwarm (Airflow) to extract, transform, and load data from Hive
Web Analytics & Data Streaming
Developed and implemented a comprehensive web analytics solution for the centralized compliance and ethics hub at Meta
Improved data accuracy by designing and implementing data streaming architecture in collaboration with Full Stack Engineers
Designed and implemented an efficient logging spec architecture to enable seamless data landing in Hive tables, minimizing the need for intensive data cleaning efforts
Created interactive Tableau dashboards to provide stakeholders with real-time visibility into key performance indicators
Data Science Enablement & ML Collaboration
Strategically collaborated with machine learning experts and data scientists to enable their workstreams and support advanced analytics initiatives
Achieved a 156% year-over-year (YOY) increase in traffic on the compliance hub by leveraging data insights and implementing effective targeting strategies
Data and Analytics Engineer
Insight Global
Mountain View, United States
01.2019 - 04.2021
End-to-End Go-to-Market Analytics
Led an end-to-end project for Go-to-Market Chrome OS sales, delivering comprehensive analytics capabilities to drive sales strategy
Led deep architectural discussions to transform raw data into actionable Knowledge KPIs, driving data-driven decision-making for multiple clients
Developed data architecture and worked with vendors to streamline data ingestion processes across the Chrome OS ecosystem
Built ETL data pipelines, BI models, and dashboards on GCP Cloud platform using Looker Studio and Plx tools
Increased data processing speed by 35% by building optimized ETL pipelines for 3 major projects
Collaborated with cross-functional teams to design and implement data integrations, ensuring seamless flow of metrics across systems
Onboarded and managed 10 partner repair centers across the globe, standardizing data models to smooth the data ingestion process and reducing failed pipeline pickups