Summary
Overview
Work History
Education
Skills
Personal Information
Languages
Timeline
Generic

Muhammad Janjua

San Ramon,United States

Summary

Data Engineer with expertise in building data pipelines from the ground up, launching 0-to-1 products, and enabling data-driven decision-making for data scientists. Experience spanning advertising analytics at Meta, legal technology at Google, and enterprise data solutions across cloud platforms. Strong background in SQL, Python, Apache Spark, Databricks, and building scalable data warehouses.

Overview

7
7
years of professional experience

Work History

Data Engineer

Meta
Menlo Park
05.2025 - Current
0-to-1 Product Development & Foundational Data Layers
  • Built foundational data infrastructure for new advertising products from scratch, enabling analytics capabilities before product launch
  • Designed and implemented core data models and pipelines that serve as the backbone for product metrics, reporting, and experimentation
  • Partnered with product and engineering teams to define data requirements, schemas, and key metrics for greenfield initiatives
  • Established data quality frameworks and monitoring for newly launched products, ensuring reliable insights from day one
Data Pipeline Development & Infrastructure
  • Developed and maintained scalable data pipelines for revenue attribution analysis across the ads streaming infrastructure
  • Built pipeline-to-datasource mapping systems for data lineage, cataloging, and dependency tracking
  • Monitored streaming pipeline health, tracking latency violations and SLA compliance for ad delivery systems
  • Supported data scientists by ensuring reliable, timely data availability for analytics and modeling
Product-Centric Ads (PCA) Analytics
  • Built the foundational analytics layer for PCA, a new advertising product enabling dynamic product recommendations
  • Created and maintained the PCA Longterm Holdout Dashboard, tracking statistical metrics for experiment analysis including conversions, clicks, impressions, ad scores, CTR, CVR, and event-based revenue with confidence intervals
  • Developed the Product Selection Delivery Funnel Dashboard, providing end-to-end visibility from catalog inclusion to conversion, identifying bottlenecks and drop-off rates across ad formats
Catalog Product Video (CPV) Funnel Analysis
  • Designed the initial data architecture for CPV funnel tracking as the product launched
  • Built the CPV Funnel Dashboard, tracking 7-stage user progression to identify revenue attrition points across ad formats (Carousel, Single Media, Collection) and top placements (Facebook Feed, Facebook Reels, Instagram Feed, Instagram Reels, etc.)
Experimentation & Targeting Analytics
  • Partnered with data scientists to perform A/B experiment analysis for Instagram and Ads experiments, measuring statistical significance and business impact
  • Created Targeting Unified Monitoring Dashboard for launches and experiments, analyzing revenue exclusion across campaign dimensions
  • Supported Instagram holdout experiment analysis to measure long-term product impact
Dashboard Quality & Standards
  • Monitored dashboard quality scores, ensuring compliance with visualization, reliability, performance, and data freshness standards
  • Established best practices for data engineering dashboard creation

Data Engineer

Slalom
Redwood City, California
05.2021 - 03.2025
0-to-1 Product Launch & Data Strategy
  • Spearheaded the Patents Data Engineering Team at Google, leading to the development and launch of the innovative product 'ADAPT Legal', which revolutionized the way patent data was analyzed and utilized
  • Developed and implemented comprehensive data strategies that played a pivotal role in preserving, defending, and augmenting the value of the company's patented technological assets
  • Created the Patent Blocking Index (PBI) – a novel numeric index that determines the role a patent plays in blocking another patent, calculated by analyzing citations and references between patents
Scalable Data Warehouse & Infrastructure
  • Built and managed scalable data warehouses using Apache Spark and Databricks, enabling high-performance analytics across enterprise datasets
  • Took charge of building and managing the data infrastructure, building ETL processes and ensuring data integrity across multiple workflow applications, resulting in significant reduction in data processing times
  • Implemented data unification targeting multiple sources utilizing REST APIs
  • Utilized Presto-powered pipelines in DataSwarm (Airflow) to extract, transform, and load data from Hive
Web Analytics & Data Streaming
  • Developed and implemented a comprehensive web analytics solution for the centralized compliance and ethics hub at Meta
  • Improved data accuracy by designing and implementing data streaming architecture in collaboration with Full Stack Engineers
  • Designed and implemented an efficient logging spec architecture to enable seamless data landing in Hive tables, minimizing the need for intensive data cleaning efforts
  • Created interactive Tableau dashboards to provide stakeholders with real-time visibility into key performance indicators
Data Science Enablement & ML Collaboration
  • Strategically collaborated with machine learning experts and data scientists to enable their workstreams and support advanced analytics initiatives
  • Achieved a 156% year-over-year (YOY) increase in traffic on the compliance hub by leveraging data insights and implementing effective targeting strategies

Data and Analytics Engineer

Insight Global
Mountain View, United States
01.2019 - 04.2021

End-to-End Go-to-Market Analytics

  • Led an end-to-end project for Go-to-Market Chrome OS sales, delivering comprehensive analytics capabilities to drive sales strategy
  • Led deep architectural discussions to transform raw data into actionable Knowledge KPIs, driving data-driven decision-making for multiple clients
  • Developed data architecture and worked with vendors to streamline data ingestion processes across the Chrome OS ecosystem
  • Built ETL data pipelines, BI models, and dashboards on GCP Cloud platform using Looker Studio and Plx tools
  • Increased data processing speed by 35% by building optimized ETL pipelines for 3 major projects
  • Collaborated with cross-functional teams to design and implement data integrations, ensuring seamless flow of metrics across systems
  • Onboarded and managed 10 partner repair centers across the globe, standardizing data models to smooth the data ingestion process and reducing failed pipeline pickups

Education

Bachelors of Science - Industrial Engineering

CSU EastBay University

Skills

  • Data modeling
  • Data curating
  • Data governance
  • Google Cloud Platform and SQL
  • ETL and ELT processes
  • Data warehousing solutions
  • Data visualization tools
  • Tableau and Looker
  • Snowflake and Databricks
  • Presto and Hive
  • Apache Spark

Personal Information

Title: Experienced Data Engineer

Languages

  • Billingual

Timeline

Data Engineer

Meta
05.2025 - Current

Data Engineer

Slalom
05.2021 - 03.2025

Data and Analytics Engineer

Insight Global
01.2019 - 04.2021

Bachelors of Science - Industrial Engineering

CSU EastBay University
Muhammad Janjua