Summary
Overview
Work History
Education
Skills
Research
Timeline
Generic

Simarjeev Singh

San Francisco,CA

Summary

Proactive specialist in data modeling, analytics, and software development for large-scale projects. Proficient in monitoring, troubleshooting, and optimizing database performance. Demonstrates analytical prowess, effective problem-solving, and in-depth knowledge of database technologies. Accomplished working independently or collaboratively with strong communication skills.

Overview

6
6
years of professional experience

Work History

Data Engineer

Twitch
10.2021 - 01.2024
  • Served as the central data expert, enabling data-driven executive decisions by creating 50+ datasets for the warehouse and spearheading organization-wide data quality initiatives.
  • Designed and developed a robust DQ tool, leading a team of 5 engineers, catering to customer needs with 40+ tests for data validation. Enabled easy adoption by both technical and non-technical staff, reducing data quality issue response time by 85%.
  • Attuned to C-suite needs, determined data enrichment priorities, creating 15+ real-time dashboards, and streamlining data quality processing times by 15%.

DATA ENGINEER

FACEBOOK
10.2020 - 11.2021
  • Engineered, maintained, and validated data pipelines pivotal to the Messenger Ecosystems, achieving a 12% reduction in query latency, thereby enhancing data processing speed and overall system responsiveness.
  • Developed and deployed 15 interactive data visualization dashboards enabling stakeholders to analyze key metrics, such as MAU and DAV, across various splits for comprehensive insights in making informed business decisions.
  • Collaborated seamlessly across teams to enhance data quality, identifying and rectifying anomalies in the Instagram usage flow, resulting in a 8% improvement in data accuracy in fired events.

DATA ENGINEER - LEAD GENERATION TEAM

ZAPLABS
05.2018 - 10.2020
  • Optimized data processing by implementing efficient ETL pipelines and streamlining database design.
  • Automated real estate data cleaning, reducing data quality anomalies and errors by 30%.
  • Designed data models and implemented 10+ Airflow DAGs to construct vital datasets for downstream data products.

Education

BA IN DATA SCIENCE -

UNIVERSITY OF CALIFORNIA, BERKELEY
Berkeley, CA
12.2019

Skills

  • ETL Development
  • Data Modeling
  • Data Governance
  • Data Visualization
  • Data Warehousing

Research

  • CITRIS TECH FOR SOCIAL GOOD, HEAD UNDERGRAD RESEARCHER, 08/2018 - 05/2020, Berkeley, CA, Leading a technical research project to analyze bots on social media using NLP and Social Network Analysis. Speaking as a panelist at a "Digital Equality" conference at Univeristy Tec de Monterrey in Mexico City.
  • FRANKFURT BIG DATA LAB, DATA SCIENCE RESEARCHER, 08/2017 - 12/2017, Berkeley, CA, Used the Geisinger Health System clinical dataset to find the correlations among depression, insomnia/sleeping disorders and anxiety. Publication submitted.

Timeline

Data Engineer

Twitch
10.2021 - 01.2024

DATA ENGINEER

FACEBOOK
10.2020 - 11.2021

DATA ENGINEER - LEAD GENERATION TEAM

ZAPLABS
05.2018 - 10.2020

BA IN DATA SCIENCE -

UNIVERSITY OF CALIFORNIA, BERKELEY
Simarjeev Singh