Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Sohum Patel

Berlin,CT

Summary

Data Engineer | Python, SQL, Google Cloud Platform

Overview

5
5
years of professional experience
1
1
Certification

Work History

Financial Market Stack Data Pipeline

02.2026 - Current
  • Designing and implementing a cloud-based ETL pipeline regarding dividend and other financial data using Google Cloud Platform services including Cloud Run for deploying containerized ETL pipeline, BigQuery for analyzing trends, Pub/Sub for ensuring ......, and Cloud Scheduler for running the pipeline on a weekly basis.

Data Engineer

Cognixia USA
09.2022 - 12.2024
  • Supported and improved enterprise ETL/ELT pipelines by adding new features, optimizing SQL transformations, and maintaining Ab Initio pipelines to ensure accurate ingestion of healthcare datasets into BigQuery.
  • Achieved ~50% improvement in query performance by implementing performance tuning and optimization strategies such as table partitioning and clustering in BigQuery, resulting in significant data scan reduction and improved cost efficiency.
  • Integrated data from Hadoop, SQL Server, and DB2 using Infoworks and in-house applications to centralize enterprise datasets for analytics. Conducted unit testing and data validation to ensure accuracy and reliability of datasets.

Software Engineer Intern

Tech180
05.2021 - 07.2021
  • Automated generation of SolidWorks 3D models and 2D drawings using Python scripts integrated with a MySQL database that contained customer requirements, reducing design time by up to 70%.

Education

Bachelor of Science - Computer Science And Engineering

University of Connecticut
Storrs, CT
12.2021

Skills

    Languages & Scripting: Python, SQL (BigQuery, PostgreSQL, MySQL)
    Cloud & Big Data: Google Cloud Platform (BigQuery, Looker, Storage, Dataproc, Composer, Cloud Run, Cloud Functions), Hadoop (Hive)
    Data Engineering & ETL: ETL/ELT pipelines, Data ingestion & transformation, Data warehousing, Performance tuning & optimization, Ab Initio, Infoworks
    Orchestration & Workflow: Apache Airflow, Tidal
    Databases: SQL Server, DB2
    Other Tools & Technologies: Linux, Git, Agile methodologies, Data validation & testing

    Professional Skills: Time management, Multitasking, Calm under pressure, Verbal communication

Certification

  • Google Cloud Certified – Associate Cloud Engineer
  • Google Cloud Certified – Cloud Digital Leader
  • Databricks – Databricks Fundamentals

Timeline

Financial Market Stack Data Pipeline

02.2026 - Current

Data Engineer

Cognixia USA
09.2022 - 12.2024

Software Engineer Intern

Tech180
05.2021 - 07.2021

Bachelor of Science - Computer Science And Engineering

University of Connecticut
Sohum Patel