- Acted as a subject matter expert in analyzing requirements and preparing specifications for data ingestion pipelines across multiple Google Cloud BigQuery projects.
- Owned end-to-end data solutions in high-visibility roles, collaborating with cross-functional teams including business analysts, developers, QA, and reporting teams.
- Led the FBI reporting modernization project by migrating legacy Oracle-based ETL processes to BigQuery and transitioning reporting from OBIEE to Cognos.
- Collaborated with BI teams to troubleshoot and resolve data issues between BigQuery datasets and reporting tools during QA and UAT cycles.
- Designed and implemented batch and near real-time data ingestion pipelines into BigQuery using Cloud Composer (Airflow), Dataflow, and Python, supporting internal business reporting use cases.
- Created and maintained complex BigQuery tables, views, and analytical SQL queries for downstream reporting and analytics.
- Orchestrated ETL and data integration workflows using Cloud Composer and Control-M, ensuring efficient job scheduling and error handling.
- Converted legacy Unix-based GCP export batch jobs from Edge nodes into modular, maintainable Python code.
- Built Dataflow pipelines in Java, using both built-in templates and custom code, to ingest near real-time data from Cloud Spanner and MongoDB via Pub/Sub into BigQuery for analytics.
- Created a proof of concept (POC) using Dataflow and Python for MongoDB-to-BigQuery migration, showcasing flexibility in tool adoption.
- Integrated source code via GitLab and managed deployment workflows to UAT and Production environments using Jenkins CI/CD pipelines.
- Delivered four high-impact BigQuery projects within eight months through effective planning, execution, and stakeholder coordination.
- Supervised and reviewed offshore development activities, ensuring code quality and timely delivery.
- Monitored Dataproc Spark jobs and led a POC initiative to convert Spark jobs to PySpark, improving pipeline maintainability and performance.
- Applied CI/CD practices hands-on using Git, Jenkins, and cloud-native deployment patterns.
Skills: Python, Java, PySpark, Google Cloud Platform (BigQuery, Cloud Composer, Dataflow, Pub/Sub, Spanner, Dataproc), Airflow, Control-M, Jenkins, GitLab, GitHub, MongoDB, Cognos, OBIEE, Jira