10+ years of experience testing Data Warehousing, Big Data, ETL, and cloud-based data solutions across GCP, AWS, and Azure. Expertise in designing and executing robust data quality frameworks for large-scale data systems. Skilled in validating end-to-end batch and streaming data pipelines for accuracy, completeness, and performance in high-volume processing environments. Proficient in automation scripting, SQL-based testing, API validation, and cloud data platform verification. Well versed in GCP services, including Dataproc, Vertex AI Workbench, BigQuery, Pub/Sub, Dataflow, Data Discovery, Looker Studio, and Serverless frameworks (session/batch). Experienced in data modeling, ETL/ELT testing, and data pipeline validation across GCP and AWS environments. Hands-on experience with Kafka, ensuring real-time data ingestion and seamless integration with BigQuery and other GCP services. Practiced in testing end-to-end BigQuery data pipelines, validating schemas, transformations, and large-scale querying. Expertise in functional, regression, and performance testing of data pipelines and analytical workflows using Python, PySpark, and SQL. Experience in test automation for data workflows, using tools such as Airflow, pytest, and Google Cloud SDK. Adept at collaborating with developers, analysts, and business stakeholders to deliver error-free, production-ready data solutions.
Data Validation & Testing: Data quality checks, ETL testing, big data validation, cloud data warehouse testing, regression testing, back-end testing, source-to-target validation, data migration testing