Summary
Overview
Work History
Education
Skills
Certification
Interests
IN DETAIL
Timeline
Generic

Vanitha V

Senior Data Test Engineer
Sunnyvale,CA

Summary

10+ years of experience in Data Warehousing, Big Data, ETL, and Cloud-based data solutions testing across GCP, AWS, and Azure. Expertise in designing and executing robust data quality frameworks for large-scale data systems. Skilled in validating end-to-end data pipelines to ensure data accuracy, completeness, and performance on batch and streaming data. Strong experience in batch and streaming data validation for high-volume processing environments. Proficient in automation scripting, SQL-based testing, API validation, and cloud data platform verification. Proficient in GCP services, including DataProc, Vertex AI Workbench, BigQuery, PubSub, Dataflow, Data Discovery, Looker Studio, and Serverless frameworks (session/batch). Experienced in data modeling, ETL/ELT testing, and data pipeline validation across GCP, and AWS environments. Hands-on experience with Kafka, ensuring real-time data ingestion and seamless integration with BigQuery and other GCP services. Proficient in testing end-to-end BigQuery data pipelines, validating schemas, transformations, and large-scale querying. Expertise in functional, regression, and performance testing of data pipelines and analytical workflows using Python, PySpark, and SQL. Experience in test automation for data workflows, including tools like Airflow, pytest, and Google Cloud SDK. Adept at collaborating with developers, analysts, and business stakeholders to deliver error-free, production-ready data solutions.

Overview

13
13
years of professional experience
3
3
Certifications

Work History

Senior GCP Data Test Engineer

Walmart Labs
10.2024 - Current
  • Designed and executed test cases for Kafka → Dataflow → BigQuery ingestion workflows.
  • Performed end-to-end ETL testing in GCP to ensure integrity and accuracy of data transformations, joins, and aggregations for customer 360 profiles.
  • Validated data accuracy, transformation logic, deduplication, and performance benchmarks across large-scale GCP DataProc jobs.
  • Conducted functional and regression testing for complex PySpark pipelines integrating Salesforce Data Cloud. Verified message delivery SLAs, schema evolution handling, and business rules compliance.
  • Built automated SQL validation scripts for cross-verifying transformed datasets against source systems.
  • Validated machine learning scoring outputs in Vertex AI for accuracy and consistency.
  • Created Airflow DAG validation scripts to ensure orchestration jobs executed successfully without data loss.
  • Conducted performance and load testing for large-scale Spark jobs on Dataproc.
  • Designed and executed test cases to validate Tableau dashboards for personalized recommendations and marketing analytics and delivered test summary reports for QA sign-off.
  • Ensured accurate data mapping from BigQuery to visualizations, with real-time update verification.
  • Tested integrations using Cloud Storage, Pub/Sub, Cloud Functions, and IAM for secure and reliable data access.
  • Managed test scripts and SQL queries in GitHub, maintaining version history for regression testing.

Senior GCP Data Tester

Bell Canada
04.2021 - 08.2024
  • Led data migration testing from on-prem Hadoop to GCP, validating row counts, data integrity, and transformation logic for advertising event data.
  • Responsible for all Client QA Deliverables and part of SQARB (Quality Review Board) across Verticals.
  • Lead Products - Requirements , Roadmap, Acceptance Criteria ,Heads QA Executions and Deliverables.
  • Performed real-time streaming data validation for Kafka and Spark Streaming pipelines carrying ad impressions, clicks, and conversion events.
  • Built PySpark-based data quality frameworks to detect anomalies such as sudden traffic spikes, missing attribution fields, or schema mismatches in ad logs.
  • Validated Salesforce marketing data ingestion into BigQuery, ensuring campaign, lead, and opportunity data matched across CRM and analytics systems.
  • Validated ad platform ingestion pipelines (Google Ads, Facebook) for accuracy, completeness, and freshness against API source reports.
  • Verified audience segmentation logic in data pipelines, ensuring correct targeting and exclusion lists for campaigns.
  • Validated attribution models (first-touch, last-touch, multi-touch) to ensure correct credit assignment across marketing channels.
  • Tested campaign performance metrics (CTR, CPC, CPM, ROAS) for accuracy across dashboards and raw data tables.
  • Worked with developers to troubleshoot ETL defects in ad data pipelines, logging issues in Jira and validating fixes in lower environments.
  • Automated end-to-end test execution in CI/CD pipelines using Docker and Jenkins, integrating with Airflow DAG run validations.

Data Test Engineer

GAP
01.2019 - 03.2021
  • Tested Azure Data Factory pipelines for retail demand forecasting data ingestion.
  • Conducted data accuracy and transformation testing in Databricks using PySpark.
  • Validated CDC-enabled pipelines for incremental data loads.
  • Built and executed SQL-based test scripts for Azure Synapse data marts.
  • Performed end-to-end reconciliation between raw retail sales data in ADLS and aggregated metrics in Power BI dashboards.
  • Validated error handling and retry mechanisms in Azure Data Factory pipelines for resilience during data ingestion failures.
  • Tested Power BI dashboards for data correctness against source data.

Data Tester

ONECount
06.2016 - 12.2018
  • Designed and executed real-time and batch data testing for AWS Glue and Kinesis pipelines.
  • Validated customer identity resolution logic for accuracy and performance.
  • Performed data consistency checks between S3, Redshift, and downstream analytics platforms.
  • Tested ETL transformations in AWS Glue to ensure correct mapping, filtering, and enrichment of streaming and batch datasets.
  • Verified event ordering and latency in Kinesis data streams to maintain sequence integrity for downstream analytics.
  • Automated data reconciliation scripts in Python and SQL to compare S3 raw data with processed Redshift tables on a scheduled basis.
  • Created Tableau dashboards for test result tracking and monitoring.

Java Developer

LIMS
07.2012 - 05.2014
  • Created and executed integration test cases to validate interactions between application modules and external systems.
  • Performed database validation testing using SQL queries to ensure data accuracy after CRUD operations.
  • Implemented automated regression test suites in JUnit to verify functionality after code changes.
  • Conducted API testing for REST and SOAP services to ensure correct request/response handling and data exchange.

Education

M.Tech - Software Technology

VIT University
Vellore, India
05-2016

B.Tech - Electronics & Communication Engineering

Anna University
Chennai, India
04-2011

Skills

Data Validation & Testing: Data quality checks, ETL testing, big data validation, cloud data warehouse testing, regression testing, back-end testing, source-to-target validation, data migration testing

Certification

Google Cloud Certified – Professional Data Engineer

Interests

Solution Design, Generative AI, Machine Learning, LLM, Music & Cricket, Prompt Engineering

IN DETAIL

  • Handles Product Backlogs and User Story Prioritizing, Ensure Product Quality and Documentation.
  • Understanding of ML models and methodologies for QA’ing training data. Incremental Service Automation.
  • Good collaboration and stakeholder management skills.
  • Understanding the customer and human centric designs and driving the quality of the teams.
  • Experience in functional and NF Testing and system back-out with Virtualization.
  • Manage & Handles Internal and External API Documentation

Timeline

Senior GCP Data Test Engineer

Walmart Labs
10.2024 - Current

Senior GCP Data Tester

Bell Canada
04.2021 - 08.2024

Data Test Engineer

GAP
01.2019 - 03.2021

Data Tester

ONECount
06.2016 - 12.2018

Java Developer

LIMS
07.2012 - 05.2014

M.Tech - Software Technology

VIT University

B.Tech - Electronics & Communication Engineering

Anna University
Vanitha VSenior Data Test Engineer