
Senior Data Engineer with 8+ years of experience across the full SDLC, specializing in Big Data, Cloud, and GenAI solutions. Expert in building scalable data pipelines with Python, Spark (PySpark), and Airflow on AWS, Azure, and Hadoop ecosystems. Proven success deploying multimodal GenAI applications (e.g., CLIP, BLIP, LLMs) with FastAPI, Docker, and CI/CD. Skilled in synthetic data generation, model evaluation (BLEU, ROUGE), model explainability (SHAP), and responsible AI deployment. Hands-on experience building high-performance ETL with SQL across Redshift, Oracle, PostgreSQL, and Salesforce. Proficient in MLOps (SageMaker, Azure ML, MLflow), NoSQL (MongoDB, Cassandra), and Terraform for infrastructure automation. Strong background in data integration, quality, governance, and automation across cloud-native platforms.