
Results-driven data engineer with 6+ years of experience in generative AI, Python, Java, Scala, PySpark, and AWS. Proven expertise in building scalable data pipelines, optimizing Spark jobs (up to 25% performance improvement), and developing distributed systems with Scala and Akka. Skilled in implementing Retrieval-Augmented Generation (RAG), fine-tuning large language models, and building intelligent chatbots with LangChain. Hands-on experience with Terraform-based infrastructure automation, Jenkins CI/CD pipelines, and cloud-native solutions on AWS (Glue, Lambda, Step Functions, S3, EC2). Adept at cost optimization, advanced data modeling, and workflow automation that drive efficiency and business value. Strong collaborator with cross-functional teams, delivering high-impact technical solutions aligned with organizational goals.