•Successful Senior Technical Program Manager with career spanning 14+ years in Complex technical environments.
• 10+ years of experience in Big Data,Snowflake, AWS, Redshift, Python,Pyspark, Hadoop and Various Hadoop ecosystems like-Pig, Hive, Sqoop and Spark.
• Over 12 years of experience in ETL tools like Informatica power Centre, Matillion, StreamSets, oracle application development using ORACLE, RDBMS, SQL, PL/SQL and Reporting tool such as SAP BO.
• Expertise in Gen AI , NLP and LLM models like RAG and Llama
• Expertise in Data Observability features like Freshness, Volume , Distribution etc.
• Expertise in exploratory data analysis, data preprocessing and data modeling.
• Expertise in Big Data Analytics with Spark and Python.
• Experienced in formulating and refining business objectives, data selection, data preparation and model evaluation.
• Experienced in data mining on massive datasets by writing ad-hoc Python scripts, Hive and Pig queries using Hadoop framework.
• Experienced in big data analysis, move data between relational databases and Hadoop using SQOOP, manage data in HDFS, use Pig and Hive to run distributed queries on data.
Sound understanding of Hadoop Map Reduce and YARN framework and interaction among Big Data tools for data sharing and data mining.
• Proficient in Star and snow flake Schemas, Extract, Transform and Load of Data, Requirement Analysis, Design, Development, Testing, Documentation, Implementation of Business Applications and repository management and implementation.
Expertise in Data Warehouse Concepts. Experience in Informatica Performance tuning of Target, Sources, Transformations and Sessions.
AWS -Redshift, EMR, EC2, Lambda