Dynamic Sr. Data Engineer with extensive experience at Tech Mahindra, specializing in building robust data pipelines and optimizing ETL processes using Python and Airflow. Proven ability to enhance data quality and streamline workflows, showcasing strong analytical skills and effective collaboration with cross-functional teams to drive impactful data solutions.
Environment: Python, BigQuery, Cloud Composer/Airflow, MySQL, Cloud Storage, GitHub.
Environment: GCP, SQL, PySpark, Spark SQL, Astronomer/Airflow, Hive, GitHub.
Environment: GCP, SQL, PySpark, Logstash, Teradata, GitHub
Environment: GCP, SQL, Spark- PySpark, Spark SQL, Logstash, Kibana, Jenkins, GitHub
Environment: Hive, Zookeeper, Talend, and GitHub..
Environment: IDQ Developer & Analyst, PowerCenter Designer, Workflow Manager , MS SQLserver
Tableau, Custom Shell Scripts, Splunk, Grafana, Maven, Git, SVN, Jenkins, SQL, JavaScript, Shell Scripting, Python, HiveQL, Oracle, MY SQL, MS SQL Server, Teradata, Postgres SQL, Informatica PowerCenter, Infoworks, Linux, Unix, Windows 8, Windows 7, Windows Server 2008/2003, S3, Redshift, EMR, Lambda, Setup, configuration, data streaming, integration , Informatica MDM , Informatica IDQ