
Experienced Senior Data Engineer with over 8 years of expertise in data engineering, metadata management, and data governance. Specialized in Alation, Databricks, Spark, Python, SQL, Azure, and Starburst, with proven ability to design and implement data catalog solutions, governance frameworks, and scalable ETL pipelines. Strong expertise in metadata onboarding, data lineage, data quality, and catalog configuration. Adept at working in cloud environments (Azure, AWS, Snowflake) with experience integrating metadata from SQL Server, Oracle, Azure Cloud, and Snowflake. Hands-on experience with Starburst for data federation and optimizing distributed queries across cloud data platforms. Demonstrated ability to collaborate with cross-functional teams, optimize big data solutions, and drive data governance best practices. Skilled in supporting data visualization and preparation tools such as Power BI, Tableau, and Alteryx, ensuring clean, reliable, and actionable data pipelines for analytical consumption. Extensive hands-on experience in Python scripting for automating data workflows, building transformation logic, and managing orchestration tasks using Airflow and DBT. Proficient in PySpark for processing large-scale distributed datasets in Databricks and AWS Glue environments, optimizing performance and resource usage. Practical experience with Airtable for managing datasets, automating workflows, and integrating external data sources, ensuring smooth collaboration and streamlined reporting. Working knowledge of Palantir Foundry, including experience collaborating on data modeling and workflow design using Workshop and Quiver, with exposure to Slate for user interface components. Created pipelines in ADF using Linked Services/Datasets/Pipeline to Extract, Transform, and Load data from multiple sources like Azure SQL, Blob Storage, and Azure SQL Data Warehouse. Developed JSON scripts for deploying ADF pipelines that process large-scale data using SQL Activities.