
Highly skilled and dedicated Data Engineer with over 6 years of experience in software analysis, design, development, and implementation of Cloud and Big Data solutions. Proficient in leveraging technologies such as Big Query, Spark, Scala, Hadoop, and Oracle Database to build and maintain robust data pipelines. Hands on experience in Normalization and De - Normalization techniques for optimum performance in relational and dimensional database environments. Skilled in leveraging innovative technologies and approaches to renovate, extend, and transform core data assets, including SQL-based, NoSQL-based, and Cloud-based data platforms. Extensive expertise in developing data models, pipeline architectures, and providing ETL solutions for project models. Managed end-to-end operations of ETL data pipelines using Matillion on AWS Cloud Services and Azure Data Factory on Azure Cloud Services, ensuring seamless data ingestion, Data Processing/Transformation,Data Curation. Proven ability to design and specify Informatica ETL processes, optimizing schema loading and performance. Skilled in ETL architecture design and implementation, consistently delivering high-performance solutions. Certified in software engineering concepts, with hands-on experience in system design, application development, testing, and operational stability. Proficient in coding using modern programming languages and database querying languages, ensuring efficient and maintainable code. Utilized JIRA as a project management tool to effectively track and prioritize data engineering tasks, ensuring timely delivery of projects and seamless collaboration with cross-functional teams. Experienced in working with Agile methodologies, facilitating iterative and incremental development cycles, and promoting efficient communication and collaboration within the team. Knowledge and skills in secondary tools such as Microsoft Azure, SQL data warehouse, PolyBase, and Visual Studio. Proficient in SQL and other relational databases. Experienced in integrating Power BI reports into other applications using embedded analytics (Power BI service or API automation) and developing custom visuals for Power BI. Proficient in utilizing Python for data manipulation, analysis, and scripting, enabling efficient data processing and transformation. Skilled in PySpark, leveraging the power of Apache Spark for distributed data processing, machine learning, and real-time analytics. Strong troubleshooting and problem-solving skills, capable of identifying and resolving.