Having Around 13 Months of Professional IT Experience as an Data Engineer.
Hands on Experience on data ingestion, data processing and data optimization (Databricks, Data Lake, Blob storage, Azure SQL& Data Factory)
Have Knowledge on integrating Azure Databricks with other Azure services such as Azure Data Lake Storage, Azure Event Hub.
Hands-on experience with Cloud Storage and Cloud SQL for data storage and management
Involved in building, and deploying scalable and reliable data workflows using Apache Airflow
Experience in working with Azure Power BI for data visualization.
Good understanding of the core concepts of Apache Airflow, such as Directed Acyclic Graphs (DAGs), Operators, Sensors, and Executors
Experience with Agile/Scrum methodology.
Experience in working in 24X7 Support and used to meet deadlines, adaptable to ever changing priorities.
Hands on Experience on writing SQL Queries as part of testing the Data.
Excellent Communication, Presentation, Interpersonal, Strong Troubleshooting and Organization Skills.
Overview
1
1
year of professional experience
Work History
Jr Data Engineer
Anthem
06.2021 - 07.2022
Involved in creating new pipelines on Azure Data Factory as data ingestion, worked with Azure Databricks (Spark) for data prep using Pyspark and load the data into Azure Storage and Azure Synapse for reporting and data science team
Reading files (Parquet, CSV) from Azure Data Lake, Blobs to Data frames using PySpark
Written different API scripts in python and scheduled the jobs using a computer engine
Creating the Measures according to the business logic in Power BI Desktop using Data analysis expressions (DAX)
Responsible for creating SQL datasets for Power BI and Ad-hoc Reports
Experience in building power bi reports on Azure Analysis services for better visual presentation
Have knowledge in integrating Apache Airflow with other tools and technologies commonly used in data engineering and data science workflows, such as Apache Spark, Hadoop, and various databases
Knowledgeable in working with various Airflow components such as Operators, Sensors, Hooks, and Executors to execute tasks and interact with external systems
Have hands-on experience in Python programming language, with a strong understanding of its syntax, data structures, and object-oriented principles
Proficient in SQL databases MSSQL Server, MySQL (RDBMS), Oracle DB
Worked on Spark Data frames, Temporary Tables for transformations
Created Python notebooks on Azure Databricks for processing the datasets and loading them into Azure SQL databases.
Education
Master’s degree - Data Engineering
Lewis University
Mar 2024
Bachelor’s degree - information technology
JNUTH College of Engineering
May 2021
Skills
Technical Skills
BI Tools:
Power BI Desktop & Power BI Service
Data Bases: SQL, T-SQL, Azure Data Lake Storage, Azure SQL
Operating Systems: Windows 07/XP
Microsoft Tools: MS-Office, MS-Power BI
ETL Tools: Azure Data Factory, Power Query
Data Warehousing: Star/Snowflake Schema, Data Marts
Data Visualization:
Power BI, MS Excel
Troubleshooting Tools: SQL Server Profiler, DAX Studio
Cloud Technologies: Microsoft Azure, Azure Data Bricks, Azure Synapse Analyzer
Programming Languages: Python, PySpark
Accomplishments
CERTIFICATIONS:
Responsible Conduct of Research (RCR) Basic 102255 - CITI Program