Summary
Overview
Work History
Education
Skills
Timeline
Generic

Prem Zakkam

Evans,GA

Summary

  • A skilled Data Engineer with 8+ years of Progressively responsible experience in Data Engineering, Business Intelligence, Data Analysis, ETL, Data warehousing and developing Software Applications.
  • Worked on architecting, designing, and implementation of large-scale data and analytics solutions on Snowflake Cloud Data Warehouse.
  • Efficient in preprocessing data including Data cleaning, Correlation analysis, Imputation,Visualization, Feature Scaling and Dimensionality Reduction techniques using Machine learning platforms like Python Data Science Packages (Scikit-Learn, Pandas, NumPy).
  • Proficient in Azure Data & Analytics PaaS Services: Azure Data Factory, Azure Data Lake, Azure Synapse Analytics, Azure Databricks, Azure IoT, Azure HDInsight + Spark, Azure Cosmos DB, Azure Stream Analytics, and Azure SQL DB.
  • Skilled in conducting data profiling, cataloging, and mapping for technical design and construction of technical data flows.
  • Knowledge of data privacy and security regulations and best practices related to Azure and AI solutions.
  • Proficient in leveraging AI/ML libraries and frameworks such as scikit-learn, TensorFlow, and PyTorch to design scalable and robust solutions. Implemented and fine-tuned Large Language Models for natural language processing tasks, including text summarization, sentiment analysis, and language translation.
  • Proficient in Data Migration, Data Profiling, Data Ingestion, Data Cleansing, Transformation, Data Import, and Export using multiple Cloud-Based ETL tools, including Azure Data Factory.
  • Developed ETL models on Azure Data Factory using Scala to migrate data into Snowflake. Strong experience and knowledge in Data Visualization with Tableau, Power BI (DAX functions, Enterprise gateway, Personal gateway), and integration with various data sources.
  • Building the Tableau dashboards to provide the effectiveness of weekly campaigns and customer acquisition.

Overview

9
9
years of professional experience

Work History

Sr. Data Engineer

AssistRx
07.2022 - Current
  • Design and develop scalable, efficient, and reliable data pipelines and architectures to support the company's data needs. This involves understanding business requirements and translating them into technical solutions.
  • Leading end-to-end Big Data projects on Azure, engaging stakeholders, gathering requirements, and coordinating technical aspects.
  • Optimize data pipelines and processes for performance, scalability, and efficiency. This includes tuning database queries, optimizing ETL workflows, and implementing caching mechanisms.
  • Collaborated with cross-functional teams to define requirements and develop end-to-end solutions for complex data engineering projects.
  • Leveraging Azure Synapse Analytics (formerly Azure SQL Data Warehouse) for data handling capabilities, employing Azure Databricks, PySpark, and Azure Stream Analytics for comprehensive data understanding.
  • Designing and enforcing data security measures in Azure Synapse Analytics, managing ETL solutions, and automating operational processes.
  • Employing Azure Synapse Analytics to transform and move substantial amounts of data between different Azure data stores and databases.
  • Configuring Azure Synapse Analytics objects such as landing tables, staging tables, and queries, leveraging streaming data from sources like Azure Event Hubs and storing in NoSQL databases.
  • Collaborating with IT teams for smooth data operations, including integration with other systems and platforms.
  • Implementing Azure native services like Azure Event Hubs, Azure Service Bus, and Azure Functions, managing cross-functional dependencies and custom library generation.
  • Ran statistical analyses within software to process large datasets.
  • Prepared documentation and analytic reports, delivering summarized results, analysis and conclusions to stakeholders.

Data Engineer

Atmos Energy
01.2019 - 06.2022
  • Worked on architecting, designing, and implementation of large-scale data and analytics solutions on Snowflake Cloud Data Warehouse.
  • Design of workflow which includes setting up DEV, QA, and PROD environments, creating users, and managing their permissions.
  • Copy activity, Custom ETL Pipeline Activities for On-cloud ETL processing using various source systems like SAP, Salesforce, Various enterprise-level applications.
  • Migrating data from on-premises databases (SQL Server) to Cloud databases/Snowflake.
  • Implemented snow pipes to auto ingest data from various file formats (Parquet, JSON, CSV) to snowflake tables.
  • Creating Complex SQL Queries using Views, Indexes, Triggers, Roles, Stored procedures, and User Define Functions.
  • Developed scripts (Python) to do Extract, Load, and Transform data.
  • Design & implement strategies to build new data quality frameworks to replace old systems in place.
  • Working with Agile environment and using rally tool to maintain the user stories and tasks.
  • Collaborate with business/user groups to gather requirements and design solutions using Azure cloud "big data" services.
  • Develop event-driven data architectures with Azure Event Grid and Azure Functions, creating pipelines and data flows using Python and Spark in Azure Databricks.
  • Automate job execution in Azure Data Factory using triggers and provision Azure Databricks clusters for optimal processing.
  • Develop Spark applications with PySpark and Spark SQL for data extraction, transformation, and aggregation, specifically for Azure Synapse Analytics data analytics.
  • Monitor, automate, and refine data engineering solutions using Azure Monitor and Azure Automation for seamless data management and processing.
  • Implement CI/CD pipelines in Azure DevOps with secure endpoint connections to ensure integrity and security of data during deployments.

Jr. Data Engineer

Bank Of America
06.2015 - 12.2017
  • Worked on data integration projects using ETL tools like SSIS, Informatica, and Talend Studio to extract data from various sources like Oracle, MySQL, SQL Server, and load into Snowflake cloud data warehouse.
  • Developed and Maintained Enrich layer in SQL server for Marketing Analytics Team.
  • Experience working with cross-functional teams distributed across the globe.
  • Assisted solution providers with the definition and implementation of technical.
  • Primarily Involved in Data Modelling in PowerBI and Maintaining workspaces and deployment pipelines.
  • Designed and developed data ingestion pipelines using Nifi and Sqoop to move data between Hadoop and other data systems like AWS S3, Azure Blob storage, and Redshift.
  • Created and maintained data workflows using Apache Airflow to schedule and monitor ETL jobs, ensuring data quality and accuracy.
  • Helped team with Automating tasks using Python and SQL.
  • Developed Talend studio building jobs for data migration and data warehousing on multiple projects.

Education

Master of Science - Computer Science

Campbellsville University
Campbellsville, KY
12.2018

Bachelor of Technology - Electronics & Communication Engineering

Sree Datta College of Engineering & Science
Hyderabad, India
05.2016

Skills

  • Methodologies:
  • SDLC, Agile, Waterfall, Scrum

  • Languages:
  • Python, R, C, Scala, SQL, Unix Shell Script Java, C#

  • Big Data:
  • Hadoop, HDFS, Yarn, Sqoop, Oozie, Hive, HBase, Spark, Impala, Nifi, Cassandra, Apache Airflow, Databricks

  • ETL/ELT Tools:
  • SSRS, SSIS, SSAS, Informatica, Matillion, Azure Data factory, DB

  • Databases:
  • MySQL, SQL Server, Snowflake cloud, SQL, NoSQL, MySQL, SQL Server DB2, PostgreSQL, Oracle, MongoDB

  • Tools/ IDE/ Build Tools:
  • PowerBI, Tableau, Talend Studio, Git, Git Bash, Eclipse, IntelliJ, Maven, ANT, Jenkins, GitHub, Jira, Snowflakes, Bitbucket, Data pipelines

  • Cloud Computing:

    AWS (S3, CloudWatch, Athena, RedShift, EMR, EC2, DynamoDB), Azure (Azure Data Factory, Azure Blob, Azure Databricks), IAM, Secret Manager, S3, Lambda, CloudWatch, Messaging Queue (SNS & SQS), Azure - ADF, Blob Storage

  • Data Analytics Skills:
  • Data Cleaning, Data Masking, Data Manipulation, Data Visualization, Data Analysis

  • Digital Ocean:
  • Droplets, Spaces

  • BI & CRM Tools:
  • Tableau, Microsoft Business Intelligence (Power BI), Sigma Computing

  • Packages:
  • NumPy, Pandas, Matplotlib, SciPy Scikit-learn, Seaborn, TensorFlow

  • File Formats:
  • Parquet, Avro, ORC, JSON

  • Operating System:
  • Windows, Linux, Unix, Macos

Timeline

Sr. Data Engineer

AssistRx
07.2022 - Current

Data Engineer

Atmos Energy
01.2019 - 06.2022

Jr. Data Engineer

Bank Of America
06.2015 - 12.2017

Master of Science - Computer Science

Campbellsville University

Bachelor of Technology - Electronics & Communication Engineering

Sree Datta College of Engineering & Science
Prem Zakkam