Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic

Rahul Sai Akina

Tampa,FL

Summary

Results-driven Data Engineer with a proven track record at Country Financial, specializing in ETL pipeline development and data migration. Expert in Spark and Snowflake, achieving a 30% reduction in processing time. Strong collaborator with a focus on data integrity and security, leveraging Python, Azure, and AWS technologies to deliver impactful solutions.

Overview

6
6
years of professional experience
1
1
Certification

Work History

Azure Data Engineer

Country Financial
Bloomington, IL
06.2024 - Current
  • Executed a one-time data migration from SQL Server to Snowflake using Python and SnowSQL.
  • Developed ETL pipelines for seamless data flow in and out of the data warehouse.
  • Created reusable Hive UDF libraries to enhance querying capabilities for users.
  • Designed and integrated data ingestion and aggregation within Hadoop environment.
  • Built a Spark streaming application to format raw packet data from Kafka into JSON.
  • Optimized Spark jobs on Databricks, reducing processing runtime by 30% for large datasets.
  • Utilized Elasticsearch and Kibana for indexing and visualizing real-time analytics results.
  • Managed project lifecycle encompassing design, development, testing, deployment, and support.

Data Engineer

Baxter
Deerfield, IL
05.2023 - 01.2024
  • Implemented data security measures, ensuring compliance with AWS policies and industry regulations.
  • Developed processes to guarantee data quality and integrity across systems.
  • Optimized AWS Lambda performance, achieving 40% reduction in execution costs through efficient settings.
  • Executed data loading from BDW Oracle and Teradata into HDFS using Sqoop.
  • Created interactive web screens utilizing AJAX, JSON, and JavaScript technologies.
  • Automated infrastructure management with Terraform, reducing overhead by 90% while enhancing uptime.
  • Designed User-Defined Functions (UDFs) in Scala and PySpark to address business-specific needs.
  • Conducted performance tuning of Snowflake data warehouse, improving query execution times.

Big Data Engineer

CreditAccess Grameen
Bangalore, India
03.2021 - 07.2022
  • Managed data storage and processing on GCP, selecting optimal storage solutions like Cloud Storage and Cloud Datastore.
  • Executed transformations using Spark, saving results back to HDFS for integration with Snowflake.
  • Configured and scheduled cluster resources via Azure Kubernetes Service to enhance operational efficiency.
  • Created Data Studio reports for billing insights, optimizing queries to support cost-saving initiatives.
  • Developed multiple PySpark and Spark SQL notebooks in Databricks for data extraction and transformation based on business needs.
  • Designed data integration solutions with Azure Data Factory, facilitating data movement between on-premises and cloud systems.
  • Built scalable, real-time data pipelines using Google Cloud Dataflow and Apache Beam to process high-volume event data.
  • Monitored GCP services with Google Cloud SDK, improving incident resolution speed and system reliability.

Data Engineer

Lifestyle International Pvt. Ltd
Bangalore, India
09.2019 - 02.2021
  • Collaborated with data analysts and business stakeholders to define reporting and analysis requirements.
  • Designed and implemented efficient ETL pipelines using Python API (PySpark) of Apache Spark.
  • Managed loading and transformation of large structured and semi-structured datasets, executing Hive queries for analysis.
  • Scheduled and optimized job execution on Azure virtual machines with Control-M for resource allocation.
  • Utilized Apache Sqoop for bulk data transfers between Apache Hadoop and Oracle databases for forecasting.
  • Ingested real-time weblogs via Kafka into Spark Streaming, conducting data quality checks and flagging results.
  • Provisioned Databricks clusters for batch and streaming data processing, ensuring required libraries were installed.
  • Authored JSON scripts for Azure Data Factory (ADF) pipeline deployment, orchestrating SQL-based data processing.

Education

Master of Science - Management Information Systems

University of Illinois At Springfield
Springfield, IL
12-2023

Skills

  • Big Data Technologies: HDFS, Hue, MapReduce, PIG, Hive, HCatalog, HBase, Sqoop, Impala, Zookeeper, Flume, Kafka, Yarn, Cloudera Manager, Kerberos, Pyspark, Airflow, Kafka, Snowflake, Spark Components
  • AWS: S3, Redshift, Lambda, EMR
  • Azure: Azure Data Lake, Azure Synapse Analytics, Azure Databricks
  • GCP: Big Query, Cloud Dataflow, Cloud Storage
  • Visualization & ETL tools: Tableau, PowerBI, Informatica, Talend
  • Programming Languages: Python, SQL, Java/Scala
  • Web/Application server: Apache Tomcat, WebLogic, WebSphere
  • Version controls and Tools: GIT, Maven, SBT, CBT
  • Data Processing Frameworks: Apache Hadoop, Apache Spark, Apache Flink, Apache Beam

Certification

  • Microsoft Certified Azure Data Fundamentals
  • AWS Certified Data Engineer Associate

Timeline

Azure Data Engineer

Country Financial
06.2024 - Current

Data Engineer

Baxter
05.2023 - 01.2024

Big Data Engineer

CreditAccess Grameen
03.2021 - 07.2022

Data Engineer

Lifestyle International Pvt. Ltd
09.2019 - 02.2021

Master of Science - Management Information Systems

University of Illinois At Springfield
Rahul Sai Akina
Want your own profile? Create for free at Resume-Now.com