Summary
Overview
Work History
Education
Skills
Professional Accomplishments
Timeline
Generic

Ketan Naulakha

Summary

Result-driven Data Engineer with over 4 years of experience in designing and implementing data pipelines, cloud solutions, and data modeling. Specialized in the Media and Entertainment domain, with a proven track record of improving system efficiency and storage optimization. Skilled at collaborating with clients to deliver customized solutions, leveraging advanced big data technologies to meet business goals.

Overview

4
4
years of professional experience

Work History

Data Engineer

InfoCepts
, India
08.2023 - Current
  • Led a team of 2 members, overseeing the development of custom Spark jobs.
  • Increased storage efficiency by 30% through Iceberg, with ZSTD compression format.
  • Migrated 30+ SQL stored procedures to Scala using Dataset API, enhancing maintainability.
  • Successfully transitioned query execution from SingleStore to Amazon Athena.
  • Enhanced project architecture and debugging efficiency by 70%.

Cloud Data Engineer

InfoCepts
, India
02.2022 - 07.2023
  • Designed and executed Spark jobs for various data processes, reducing processing time by 30% and improving system efficiency by 25%.
  • Increased test case coverage by 60% through 90+ functional unit tests integrated into GitLab CI/CD pipelines.
  • Automated the creation of SQS queues with tailored policies using CloudFormation templates.
  • Deployed two serverless EMR solutions, cutting costs by over $4k for three clients.
  • Developed basic GET/POST APIs for business user interfaces.

Associate Data Analyst

InfoCepts
, India
12.2020 - 01.2022
  • Translated complex BTEQ scripts in Teradata into Spark SQL for Databricks execution.
  • Played a key role in developing an ETL pipeline with custom Spark jobs for large-scale data handling.
  • Optimized SQL queries to streamline data imports from raw layers.

Education

Bachelor of Technology - Information Technology

SRM Institute of Science And Technology
Chennai, India
03-2020

Skills

  • Big Data Tools: Apache Spark, Databricks,Iceberg, Snowflake
  • Programming Languages: SQL, Scala, Python
  • Cloud Services: AWS (Lambda, EMR Serverless, Athena, S3, Secrets Manager, SQS, SNS, CloudFormation)
  • Orchestration Tools: Airflow, AWS Step Functions
  • Others: CI/CD (GitLab), Pandas, Semi/Unstructured Data Handling (JSON, Log Files)

Professional Accomplishments

  • AWS Certified Cloud Practitioner
  • Awarded "Dream Team" twice for valuable contributions to projects.

Timeline

Data Engineer

InfoCepts
08.2023 - Current

Cloud Data Engineer

InfoCepts
02.2022 - 07.2023

Associate Data Analyst

InfoCepts
12.2020 - 01.2022

Bachelor of Technology - Information Technology

SRM Institute of Science And Technology
Ketan Naulakha