
Results-driven Data Engineer with 8+ years of experience in Big Data technologies, specializing in building scalable data pipelines and large-scale data transformations. Expertise in Hadoop, Sqoop, Hive, Spark, AWS, Azure, SQL, and Python. Strong background in distributed computing, cloud platforms, and real-time data processing. Passionate about optimizing performance and driving business insights through data engineering solutions.
Hadoop
MapReduce
Yarn
Hive
Pig
HBase
Kafka
Oozie
Spark
RDD
DataFrame
Dataset API
Spark SQL
Spark Streaming
Python
PySpark
Pandas
NumPy
Matplotlib
Seaborn
Java
Scala
Shell Scripting
AWS
S3
EMR
Glue
Redshift
Lambda
Microsoft Certified, Azure Fundamentals – Microsoft Corporation.