Detail-oriented Data Engineer with a strong foundation in Python, SQL, big data technologies, and cloud computing. Experienced in developing data pipelines, managing large-scale datasets, and implementing analytics solutions with Apache Spark, Hadoop, and AWS. Skilled in database querying, data modeling, and applying statistical methods to extract actionable insights. Demonstrated ability to work in Agile teams, contribute to embedded systems development, and collaborate with senior engineers on mission-critical projects. Passionate about building scalable data infrastructure and deploying machine learning models to support informed decision-making.
Python, Java, C, SQL, Apache Spark (MLlib, Spark SQL, Streaming), Hadoop (HDFS, YARN, MapReduce), Hive, Sqoop, AWS (EC2, S3, EMR, SWF, AWS CLI), Git, JIRA, Linux CLI, Shell Scripting, ETL, Data Modeling, Machine Learning (Basic), Data Visualization (Matplotlib, Power BI), Descriptive & Inferential Statistics, Probability, Linear Algebra, Hypothesis Testing, Regression
Full Stack Data Science