Data Engineer with 3+ years of experience in designing and optimizing data pipelines on AWS and Azure. Expert in ETL processes, big data (Hadoop, Spark, Kafka), and programming in Python, SQL, and Scala. Proven success in cloud migrations, infrastructure automation (Terraform), workflow orchestration (Airflow), data visualization (Tableau, Power BI), financial analysis, machine learning, and IoT. MS in Computer Science with multiple academic excellence awards. Motivated to tackle new challenges.
Programming Languages: Python 37/27, C, C, SQL
Database Tools: Oracle, MS SQL Server, MySQL, PL/SQL, Teradata
Reporting Tools: Power BI, Tableau
Web Programming: HTML, CSS
Cloud Technologies: AWS (S3, EC2, EMR, Lambda, RDS), Azure (Data Factory, Data Lake, Databricks, Synapse Analytics)
Data Formats: CSV, JSON, TXT, XML
Operating systems: Windows, Mac, Linux, Unix
Technologies/Tools/IDEs: PyCharm, Visual Studio, Jupyter Notebook, Eclipse, DBeaver
Big Data Technologies: Hadoop, Spark, Hive, Kafka, MapReduce