Resourceful AWS Data Engineer experienced in evaluating and assessing client requirements and implementing infrastructure to solve identified problems. Harnessed code and cloud-native technologies to create scalable and user-eccentric systems. Strong negotiator with excellent value-driven solutions.
Big Data Technologies: Hadoop, MapReduce, HDFS, Sqoop, PIG, Hive, HBase, Oozie, Flume, NiFi, Kafka, Zookeeper, Yarn, Apache Spark, Mahout, Sparklib
Databases: Oracle, MySQL, SQL Server, MongoDB, Cassandra, DynamoDB, PostgreSQL, Teradata, Cosmos
Programming: Python, PySpark, Scala, Java, C, C, Shell script, Perl script, SQL
Cloud Technologies: AWS, Microsoft Azure
Frameworks: Django REST framework, MVC, Hortonworks
Tools: PyCharm, Eclipse, Visual Studio, SQL*Plus, SQL Developer, TOAD, SQL Navigator, Query Analyzer, SQL Server Management Studio, SQL Assistance, Eclipse, Postman
Versioning tools: SVN, Git, GitHub
Operating Systems: Windows 7/8/XP/2008/2012, Ubuntu Linux, MacOS
Network Security: Kerberos
Database Modelling: Dimension Modeling, ER Modeling, Star Schema Modeling, Snowflake Modeling
Monitoring Tool: Apache Airflow
Visualization/ Reporting: Tableau, ggplot2, matplotlib, SSRS and Power BI
Machine Learning Techniques: Linear & Logistic Regression, Classification and Regression Trees, Random Forest, Associative rules, NLP and Clustering