
Highly skilled and motivated Data Engineer with a proven track record of designing, building, and optimizing large-scale data pipelines and systems. Proficient in leveraging cutting-edge technologies to extract, transform, and load (ETL) data from diverse sources, ensuring high data quality, integrity, and accessibility. Adept at creating robust data architectures to support complex analytical processes and drive data-driven decision-making for businesses. Strong communication and collaboration skills, enabling seamless cross-functional teamwork and effective project delivery.
Programming Languages: Python, Java, UNIX, Shell Scripting
Databases & Query Languages: Oracle, MySQL, MongoDB, PostgreSQL, SQL Server, Spark SQL, Snowflake
Tools and Technologies: Juypter Notebook, Tableau, Microsoft Office Suite (Word, Power Point, Excel), Operating Systems (LINUX, UNIX, WINDOWS), SAS, SPSS, MATLAB, RDBMS, Machine learning (Linear regression, SVM, Decision tree, KNN, Navie bayes, Random Forest, K-means, Logistic)
Hadoop & Big Data: HDFS, Hive, Sqoop, Spark Streaming, Kafka, MapReduce, Pig, HBase, Oozie, Airflow
Cloud: MS Azure