Motivated graduate in Computer Science and Information Technology with a strong foundation in data engineering tools and technologies, including Hadoop, Cloudera, SQL, Linux, Tableau, Active Directory, LDAP, Kerberos, AWS, and Python. Driven to apply academic knowledge and technical skills to data analytics and engineering projects. Committed to continuously expanding expertise in the IT industry.
Hadoop ecosystem
U.S. Citizen
● Python: Data manipulation, scripting, automation
● SQL: Database querying, data extraction, reporting
● Hadoop (HDFS, YARN, MapReduce), Cloudera: Experience with Hadoop ecosystems and cluster management
● Linux: File systems, shell scripting, process management
● Apache Spark: Basic understanding of distributed data processing
● MySQL, PostgreSQL, HBase
● Git: Source code management and version control
Big Data Processing with Hadoop and Python
● Implemented a data processing pipeline using Hadoop to process large datasets in a distributed environment.
● Used HDFS for data storage and MapReduce for distributed processing.
● Wrote Python scripts to interact with the Hadoop ecosystem and automate data processing tasks.
SQL-Based Data Analysis and Reporting
● Developed SQL queries for data extraction and aggregation from relational databases for business insights.
● Created reports and dashboards that visualize key performance indicators (KPIs) and data trends.
Cloud Data Engineering Project
● Used Cloudera to set up a small Hadoop cluster and execute data transformation jobs.
● Integrated Python scripts with Hadoop to automate data cleansing and ETL (Extract, Transform, Load) processes.
● Gained practical experience working in a Linux-based environment to run distributed data processing tasks.
Data Engineering Club — Member, Shanto-Mariam University of Creative Technology.
● Collaborated with peers on projects to build data pipelines using open-source tools like Apache Hadoop and Spark.
● Participated in workshops on big data technologies and attended industry webinars on cloud-based data engineering.