Summary
Education
Skills
Status
Technicalskills
Projects
Extracurricular Activities
Hobbies and Interests
Timeline
Generic

Mir Mahmudul Hasan

Queens,NY

Summary

Motivated graduate in Computer Science and Information Technology with a strong foundation in data engineering tools and technologies, including big data, Hadoop, Cloudera, SQL, Linux, Tableau, Active Directory, LDAP, Kerberos, AWS, Python, and more. Driven to apply academic knowledge and technical skills to make meaningful contributions to data analytics and engineering projects. Committed to continuously expanding expertise in the IT industry.

Education

Bachelor of Science - Computer Science and Information Technology

Shanto-Mariam University of Creative Technology
Dhaka
06.2020

Skills

  • Python
  • SQL
  • Hadoop
  • Cloudera
  • Linux
  • Apache Spark
  • MySQL
  • PostgreSQL
  • Hbase
  • Git
  • Tableau
  • Data cleaning
  • Data visualization and presentations
  • Database management
  • Data validation
  • ETL processes
  • Data warehousing
  • Data science
  • Software development life cycle (SDLC)

Hadoop ecosystem

Status

U.S. Citizen

Technicalskills

Python, Data manipulation, scripting, automation, SQL, Database querying, data extraction, reporting, Hadoop, HDFS, YARN, MapReduce, Cloudera, Experience with Hadoop ecosystems and cluster management, Linux, File systems, shell scripting, process management, Apache Spark, Basic understanding of distributed data processing, MySQL, PostgreSQL, Hbase, Git, Source code management and version control

Projects

Big Data Processing with Hadoop and Python

● Implemented a data processing pipeline using Hadoop to process large datasets in a distributed environment.

● Utilized HDFS for storing data and MapReduce for processing it efficiently.

● Wrote Python scripts to interact with the Hadoop ecosystem and automate data processing tasks.

SQL-Based Data Analysis and Reporting

● Developed SQL queries for data extraction and aggregation from relational databases for business insights.

● Created reports and dashboards that visualize key performance indicators (KPIs) and trends in data.

Cloud Data Engineering Project

● Used Cloudera to set up a small Hadoop cluster and execute data transformation jobs.

● Integrated Python scripts with Hadoop to automate data cleansing and ETL (Extract, Transform, Load) processes.

● Gained practical experience working in a Linux-based environment to run distributed data processing tasks.

Extracurricular Activities

Data Engineering Club — Member, Shanto-Mariam University of Creative Technology.

● Collaborated with peers on projects to build data pipelines using open-source tools like Apache Hadoop and Spark.

● Participated in workshops on big data technologies and attended industry webinars on cloud-based data engineering.

Hobbies and Interests

  • Data engineering
  • Machine learning
  • Cloud computing
  • Open-source technologies

Timeline

Bachelor of Science - Computer Science and Information Technology

Shanto-Mariam University of Creative Technology
Mir Mahmudul Hasan