Summary
Overview
Work History
Education
Skills
Certification
Languages
Projects
Timeline
Generic

HIROAKI OSHIMA

San Jose,CA

Summary

Software Engineer with passion and expertise in Machine Learning and Big Data system design

Overview

5
5
years of professional experience
1
1
Certification

Work History

Machine Learning Software Engineer (Volunteer)

DataKind
09.2024 - Current
  • Created interactive geographical visualization software that represents housing affordability and renters burden of all 3000 U.S. counties as a commission to policy makers and housing authority, deployed and host it on AWS Environment
  • Trained and developed Forecasting ML model to predict future demand for affordable housing based on economic variables and current inventories


Cloud Data Engineer

Blue River Technology
07.2021 - 11.2023
  • Worked as big data engineer and partnered with data scientists and robotics software engineers to develop autonomous tractor See & Spray
  • Built Data Lakehouse with AWS and Databricks to solve scalability problem for data replication pipelines. Shortened the replication lag from 24 hours to 1 hour while keeping the cost largely same. Enabled more timely data analysis and dashboard creation
  • Architected and built ETL pipelines between multiple organizations that processes large amount of machinery data (> 1TB) daily with AWS and Spark while ensuring data security and access control for stakeholders
  • Act as a consultant and regularly held office hours to troubleshoot data pipelines built by data scientists and robotics software engineer. Optimized their pipelines and unblocked multiple workflows bottle-necked by slow data pipeline execution.
  • Built data lake and cataloged metadata for imagery data on AWS environment. Offloaded query from the original database (Mongodb), resulted in 40-200x faster query executions

Software Engineer Intern

FriendlyRobots.co
11.2019 - 02.2020
  • Built AWS cloud infrastructure and CI/CD pipeline for self-driving vacuuming robot application to
    automate the build-deploy-testing process, which made the development cycle 60% faster
  • Containerized ROS and simulator applications to deploy the functional tests to AWS cloud.
  • Created alarms in CloudWatch service for monitoring the robot application’s performance, memory and CPU usage

Education

Bachelor of Arts - Data Science Applied Math And Modeling Focus

University of California, Berkeley
Berkeley, CA
08-2019

Skills

Python, Javascript, SQL, Docker, Kubernetes, Apache Spark, Terraform, AWS, Databricks, Sagemaker, Grafana, Github

Certification

AWS Certified Machine Learning Engineer – Associate

Languages

English
Native or Bilingual
Japanese
Native or Bilingual
Chinese (Mandarin)
Professional Working

Projects

Choropleth Map for Housing Affordability of US Counties

  • Utilized data provided by Housing and Urban Development and Created a interactive map that visually represents how many people per capita are paying more than 50% of their income on housing costs in all 3000+ US counties. The map highlights areas with particularly high housing costs compared to their medium income.

Affordable Housing Inventory Demand Forecast Model

  • Researched and collected affordable housing and economic data, and developed and trained a time-series forecasting model that predicts the required number of affordable housing inventory for each county based on economic variables such as historical rental cost, median house hold income, cost burden and homelessness

Timeline

Machine Learning Software Engineer (Volunteer)

DataKind
09.2024 - Current

Cloud Data Engineer

Blue River Technology
07.2021 - 11.2023

Software Engineer Intern

FriendlyRobots.co
11.2019 - 02.2020

Bachelor of Arts - Data Science Applied Math And Modeling Focus

University of California, Berkeley
HIROAKI OSHIMA