Eunjeong (Ariel) Ahn


Fullerton,41

Summary

Data Scientist experienced in gathering, cleaning, and organizing data for use by technical and non-technical personnel, especially in the computer vision field. Advanced understanding of statistical, algebraic, and other analytical techniques. Highly organized, motivated, and diligent, with a significant background in logistics and commerce.

Overview

7 years of professional experience
1 Certification

Work History

Data Scientist

Kurly (Commerce & Logistics)
Seoul
05.2023 - 07.2024
  • Migrated data from AWS Redshift to GCP BigQuery and verified data consistency between the two databases.
  • Operated a demand forecasting model for logistics planning using an ensemble of models (XGB, statistics-based models, etc.).
  • Developed new features to improve the demand forecasting model. With the new features, the model's MAPE decreased by about 2% (accuracy 94% -> 96%).
  • Developed a Redash dashboard that provides predicted values to the logistics team.
  • Migrated the visualization tool from Tableau to Redash, reducing operating costs by about 30%.
  • Developed a new best-recommendation tab on the company's website. Created 5 topics of best-recommended items and developed the recommendation algorithms.
  • Improved the algorithm for detecting blurry images on review pages. Using the Laplacian filter in OpenCV, increased the precision of the detection algorithm from 20% to 90%.

Data Scientist

Barogo (Delivery)
10.2021 - 03.2023
  • Supported business decisions: before launching new business initiatives, analyzed data to assess their likelihood of success.
  • Mentored junior team members in best practices for data science methodologies, fostering a culture of continuous improvement.
  • Organized in-house data: developed a store classification model using TensorFlow and organized the company's store data. Increased classification accuracy to 97% (based on level-1 categories, 19 categories in total).
  • Developed data-driven KPI metrics for business evaluation.
  • Improved the database schema by applying a star schema for better usability.
  • Developed a time-allocation recommendation model for delivery drivers using TensorFlow. Delivery delays decreased by about 20%.
  • Improved the performance of Python code, reducing running time from 20 minutes to 10 seconds.

Data Scientist

KBDataSystem (Financial IT)
03.2020 - 09.2021
  • Developed data models and planned data projects.
  • Developed an identification card OCR module: built a module using OpenCV that detects text regions in the image when users upload their identification card image to the service website. (Text detection accuracy: 94%)
  • Developed a hyper-personalization model: used association analysis to detect consumption patterns in card usage, and used a clustering algorithm to label consumption data containing abnormal card usage. Abnormal data was detected by calculating the distance from the center of each cluster.
  • Developed a model for optimizing cloud resources: using 4 features (CPU usage, memory, storage, and traffic), labeled anomalous data with an unsupervised algorithm such as HDBSCAN when abnormal patterns were found in those features, then trained a supervised model on the labeled data using an ensemble algorithm such as random forest.

Data Consultant

VTW (IT Consultation)
07.2017 - 12.2019
  • Executed data projects as a big data consultant and data analyst.
  • Cleaned data using queries and constructed datasets for analysis. Calculated basic statistics and recidivism rates, and developed a recidivism model.
  • Crawled text using Python and extracted job-related words from the text through morpheme analysis in SAS. After cleaning the data with SAS and SQL, defined relationships between words, developed an ontology model, and implemented a job-matching model with ML algorithms.
  • Analyzed the prices of the top 100 foreign and domestic stocks and predicted stock prices using a time-series algorithm.
  • Constructed datasets using Spark, extracted words from text, and merged the data into a final dataset for analysis. Developed a suspicion model using a clustering algorithm and visualized the analysis results with Tableau.

Education

Bachelor of Science - Statistics

Sookmyung Women's University
Seoul, South Korea
02.2018

Skills

  • Statistical Analysis
  • Business Forecasting
  • Data Mining
  • Python Programming
  • Machine Learning
  • Data Operations
  • Feature Engineering
  • Anomaly Detection
  • Natural Language Processing
  • Time Series Analysis
  • Deep Learning
  • Data Visualization

Certification

  • TensorFlow Developer Certification

TensorFlow Certificate Program / Credential ID 43949989

  • Google Cloud Certified Professional Data Engineer

Google / Credential ID 0J3Td6

  • Big Data Analyzer

Korea Data Development Agency / Credential ID BAE-007001469

Languages

Korean
Native or Bilingual
English
Professional Working

Work Preference

Work Type

Full Time, Part Time

Work Location

On-Site, Remote, Hybrid

Important To Me

Career advancement, Company culture, Healthcare benefits, Flexible work hours

Work Availability

Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday
Morning, Afternoon, Evening

Software

Python

R

GCP

AWS

TensorFlow

PyTorch

Interests

Computer Vision

MLOps

Unstructured data
