Summary
Overview
Work History
Education
Skills
Tools/Libraries
Activity
Accomplishments
Publication List
Timeline
Generic

Yeojeong Kim

Machine Learning Engineer
Sunnyvale,CA

Summary

Accomplished Machine Learining Research Scientist with a proven track record at Got It AI and ETRI, specializing in speech and NLP. Expert in Python and Java, demonstrating exceptional problem-solving abilities and a commitment to innovation. Skilled in deploying large language models and providing technical support, showcasing both technical proficiency and collaborative skills.

Overview

9
9
years of professional experience
8
8
years of post-secondary education
2
2
Languages

Work History

Research Scientist

Got It AI: Conversational AI Startup
01.2022 - 09.2023
  • Large Language Model (LLM) training, deployment using Google Vertex AI, AWS Sagemaker, Azure, and HuggingFace
  • Prompt engineering: Creating effective and contextually relevant prompts for natural language processing models
  • MLOps

- Optimization: Utilizing the NVIDIA Triton Inference Server for efficient model inference

- Pipelining: Implementing automated data pipelines using AutoFlow and Vertex AI

  • Voice input normalization processing for NLP: Developing and implementing advanced algorithms to process voice inputs for natural language processing tasks
  • Form detection in transcription data: Utilizing machine learning techniques to detect and extract structured data from transcription records
  • Data collection Supervision: Designed the structure of the data to be collected and supervised/monitored the quality of the collected data

Researcher

ETRI: Korean National AI Research Institute
02.2017 - 01.2022
  • Multi-language Speech Recognizer development: Acoustic model, Language model, g2p handling, E2E model
  • Offering public service for PyeongChang 2018 Winter Olympic official automatic translator (Genie Talk) - 13 languages
  • Providing technical support to various companies that have received technology transfer
  • Constructing Korean Spontaneous Speech Database for machine learning

IT Consultant Intern

Hewlett-Packard, Korea
07.2014 - 08.2014
  • Virtual consulting with NFC and SDN for SK Telecom

Education

Master of Engineering - Electrical Engineering And Computer Science

Gwangju Institute Science And Technology (GIST)
South Korea
03.2015 - 02.2017

Bachelor of Science - Digital Media

Ajou University
South Korea
03.2009 - 02.2015

Skills

Python

Java

Shell Script

C

Tools/Libraries

  • PyTorch
  • Huggingface Transformers
  • Docker
  • Pandas, Keras, Scikit-Learn
  • Google Vertex AI, Amazon Sagemaker
  • Locust
  • Tensorflow
  • Espnet, Kaldi

Activity

- Volunteer at Stanford University Community Committee for International Students(CCIS)

Accomplishments

  • The grand prize, Final Project of Android and Java Framework Expert Program, Korea Software Technology Association, 2013.

Publication List

Journal Papers

1. Bang, J. U., Yun, S., Kim, S. H., Choi, M. Y., Lee, M. K., Kim, Y. J., Kim, D. H., Park, J., Lee, Y. J., and Kim, S. H. (2020). KsponSpeech: Korean Spontaneous Speech Corpus for Automatic Speech Recognition. Applied Sciences, 10(19), 6936.

Conference Papers

1. Kim, Y. J., Kim, J. S., Yun, S., and Kim, S. H. (2017). Arabic Speech Recognition for Automatic Translation. Conference of the Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA) 2017, pp.262-265.

Timeline

Research Scientist

Got It AI: Conversational AI Startup
01.2022 - 09.2023

Researcher

ETRI: Korean National AI Research Institute
02.2017 - 01.2022

Master of Engineering - Electrical Engineering And Computer Science

Gwangju Institute Science And Technology (GIST)
03.2015 - 02.2017

IT Consultant Intern

Hewlett-Packard, Korea
07.2014 - 08.2014

Bachelor of Science - Digital Media

Ajou University
03.2009 - 02.2015
Yeojeong KimMachine Learning Engineer