Summary
Overview
Work History
Education
Skills
Languages
Accomplishments
Test Scores
Timeline
Generic

Soohyun Kim

New York,NY

Summary

Data scientist with a PhD in Biostatistics and experience driving data-informed decision-making for digital health interventions serving large and diverse user populations. Although trained in academia, extensive experience writing and contributing to multi-million-dollar NIH grant proposals has involved developing study strategy, defining success metrics, forecasting impact, and communicating analytical plans to secure competitive funding—an experience analogous to data-driven business case development. Expertise in product analytics, experimentation, causal inference, and machine learning for behavioral and engagement optimization. Proven ability to translate complex analyses into clear recommendations for cross-functional stakeholders and to lead end-to-end analytics for randomized experiments and large-scale longitudinal data.

Overview

5
5
years of professional experience

Work History

Data Science Trainee

Center for Research and Education on Aging and Tec
New York, New York
11.2025 - Current
  • Support analytics for NIH-funded digital platform focused on social connection and mental health for community-living older adults.
  • Identify behavioral drivers of engagement and retention across product features.
  • Collaborate with UX and intervention teams to translate data insights into product improvements.

Postdoctoral Associate

Weill Cornell Medicine – Department of Population
New York, NY
11.2023 - Current
  • Lead analytics strategy for digital health interventions targeting older adults, informing product design and feature prioritization for engagement and retention.
  • Partner with product, engineering, clinicians, and behavioral scientists to define product goals, success metrics, and experimentation roadmaps.
  • Design and analyze randomized controlled trials (N≈250–300) to evaluate product impact on user outcomes and engagement.
  • Develop scalable models to infer user engagement states from high-frequency behavioral and passive data, enabling personalized product interventions.
  • Conduct large-scale longitudinal analyses using Python/R and SQL-style data workflows on intensive time-series and behavioral datasets.
  • Translate complex analyses into leadership-level presentations that inform grant funding decisions, study design, and product development priorities.
  • Lead development of statistical analysis plans, forecasting scenarios, and power analyses for multi-year federally funded initiatives.
  • Mentor graduate students and junior analysts; coordinate analytics workstreams across interdisciplinary teams.

Research Assistant

Columbia University
New York, New York
09.2020 - 10.2023
  • Led development of machine learning models for large-scale ecological momentary assessment (EMA) data (high-frequency user-level behavioral data).
  • Built predictive models to identify user subgroups and personalize interventions.
  • Developed a causal estimation framework that integrates data across multiple studies, improving statistical power and enabling more robust cross-study inference.
  • Communicated findings to clinical, technical, and leadership stakeholders to guide intervention design.
  • Contributed to multiple collaborative projects across research, clinical, and technical teams.

Education

Ph.D. - Biostatistics

Columbia University
New York
08-2023

Bachelor of Arts - Mathematics, Economics

Mount Holyoke College
South Hadley
05-2016

Skills

  • machine learning
  • data analysis
  • statistical modeling
  • randomized controlled trials
  • user engagement strategies
  • product design
  • mentorship and training
  • data visualization
  • Deep learning
  • longitudinal studies
  • Recurrent neural networks
  • Time series analysis
  • Neural networks
  • Random forests
  • Support vector machines
  • Simulation modeling

Languages

English
Native/ Bilingual
Korean
Native/ Bilingual
French
Professional

Accomplishments

Biostatistics Ph.D. Fellowship, Columbia University

Test Scores

  • SAT: 2350 (Reading: 800, Math: 790, Writing: 760)
  • GRE: 335 (Verbal Reasoning: 165, Quantitative Reasoning: 170)

Timeline

Data Science Trainee

Center for Research and Education on Aging and Tec
11.2025 - Current

Postdoctoral Associate

Weill Cornell Medicine – Department of Population
11.2023 - Current

Research Assistant

Columbia University
09.2020 - 10.2023

Ph.D. - Biostatistics

Columbia University

Bachelor of Arts - Mathematics, Economics

Mount Holyoke College
Soohyun Kim