Designed 50+ complex, dataset-driven evaluation frameworks to benchmark large language models across real-world analytical workflows.
Developed multi-step analytical prompts incorporating SQL-style querying, feature engineering, statistical modeling, and visualization to produce verifiable outputs.
Stress-tested reasoning and generalization across large, structured datasets to improve model robustness and decision-support reliability.
Delivered structured insights and documentation to improve AI-driven analytics performance and stakeholder usability.
Data Scientist / Research Assistant
University of Washington – Genomics
Greater Seattle Area
08.2023 - 02.2025
Analyzed large-scale genomic and spatial datasets using Python, SQL, and distributed computing tools.
Built and optimized predictive models (Random Forest, XGBoost) to identify statistically significant patterns across high-dimensional data.
Partnered with cross-functional research teams to translate domain questions into analytical plans and measurable outputs.
Designed scalable data pipelines for preprocessing, feature extraction, and model evaluation using distributed systems and cloud storage solutions.
Presented findings to technical and non-technical stakeholders to guide research and funding priorities.
Senior Data Analyst
American Institutes for Research (AIR)
Greater Seattle Area
08.2020 - 07.2023
Developed production-grade participant reporting pipelines supporting the CMS ESRD ETC model within strict contractual deadlines.
Designed and implemented complex SAS and SQL-based geospatial algorithms to evaluate healthcare coordination and program performance.
Automated QA validation workflows (SAS, VBA), improving reporting accuracy by 12%.
Extracted and analyzed large relational datasets to evaluate program impact and inform executive-level decisions.
Validated public-facing web data products to ensure accuracy, usability, and compliance with HIPAA and PII-handling requirements.
Senior Reporting Analyst
Optum (UnitedHealth Group)
Southern California
04.2016 - 08.2020
Developed enterprise reporting across claims, diagnosis, pharmacy, lab, and screening datasets using SAS, SQL, and Python.
Maintained recurring dashboards and trend analyses supporting the Medicare Risk Adjustment strategy for more than 15 health insurance plans.
Investigated complex warehouse data structures to explain reporting variances and drive data-informed decisions.
Partnered with business stakeholders to define reporting requirements and translate business needs into analytical deliverables.
Ensured compliance across healthcare data pipelines and reporting systems.
Research Specialist Intern
County of Riverside – Research Analysis & Decision Support
Southern California
03.2014 - 04.2016
Extracted and integrated datasets containing millions of records using SQL pass-through queries and automated workflows.
Conducted quantitative and qualitative analyses across 3 public assistance programs to inform policy decisions.
Designed customer satisfaction surveys and applied advanced statistical modeling and text analytics to derive actionable insights.
Trained research teams in advanced SAS automation techniques and workflow optimization.
Education
Master of Science - Data Science
University of Washington, Seattle, WA
03.2025
Bachelor of Arts - Psychology
California State University, Fullerton, Fullerton, CA
01.2015
Skills
SAS programming
SQL database management
Python programming
Distributed Systems
Generative AI (GenAI) product design
Statistical modeling and analysis
AI-Agent pipelines
Cross-functional teamwork
Analytical problem-solving
Prioritization and scheduling
Certification
Microsoft Certified: Power BI Data Analyst Associate
Timeline
Contract AI Data Analyst - Outlier AI
10.2025 - Current
Data Scientist / Research Assistant - University of Washington – Genomics
08.2023 - 02.2025
Senior Data Analyst - American Institutes for Research (AIR)