● Developed and evaluated domain-specific prompts to assess the performance of large language models (LLMs) in multiple domains.
● Analyzed LLM outputs for scientific accuracy, clarity, and depth in specialized subfields.
● Contributed to improving AI understanding of complex biological topics through expert
review and feedback.
● Conducted independent research to support prompt development and evaluation tasks.
Programming Languages:
Libraries/Frameworks:
Developer Tools:
Other: