Search Results for author: Yu Ying Chiu

Found 7 papers, 3 papers with code

DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life

no code implementations3 Oct 2024 Yu Ying Chiu, Liwei Jiang, Yejin Choi

As we increasingly seek guidance from LLMs for decision-making in daily life, many of these decisions are not clear-cut and depend significantly on the personal values and ethical standards of the users.

Philosophy Sociology

WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries

no code implementations24 Jul 2024 Wenting Zhao, Tanya Goyal, Yu Ying Chiu, Liwei Jiang, Benjamin Newman, Abhilasha Ravichander, Khyathi Chandu, Ronan Le Bras, Claire Cardie, Yuntian Deng, Yejin Choi

While hallucinations of large language models (LLMs) prevail as a major challenge, existing evaluation benchmarks on factuality do not cover the diverse domains of knowledge that the real-world users of LLMs seek information about.

Chatbot Form +2

Filtered Corpus Training (FiCT) Shows that Language Models can Generalize from Indirect Evidence

1 code implementation24 May 2024 Abhinav Patil, Jaap Jumelet, Yu Ying Chiu, Andy Lapastora, Peter Shen, Lexie Wang, Clevis Willrich, Shane Steinert-Threlkeld

This paper introduces Filtered Corpus Training, a method that trains language models (LMs) on corpora with certain linguistic constructions filtered out from the training data, and uses it to measure the ability of LMs to perform linguistic generalization on the basis of indirect evidence.

CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge

no code implementations10 Apr 2024 Yu Ying Chiu, Liwei Jiang, Maria Antoniak, Chan Young Park, Shuyue Stella Li, Mehar Bhatia, Sahithya Ravi, Yulia Tsvetkov, Vered Shwartz, Yejin Choi

Our study reveals that CulturalTeaming's various modes of AI assistance support annotators in creating cultural questions, that modern LLMs fail at, in a gamified manner.

Red Teaming

A Computational Framework for Behavioral Assessment of LLM Therapists

1 code implementation1 Jan 2024 Yu Ying Chiu, ASHISH SHARMA, Inna Wanyin Lin, Tim Althoff

The emergence of large language models (LLMs) like ChatGPT has increased interest in their use as therapists to address mental health challenges and the widespread lack of access to care.

In-Context Learning

Humanoid Agents: Platform for Simulating Human-like Generative Agents

1 code implementation9 Oct 2023 Zhilin Wang, Yu Ying Chiu, Yu Cheung Chiu

Just as computational simulations of atoms, molecules and cells have shaped the way we study the sciences, true-to-life simulations of human-like agents can be valuable tools for studying human behavior.

Unity

Cannot find the paper you are looking for? You can Submit a new open access paper.