Search Results for author: Chacha Chen

Found 18 papers, 6 papers with code

Learning to Rank Visual Stories From Human Ranking Data

1 code implementation ACL 2022 Chi-Yang Hsu, Yun-Wei Chu, Vincent Chen, Kuan-Chieh Lo, Chacha Chen, Ting-Hao Huang, Lun-Wei Ku

In this paper, we present the VHED (VIST Human Evaluation Data) dataset, which first re-purposes human evaluation results for automatic evaluation; hence we develop Vrank (VIST Ranker), a novel reference-free VIST metric for story evaluation.

Learning-To-Rank Visual Storytelling

Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey

no code implementations20 Mar 2025 Xiaoou Liu, Tiejin Chen, Longchao Da, Chacha Chen, Zhen Lin, Hua Wei

To address this, we introduce a new taxonomy that categorizes UQ methods based on computational efficiency and uncertainty dimensions (input, reasoning, parameter, and prediction uncertainty).

Computational Efficiency Decision Making +2

MCQA-Eval: Efficient Confidence Evaluation in NLG with Gold-Standard Correctness Labels

no code implementations20 Feb 2025 Xiaoou Liu, Zhen Lin, Longchao Da, Chacha Chen, Shubhendu Trivedi, Hua Wei

Large Language Models (LLMs) require robust confidence estimation, particularly in critical domains like healthcare and law where unreliable outputs can lead to significant consequences.

Multiple-choice Text Generation

Can Domain Experts Rely on AI Appropriately? A Case Study on AI-Assisted Prostate Cancer MRI Diagnosis

no code implementations3 Feb 2025 Chacha Chen, Han Liu, Jiamin Yang, Benjamin M. Mervak, Bora Kalaycioglu, Grace Lee, Emre Cakmakli, Matteo Bonatti, Sridhar Pudu, Osman Kahraman, Gul Gizem Pamuk, Aytekin Oto, Aritrick Chatterjee, Chenhao Tan

Building on existing tools for teaching prostate cancer diagnosis, we develop an interface and conduct two experiments to study how AI assistance and performance feedback shape the decision making of domain experts.

Decision Making

GPT-4V Cannot Generate Radiology Reports Yet

2 code implementations16 Jul 2024 Yuyang Jiang, Chacha Chen, Dang Nguyen, Benjamin M. Mervak, Chenhao Tan

To understand the low performance, we decompose the task into two steps: 1) the medical image reasoning step of predicting medical condition labels from images; and 2) the report synthesis step of generating reports from (groundtruth) conditions.

OpenHEXAI: An Open-Source Framework for Human-Centered Evaluation of Explainable Machine Learning

no code implementations20 Feb 2024 Jiaqi Ma, Vivian Lai, Yiming Zhang, Chacha Chen, Paul Hamilton, Davor Ljubenkov, Himabindu Lakkaraju, Chenhao Tan

However, properly evaluating the effectiveness of the XAI methods inevitably requires the involvement of human subjects, and conducting human-centered benchmarks is challenging in a number of ways: designing and implementing user studies is complex; numerous design choices in the design space of user study lead to problems of reproducibility; and running user studies can be challenging and even daunting for machine learning researchers.

Decision Making Fairness

Pragmatic Radiology Report Generation

1 code implementation28 Nov 2023 Dang Nguyen, Chacha Chen, He He, Chenhao Tan

When pneumonia is not found on a chest X-ray, should the report describe this negative observation or omit it?

Image to text

Learning Human-Compatible Representations for Case-Based Decision Support

1 code implementation6 Mar 2023 Han Liu, Yizhou Tian, Chacha Chen, Shi Feng, Yuxin Chen, Chenhao Tan

Despite the promising performance of supervised learning, representations learned by supervised models may not align well with human intuitions: what models consider as similar examples can be perceived as distinct by humans.

Classification Decision Making +2

Contextual Dynamic Prompting for Response Generation in Task-oriented Dialog Systems

no code implementations30 Jan 2023 Sandesh Swamy, Narges Tabari, Chacha Chen, Rashmi Gangadharaiah

Specifically, we propose an approach that performs contextual dynamic prompting where the prompts are learnt from dialog contexts.

Response Generation

Machine Explanations and Human Understanding

1 code implementation8 Feb 2022 Chacha Chen, Shi Feng, Amit Sharma, Chenhao Tan

Our key result is that without assumptions about task-specific intuitions, explanations may potentially improve human understanding of model decision boundary, but they cannot improve human understanding of task decision boundary or model error.

Decision Making Open-Ended Question Answering

Towards a Science of Human-AI Decision Making: A Survey of Empirical Studies

no code implementations21 Dec 2021 Vivian Lai, Chacha Chen, Q. Vera Liao, Alison Smith-Renner, Chenhao Tan

Besides developing AI technologies for this purpose, the emerging field of human-AI decision making must embrace empirical approaches to form a foundational understanding of how humans interact and work with AI to make decisions.

Decision Making Survey

Learning to Simulate on Sparse Trajectory Data

no code implementations22 Mar 2021 Hua Wei, Chacha Chen, Chang Liu, Guanjie Zheng, Zhenhui Li

Simulation of the real-world traffic can be used to help validate the transportation policies.

Imitation Learning

UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced Data

no code implementations22 Oct 2020 Chacha Chen, Junjie Liang, Fenglong Ma, Lucas M. Glass, Jimeng Sun, Cao Xiao

However, existing uncertainty estimation approaches often failed in handling high-dimensional data, which are present in multi-sourced data.

Clustering Prediction +1

CoLight: Learning Network-level Cooperation for Traffic Signal Control

4 code implementations11 May 2019 Hua Wei, Nan Xu, Huichu Zhang, Guanjie Zheng, Xinshi Zang, Chacha Chen, Wei-Nan Zhang, Yanmin Zhu, Kai Xu, Zhenhui Li

To enable cooperation of traffic signals, in this paper, we propose a model, CoLight, which uses graph attentional networks to facilitate communication.

Multi-agent Reinforcement Learning Reinforcement Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.