Search Results for author: Anas Awadalla

Found 6 papers, 4 papers with code

VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use

1 code implementation • 12 Aug 2023 • Yonatan Bitton, Hritik Bansal, Jack Hessel, Rulin Shao, Wanrong Zhu, Anas Awadalla, Josh Gardner, Rohan Taori, Ludwig Schmidt

These descriptions enable 1) collecting human-verified reference outputs for each instance; and 2) automatic evaluation of candidate multimodal generations using a text-only LLM, aligning with human judgment.

Instruction Following

Reliable and Trustworthy Machine Learning for Health Using Dataset Shift Detection

no code implementations • NeurIPS 2021 • Chunjong Park, Anas Awadalla, Tadayoshi Kohno, Shwetak Patel

We then translate the out-of-distribution score into a human-interpretable CONFIDENCE SCORE to investigate its effect on users' interaction with health ML applications.

BIG-bench Machine Learning • Medical Diagnosis • +1
