Search Results for author: Yeganeh Kordi

Found 4 papers, 3 papers with code

Tur[k]ingBench: A Challenge Benchmark for Web Agents

no code implementations • 18 Mar 2024 • Kevin Xu, Yeganeh Kordi, Tanay Nayak, Ado Asija, Yizhong Wang, Kate Sanders, Adam Byerly, Jingyu Zhang, Benjamin Van Durme, Daniel Khashabi

To support the evaluation of TurkingBench, we have developed a framework that links chatbot responses to actions on web pages (e.g., modifying a text box, selecting a radio button).

Chatbot

Self-Instruct: Aligning Language Models with Self-Generated Instructions

19 code implementations • 20 Dec 2022 • Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi, Hannaneh Hajishirzi

Applying our method to the vanilla GPT3, we demonstrate a 33% absolute improvement over the original model on Super-NaturalInstructions, on par with the performance of InstructGPT-001, which was trained with private user data and human annotations.

Instruction Following, Language Modelling

UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training

1 code implementation • 23 Feb 2022 • Daniel Khashabi, Yeganeh Kordi, Hannaneh Hajishirzi

We present UnifiedQA-v2, a QA model built with the same process as UnifiedQA, except that it utilizes more supervision -- roughly 3x the number of datasets used for UnifiedQA.

Question Answering
