Search Results for author: Assaf Toledo

Found 13 papers, 4 papers with code

Statistical multi-metric evaluation and visualization of LLM system predictive performance

no code implementations30 Jan 2025 Samuel Ackerman, Eitan Farchi, Orna Raz, Assaf Toledo

The evaluation of generative or discriminative large language model (LLM)-based systems is often a complex multi-dimensional problem.

Code Generation Decision Making +1

Genie: Achieving Human Parity in Content-Grounded Datasets Generation

no code implementations25 Jan 2024 Asaf Yehudai, Boaz Carmeli, Yosi Mass, Ofir Arviv, Nathaniel Mills, Assaf Toledo, Eyal Shnarch, Leshem Choshen

Furthermore, we compare models trained on our data with models trained on human-written data -- ELI5 and ASQA for LFQA and CNN-DailyMail for Summarization.

Long Form Question Answering

VIRATrustData: A Trust-Annotated Corpus of Human-Chatbot Conversations About COVID-19 Vaccines

no code implementations24 May 2022 Roni Friedman, João Sedoc, Shai Gretz, Assaf Toledo, Rose Weeks, Naor Bar-Zeev, Yoav Katz, Noam Slonim

Public trust in medical information is crucial for successful application of public health policies such as vaccine uptake.

Chatbot

A Large-scale Dataset for Argument Quality Ranking: Construction and Analysis

2 code implementations26 Nov 2019 Shai Gretz, Roni Friedman, Edo Cohen-Karlik, Assaf Toledo, Dan Lahav, Ranit Aharonov, Noam Slonim

To this end, we created a corpus of 30, 497 arguments carefully annotated for point-wise quality, released as part of this work.

Syntactic Interchangeability in Word Embedding Models

1 code implementation WS 2019 Daniel Hershcovich, Assaf Toledo, Alon Halfon, Noam Slonim

Nearest neighbors in word embedding models are commonly observed to be semantically similar, but the relations between them can vary greatly.

POS valid +1

Cannot find the paper you are looking for? You can Submit a new open access paper.