Search Results for author: Alexander Wan

Found 3 papers, 3 papers with code

What Evidence Do Language Models Find Convincing?

1 code implementation19 Feb 2024 Alexander Wan, Eric Wallace, Dan Klein

Retrieval-augmented language models are being increasingly tasked with subjective, contentious, and conflicting queries such as "is aspartame linked to cancer".

counterfactual Misinformation +1

Poisoning Language Models During Instruction Tuning

1 code implementation1 May 2023 Alexander Wan, Eric Wallace, Sheng Shen, Dan Klein

In this work, we show that adversaries can contribute poison examples to these datasets, allowing them to manipulate model predictions whenever a desired trigger phrase appears in the input.

GLUECons: A Generic Benchmark for Learning Under Constraints

1 code implementation16 Feb 2023 Hossein Rajaby Faghihi, Aliakbar Nafar, Chen Zheng, Roshanak Mirzaee, Yue Zhang, Andrzej Uszok, Alexander Wan, Tanawan Premsri, Dan Roth, Parisa Kordjamshidi

Recent research has shown that integrating domain knowledge into deep learning architectures is effective -- it helps reduce the amount of required data, improves the accuracy of the models' decisions, and improves the interpretability of models.

Cannot find the paper you are looking for? You can Submit a new open access paper.