Search Results for author: Kellin Pelrine

Found 13 papers, 7 papers with code

Combining Confidence Elicitation and Sample-based Methods for Uncertainty Quantification in Misinformation Mitigation

no code implementations • 13 Jan 2024 • Mauricio Rivera, Jean-François Godbout, Reihaneh Rabbany, Kellin Pelrine

We propose an uncertainty quantification framework that leverages both direct confidence elicitation and sample-based consistency methods to provide better calibration for NLP misinformation mitigation solutions.

Misinformation, Uncertainty Quantification

Uncertainty Resolution in Misinformation Detection

no code implementations • 2 Jan 2024 • Yury Orlovskiy, Camille Thibault, Anne Imouza, Jean-François Godbout, Reihaneh Rabbany, Kellin Pelrine

Misinformation poses a variety of risks, such as undermining public trust and distorting factual discourse.

Misinformation

Exploiting Novel GPT-4 APIs

1 code implementation • 21 Dec 2023 • Kellin Pelrine, Mohammad Taufeeque, Michał Zając, Euan McLean, Adam Gleave

Language model attacks typically assume one of two extreme threat models: full white-box access to model weights, or black-box access limited to a text generation API.

Language Modelling, Retrieval +1

Open, Closed, or Small Language Models for Text Classification?

no code implementations • 19 Aug 2023 • Hao Yu, Zachary Yang, Kellin Pelrine, Jean Francois Godbout, Reihaneh Rabbany

Recent advancements in large language models have demonstrated remarkable capabilities across various NLP tasks.

Misinformation, Model Selection +4

Towards Reliable Misinformation Mitigation: Generalization, Uncertainty, and GPT-4

1 code implementation • 24 May 2023 • Kellin Pelrine, Anne Imouza, Camille Thibault, Meilina Reksoprodjo, Caleb Gupta, Joel Christoph, Jean-François Godbout, Reihaneh Rabbany

We propose focusing on generalization, uncertainty, and how to leverage recent large language models, in order to create more practical tools to evaluate information veracity in contexts where perfect classification is impossible.

Classification, Misinformation +1

Adversarial Policies Beat Superhuman Go AIs

2 code implementations • 1 Nov 2022 • Tony T. Wang, Adam Gleave, Tom Tseng, Kellin Pelrine, Nora Belrose, Joseph Miller, Michael D. Dennis, Yawen Duan, Viktor Pogrebniak, Sergey Levine, Stuart Russell

The core vulnerability uncovered by our attack persists even in KataGo agents adversarially trained to defend against our attack.

Towards Better Evaluation for Dynamic Link Prediction

1 code implementation • 20 Jul 2022 • Farimah Poursafaei, Shenyang Huang, Kellin Pelrine, Reihaneh Rabbany

To evaluate against more difficult negative edges, we introduce two more challenging negative sampling strategies that improve robustness and better match real-world applications.

Dynamic Link Prediction, Memorization

The Surprising Performance of Simple Baselines for Misinformation Detection

2 code implementations • 14 Apr 2021 • Kellin Pelrine, Jacob Danovitch, Reihaneh Rabbany

As social media becomes increasingly prominent in our day-to-day lives, it is increasingly important to detect informative content and prevent the spread of disinformation and unverified rumours.

Fake News Detection, Misinformation +1
