Search Results for author: Matthew Lease

Found 38 papers, 15 papers with code

Benchmark Transparency: Measuring the Impact of Data on Evaluation

no code implementations31 Mar 2024 Venelin Kovatchev, Matthew Lease

In this paper we present an exploratory research on quantifying the impact that data distribution has on the performance and evaluation of NLP models.

Diverse, but Divisive: LLMs Can Exaggerate Gender Differences in Opinion Related to Harms of Misinformation

no code implementations29 Jan 2024 Terrence Neumann, Sooyong Lee, Maria De-Arteaga, Sina Fazelpour, Matthew Lease

We pose two central questions: (1) To what extent do prompts with explicit gender references reflect gender differences in opinion in the United States on topics of social relevance?

Fact Checking Language Modelling +2

A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation Tasks

1 code implementation20 Dec 2023 Alexander Braylan, Madalyn Marabella, Omar Alonso, Matthew Lease

Beyond investigating these research questions above, we discuss the foundational concept of annotation complexity, present a new aggregation model as a bridge between traditional models and our own, and contribute a new semi-supervised learning method for complex label aggregation that outperforms prior work.

Model Selection Navigate

Interpretable by Design: Wrapper Boxes Combine Neural Performance with Faithful Explanations

no code implementations15 Nov 2023 Yiheng Su, Juni Jessy Li, Matthew Lease

Can we preserve the accuracy of neural models while also providing faithful explanations?

Human-centered NLP Fact-checking: Co-Designing with Fact-checkers using Matchmaking for AI

no code implementations14 Aug 2023 Houjiang Liu, Anubrata Das, Alexander Boltz, Didi Zhou, Daisy Pinaroc, Matthew Lease, Min Kyung Lee

While many Natural Language Processing (NLP) techniques have been proposed for fact-checking, both academic research and fact-checking organizations report limited adoption of such NLP work due to poor alignment with fact-checker practices, values, and needs.

Fact Checking Misinformation

Designing Closed-Loop Models for Task Allocation

1 code implementation31 May 2023 Vijay Keswani, L. Elisa Celis, Krishnaram Kenthapadi, Matthew Lease

Instead, we find ourselves in a "closed" decision-making loop in which the same fallible human decisions we rely on in practice must also be used to guide task allocation.

Decision Making

Same Same, But Different: Conditional Multi-Task Learning for Demographic-Specific Toxicity Detection

1 code implementation14 Feb 2023 Soumyajit Gupta, Sooyong Lee, Maria De-Arteaga, Matthew Lease

We propose framing toxicity detection as multi-task learning (MTL), allowing a model to specialize on the relationships that are relevant to each demographic group while also leveraging shared properties across groups.

Multi-Task Learning

Learning Complementary Policies for Human-AI Teams

no code implementations6 Feb 2023 Ruijiang Gao, Maytal Saar-Tsechansky, Maria De-Arteaga, Ligong Han, Wei Sun, Min Kyung Lee, Matthew Lease

We then extend our approach to leverage opportunities and mitigate risks that arise in important contexts in practice: 1) when a team is composed of multiple humans with differential and potentially complementary abilities, 2) when the observational data includes consistent deterministic actions, and 3) when the covariate distribution of future decisions differ from that in the historical data.

New Metrics to Encourage Innovation and Diversity in Information Retrieval Approaches

1 code implementation19 Jan 2023 Mehmet Deniz Türkmen, Matthew Lease, Mucahid Kutlu

In addition, we show that our metrics achieve higher evaluation stability and discriminative power than the standard metrics we modify.

Information Retrieval Retrieval

The State of Human-centered NLP Technology for Fact-checking

no code implementations8 Jan 2023 Anubrata Das, Houjiang Liu, Venelin Kovatchev, Matthew Lease

We recommend that future research include collaboration with fact-checker stakeholders early on in NLP research, as well as incorporation of human-centered design practices in model development, in order to further guide technology development for human use and practical adoption.

Explainable Models Fact Checking +1

Measuring Annotator Agreement Generally across Complex Structured, Multi-object, and Free-text Annotation Tasks

1 code implementation15 Dec 2022 Alexander Braylan, Omar Alonso, Matthew Lease

When annotators label data, a key metric for quality assurance is inter-annotator agreement (IAA): the extent to which annotators agree on their labels.

text annotation

Fairly Accurate: Learning Optimal Accuracy vs. Fairness Tradeoffs for Hate Speech Detection

no code implementations15 Apr 2022 Venelin Kovatchev, Soumyajit Gupta, Anubrata Das, Matthew Lease

In this work, we first introduce a differentiable measure that enables direct optimization of group fairness (specifically, balancing accuracy across groups) in model training.

Fairness Hate Speech Detection

The Effects of Interactive AI Design on User Behavior: An Eye-tracking Study of Fact-checking COVID-19 Claims

1 code implementation17 Feb 2022 Li Shi, Nilavra Bhattacharya, Anubrata Das, Matthew Lease, Jacek Gwidzka

We conducted a lab-based eye-tracking study to investigate how the interactivity of an AI-powered fact-checking system affects user interactions, such as dwell time, attention, and mental resources involved in using the system.

Fact Checking

Designing Closed Human-in-the-loop Deferral Pipelines

1 code implementation9 Feb 2022 Vijay Keswani, Matthew Lease, Krishnaram Kenthapadi

Our key insight is that by exploiting weak prior information, we can match experts to input examples to ensure fairness and accuracy of the resulting deferral framework, even when imperfect and biased experts are used in place of ground truth labels.

Decision Making Fairness

In Search of Ambiguity: A Three-Stage Workflow Design to Clarify Annotation Guidelines for Crowd Workers

no code implementations4 Dec 2021 Vivek Krishna Pradhan, Mike Schaekermann, Matthew Lease

We propose a novel three-stage FIND-RESOLVE-LABEL workflow for crowdsourced annotation to reduce ambiguity in task instructions and thus improve annotation quality.

TAG

Data Excellence for AI: Why Should You Care

no code implementations19 Nov 2021 Lora Aroyo, Matthew Lease, Praveen Paritosh, Mike Schaekermann

The efficacy of machine learning (ML) models depends on both algorithms and data.

A Two-Stage Neural-Filter Pareto Front Extractor and the need for Benchmarking

no code implementations29 Sep 2021 Soumyajit Gupta, Gurpreet Singh, Matthew Lease

The Stage-1 neural network efficiently extracts the \textit{weak} Pareto front, using Fritz-John Conditions (FJC) as the discriminator, with no assumptions of convexity on the objectives or constraints.

Benchmarking Multi-Task Learning

The Case for Claim Difficulty Assessment in Automatic Fact Checking

no code implementations20 Sep 2021 Prakhar Singh, Anubrata Das, Junyi Jessy Li, Matthew Lease

Fact-checking is the process of evaluating the veracity of claims (i. e., purported facts).

Fact Checking

Towards Unbiased and Accurate Deferral to Multiple Experts

1 code implementation25 Feb 2021 Vijay Keswani, Matthew Lease, Krishnaram Kenthapadi

Machine learning models are often implemented in cohort with humans in the pipeline, with the model having an option to defer to a domain expert in cases where it has low confidence in its inference.

BIG-bench Machine Learning Fairness

A Hybrid 2-stage Neural Optimization for Pareto Front Extraction

no code implementations27 Jan 2021 Gurpreet Singh, Soumyajit Gupta, Matthew Lease, Clint Dawson

The first stage (neural network) efficiently extracts a weak Pareto front, using Fritz-John conditions as the discriminator, with no assumptions of convexity on the objectives or constraints.

Fairness

Understanding and Predicting Characteristics of Test Collections in Information Retrieval

no code implementations24 Dec 2020 Md Mustafizur Rahman, Mucahid Kutlu, Matthew Lease

Research community evaluations in information retrieval, such as NIST's Text REtrieval Conference (TREC), build reusable test collections by pooling document rankings submitted by many teams.

Information Retrieval Retrieval +1

You Are What You Tweet: Profiling Users by Past Tweets to Improve Hate Speech Detection

no code implementations16 Dec 2020 Prateek Chaudhry, Matthew Lease

Hate speech detection research has predominantly focused on purely content-based methods, without exploiting any additional context.

Hate Speech Detection

Range-Net: A High Precision Streaming SVD for Big Data Applications

no code implementations27 Oct 2020 Gurpreet Singh, Soumyajit Gupta, Matthew Lease, Clint Dawson

Although these methods are claimed to be applicable to scientific computations due to associated tail-energy error bounds, the approximation errors in the singular vectors and values are high when the aforementioned assumption does not hold.

Vocal Bursts Intensity Prediction

Extracting Optimal Solution Manifolds using Constrained Neural Optimization

no code implementations13 Sep 2020 Gurpreet Singh, Soumyajit Gupta, Matthew Lease

However, such an approach is often restricted to a strict class of functions, deviation from which results in sub-optimal solution to the original problem.

Computational Efficiency Hyperspectral Unmixing

A Conceptual Framework for Evaluating Fairness in Search

1 code implementation22 Jul 2019 Anubrata Das, Matthew Lease

While search efficacy has been evaluated traditionally on the basis of result relevance, fairness of search has attracted recent attention.

Fairness

CobWeb: A Research Prototype for Exploring User Bias in Political Fact-Checking

1 code implementation8 Jul 2019 Anubrata Das, Kunjan Mehta, Matthew Lease

The effect of user bias in fact-checking has not been explored extensively from a user-experience perspective.

Fact Checking

Efficient Test Collection Construction via Active Learning

no code implementations17 Jan 2018 Md Mustafizur Rahman, Mucahid Kutlu, Tamer Elsayed, Matthew Lease

To create a new IR test collection at low cost, it is valuable to carefully select which documents merit human relevance judgments.

Active Learning

Aggregating and Predicting Sequence Labels from Crowd Annotations

1 code implementation ACL 2017 An Thanh Nguyen, Byron Wallace, Junyi Jessy Li, Ani Nenkova, Matthew Lease

Despite sequences being core to NLP, scant work has considered how to handle noisy sequence labels from multiple annotators for the same text.

named-entity-recognition Named Entity Recognition +2

Neural Information Retrieval: A Literature Review

no code implementations18 Nov 2016 Ye Zhang, Md Mustafizur Rahman, Alex Braylan, Brandon Dang, Heng-Lu Chang, Henna Kim, Quinten McNamara, Aaron Angert, Edward Banner, Vivek Khetan, Tyler McDonnell, An Thanh Nguyen, Dan Xu, Byron C. Wallace, Matthew Lease

A recent "third wave" of Neural Network (NN) approaches now delivers state-of-the-art performance in many machine learning tasks, spanning speech recognition, computer vision, and natural language processing.

Information Retrieval Retrieval +2

Active Discriminative Text Representation Learning

1 code implementation14 Jun 2016 Ye Zhang, Matthew Lease, Byron C. Wallace

We also show that, as expected, the method quickly learns discriminative word embeddings.

Active Learning Document Classification +6

TurKPF: TurKontrol as a Particle Filter

1 code implementation20 Apr 2014 Ethan Petuchowski, Matthew Lease

TurKontrol, and algorithm presented in (Dai et al. 2010), uses a POMDP to model and control an iterative workflow for crowdsourced work.

Cannot find the paper you are looking for? You can Submit a new open access paper.