Search Results for author: John P. Lalor

Found 15 papers, 4 papers with code

Learning Latent Parameters without Human Response Patterns: Item Response Theory with Artificial Crowds

1 code implementation • IJCNLP 2019 • John P. Lalor, Hao Wu, Hong Yu

We demonstrate a use-case for latent difficulty item parameters, namely training set filtering, and show that using difficulty to sample training data outperforms baseline methods.

Natural Language Inference • Sentiment Analysis
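The training-set filtering idea above can be sketched in a few lines: given per-example difficulty estimates from an IRT model, keep or sample examples by difficulty rather than uniformly at random. A minimal sketch, assuming `difficulties` maps example IDs to learned difficulty values (names and the keep-easiest policy are illustrative, not the paper's code):

```python
import math
import random

def filter_by_difficulty(examples, difficulties, keep_fraction=0.8):
    """Keep the easiest `keep_fraction` of examples by learned IRT difficulty.

    `examples` is a list of (example_id, features, label) tuples and
    `difficulties` maps example_id -> difficulty (higher = harder).
    """
    ranked = sorted(examples, key=lambda ex: difficulties[ex[0]])
    cutoff = int(len(ranked) * keep_fraction)
    return ranked[:cutoff]

def sample_by_difficulty(examples, difficulties, k, temperature=1.0):
    """Sample k examples, biasing toward easier items (lower difficulty)."""
    weights = [math.exp(-difficulties[ex[0]] / temperature) for ex in examples]
    return random.choices(examples, weights=weights, k=k)
```

Whether to favor easier or harder items is a modeling choice; the point of the paper is that difficulty-aware selection beats difficulty-agnostic baselines.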

Evaluation Examples are not Equally Informative: How should that change NLP Leaderboards?

1 code implementation • ACL 2021 • Pedro Rodriguez, Joe Barrow, Alexander Miserlis Hoyle, John P. Lalor, Robin Jia, Jordan Boyd-Graber

While leaderboards are a straightforward ranking of NLP models, this simplicity can mask nuances in evaluation items (examples) and subjects (NLP models).

py-irt: A Scalable Item Response Theory Library for Python

1 code implementation • 2 Mar 2022 • John P. Lalor, Pedro Rodriguez

py-irt is a Python library for fitting Bayesian Item Response Theory (IRT) models.
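For readers unfamiliar with IRT, the core of the simplest model py-irt supports, the 1PL (Rasch) model, fits in a few lines of NumPy. This is a hand-rolled maximum-likelihood sketch of the underlying model for illustration only, not py-irt's API (the library fits Bayesian variants with variational inference):

```python
import numpy as np

def fit_1pl(responses, n_iters=500, lr=0.05):
    """Fit a 1PL (Rasch) IRT model by gradient ascent on the log-likelihood.

    `responses` is a binary matrix of shape (n_subjects, n_items):
    responses[i, j] = 1 if subject i answered item j correctly.
    Returns per-subject abilities and per-item difficulties.
    """
    n_subjects, n_items = responses.shape
    ability = np.zeros(n_subjects)       # theta_i
    difficulty = np.zeros(n_items)       # b_j

    for _ in range(n_iters):
        # P(correct) = sigmoid(theta_i - b_j)
        logits = ability[:, None] - difficulty[None, :]
        prob = 1.0 / (1.0 + np.exp(-logits))
        resid = responses - prob         # gradient of the Bernoulli log-likelihood
        ability += lr * resid.sum(axis=1)
        difficulty -= lr * resid.sum(axis=0)
        difficulty -= difficulty.mean()  # center difficulties for identifiability
    return ability, difficulty
```

py-irt goes beyond this sketch by placing priors on the latent parameters and supporting richer model families (e.g., 2PL), which is why a dedicated library is useful at scale.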

Soft Label Memorization-Generalization for Natural Language Inference

no code implementations • 27 Feb 2017 • John P. Lalor, Hao Wu, Hong Yu

Often, when multiple labels are obtained for a training example, it is assumed that there is an element of noise that must be accounted for.

Memorization • Natural Language Inference
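The soft-label alternative explored above, training on the distribution of annotator labels rather than a single hard label, can be sketched in PyTorch. This is an illustrative cross-entropy-against-soft-targets loss, not necessarily the paper's exact objective:

```python
import torch
import torch.nn.functional as F

def soft_label_loss(logits, soft_targets):
    """Cross-entropy between the model's distribution and the annotator label distribution.

    `logits` has shape (batch, n_classes); `soft_targets` are empirical label
    distributions, e.g. [0.6, 0.3, 0.1] if 6 of 10 annotators chose class 0.
    """
    log_probs = F.log_softmax(logits, dim=-1)
    return -(soft_targets * log_probs).sum(dim=-1).mean()

# Example: 3-way NLI labels (entailment / neutral / contradiction)
logits = torch.randn(4, 3, requires_grad=True)
targets = torch.tensor([[0.6, 0.3, 0.1],
                        [0.1, 0.8, 0.1],
                        [0.2, 0.2, 0.6],
                        [1.0, 0.0, 0.0]])
loss = soft_label_loss(logits, targets)
loss.backward()
```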

Building an Evaluation Scale using Item Response Theory

no code implementations • EMNLP 2016 • John P. Lalor, Hao Wu, Hong Yu

Evaluation of NLP methods requires testing against a previously vetted gold-standard test set and reporting standard metrics (accuracy/precision/recall/F1).

Natural Language Inference
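Once an IRT evaluation scale has been built, a new model is placed on it by estimating its latent ability from its response pattern over the test items. A minimal sketch of maximum-likelihood ability estimation under a 1PL model with fixed item difficulties (illustrative, not the paper's exact procedure):

```python
import numpy as np

def estimate_ability(responses, difficulties, n_iters=100, lr=0.1):
    """Estimate one subject's ability theta given fixed item difficulties.

    `responses[j]` is 1 if the model answered item j correctly, else 0;
    `difficulties[j]` is that item's previously fit IRT difficulty.
    """
    theta = 0.0
    for _ in range(n_iters):
        prob = 1.0 / (1.0 + np.exp(-(theta - difficulties)))
        theta += lr * np.sum(responses - prob)  # gradient of the log-likelihood
    return theta
```

Unlike accuracy, the resulting ability score weights items by how informative they are, which is the motivation for an IRT-based scale.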

Dynamic Data Selection for Curriculum Learning via Ability Estimation

no code implementations • Findings of the Association for Computational Linguistics 2020 • John P. Lalor, Hong Yu

Curriculum learning methods typically rely on heuristics to estimate the difficulty of training examples or the ability of the model.
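The ability-based alternative proposed above can be sketched as: at each epoch, estimate the model's current ability on the IRT scale and train on the examples whose difficulty it is ready for. A hypothetical training-loop fragment (function and attribute names are illustrative, and `estimate_ability` / `train_one_epoch` are assumed to be provided):

```python
def ability_curriculum(train_set, difficulties, model, n_epochs,
                       estimate_ability, train_one_epoch):
    """Each epoch, train on the subset of examples at or below the model's ability.

    `difficulties` maps example IDs to IRT difficulties; `estimate_ability`
    returns the model's current ability on the same latent scale.
    """
    for epoch in range(n_epochs):
        theta = estimate_ability(model, train_set)
        ready = [ex for ex in train_set if difficulties[ex.id] <= theta]
        if not ready:  # early on, fall back to the easiest items
            ready = sorted(train_set, key=lambda ex: difficulties[ex.id])[:100]
        train_one_epoch(model, ready)
    return model
```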

An Empirical Analysis of Human-Bot Interaction on Reddit

no code implementations • EMNLP (WNUT) 2020 • Ming-Cheng Ma, John P. Lalor

Automated agents (“bots”) have emerged as a ubiquitous and influential presence on social media.

Measuring algorithmic interpretability: A human-learning-based framework and the corresponding cognitive complexity score

no code implementations • 20 May 2022 • John P. Lalor, Hong Guo

We describe the framework and its conceptual underpinnings, illustrate it through a toy example, and demonstrate its benefits, in particular for managers weighing tradeoffs when selecting algorithms.

Fairness

Bias A-head? Analyzing Bias in Transformer-Based Language Model Attention Heads

no code implementations • 17 Nov 2023 • Yi Yang, Hanyu Duan, Ahmed Abbasi, John P. Lalor, Kar Yan Tam

Although a burgeoning literature has emerged on stereotypical bias mitigation in PLMs, such as work on debiasing gender and racial stereotyping, how such biases manifest and behave internally within PLMs remains largely unknown.

Fairness • Language Modelling
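Inspecting how bias manifests inside a PLM, as the paper above does at the attention-head level, starts with extracting per-layer, per-head attention maps. A minimal Hugging Face Transformers sketch of that first step (the head-level bias scoring itself is the paper's contribution and is not reproduced here):

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased", output_attentions=True)

text = "The nurse said that she was tired."
inputs = tokenizer(text, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# outputs.attentions is a tuple with one tensor per layer,
# each of shape (batch, n_heads, seq_len, seq_len)
for layer_idx, layer_attn in enumerate(outputs.attentions):
    print(layer_idx, tuple(layer_attn.shape))
```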

H-COAL: Human Correction of AI-Generated Labels for Biomedical Named Entity Recognition

no code implementations • 20 Nov 2023 • Xiaojing Duan, John P. Lalor

With the rapid advancement of machine learning models for NLP tasks, collecting high-fidelity labels from AI models is a realistic possibility.

Named Entity Recognition
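A common way to decide which AI-generated labels to route to human annotators is by model confidence. A minimal confidence-based selection sketch (one plausible criterion for illustration, not necessarily the H-COAL selection strategy):

```python
def select_for_human_review(predictions, budget):
    """Pick the `budget` least-confident AI-labeled spans for human correction.

    `predictions` is a list of dicts like
    {"id": ..., "label": "Disease", "confidence": 0.42}.
    """
    ranked = sorted(predictions, key=lambda p: p["confidence"])
    return ranked[:budget]

preds = [
    {"id": 1, "label": "Chemical", "confidence": 0.97},
    {"id": 2, "label": "Disease", "confidence": 0.41},
    {"id": 3, "label": "Gene", "confidence": 0.63},
]
print(select_for_human_review(preds, budget=1))  # -> the id=2 prediction
```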
