Search Results for author: Christopher M. Homan

Found 10 papers, 5 papers with code

Subjective Crowd Disagreements for Subjective Data: Uncovering Meaningful CrowdOpinion with Population-level Learning

1 code implementation • 7 Jul 2023 • Tharindu Cyril Weerasooriya, Sarah Luger, Saloni Poddar, Ashiqur R. KhudaBukhsh, Christopher M. Homan

Human-annotated data plays a critical role in the fairness of AI systems, including those that deal with life-altering decisions or moderating human-created web/social media content.

Fairness

Paper
Code

Vicarious Offense and Noise Audit of Offensive Speech Classifiers: Unifying Human and Machine Disagreement on What is Offensive

2 code implementations • 29 Jan 2023 • Tharindu Cyril Weerasooriya, Sujan Dutta, Tharindu Ranasinghe, Marcos Zampieri, Christopher M. Homan, Ashiqur R. KhudaBukhsh

For (2), we introduce a first-of-its-kind dataset of vicarious offense.

Language Modelling Large Language Model +1

Paper
Code

Cross-lingual Offensive Language Identification for Low Resource Languages: The Case of Marathi

1 code implementation • RANLP 2021 • Saurabh Gaikwad, Tharindu Ranasinghe, Marcos Zampieri, Christopher M. Homan

The widespread presence of offensive language on social media motivated the development of systems capable of recognizing such content automatically.

Language Identification Transfer Learning

Paper
Code

Improving Label Quality by Jointly Modeling Items and Annotators

no code implementations • NLPerspectives (LREC) 2022 • Tharindu Cyril Weerasooriya, Alexander G. Ororbia, Christopher M. Homan

We propose a fully Bayesian framework for learning ground truth labels from noisy annotators.

Clustering

Paper
Add Code

LCP-RIT at SemEval-2021 Task 1: Exploring Linguistic Features for Lexical Complexity Prediction

no code implementations • SEMEVAL 2021 • Abhinandan Desai, Kai North, Marcos Zampieri, Christopher M. Homan

This paper describes team LCP-RIT's submission to the SemEval-2021 Task 1: Lexical Complexity Prediction (LCP).

Lexical Complexity Prediction POS +1

Paper
Add Code

Domain-specific MT for Low-resource Languages: The case of Bambara-French

no code implementations • 31 Mar 2021 • Allahsera Auguste Tapo, Michael Leventhal, Sarah Luger, Christopher M. Homan, Marcos Zampieri

Translating to and from low-resource languages is a challenge for machine translation (MT) systems due to a lack of parallel data.

BIG-bench Machine Learning Machine Translation +1

Paper
Add Code

Assessing Human Translations from French to Bambara for Machine Learning: a Pilot Study

no code implementations • 31 Mar 2020 • Michael Leventhal, Allahsera Tapo, Sarah Luger, Marcos Zampieri, Christopher M. Homan

We present novel methods for assessing the quality of human-translated aligned texts for learning machine translation models of under-resourced languages.

BIG-bench Machine Learning Machine Translation +1

Paper
Add Code

Neighborhood-based Pooling for Population-level Label Distribution Learning

1 code implementation • 16 Mar 2020 • Tharindu Cyril Weerasooriya, Tong Liu, Christopher M. Homan

Supervised machine learning often requires human-annotated data.

Paper
Code

Twitter Job/Employment Corpus: A Dataset of Job-Related Discourse Built with Humans in the Loop

no code implementations • 30 Jan 2019 • Tong Liu, Christopher M. Homan

We present the Twitter Job/Employment Corpus, a collection of tweets annotated by a humans-in-the-loop supervised learning framework that integrates crowdsourcing contributions and expertise on the local community and employment environment.

Paper
Add Code

Learning from various labeling strategies for suicide-related messages on social media: An experimental study

1 code implementation • 30 Jan 2017 • Tong Liu, Qijin Cheng, Christopher M. Homan, Vincent M. B. Silenzio

Suicide is an important but often misunderstood problem, one that researchers are now seeking to better understand through social media.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.