1 code implementation • 7 Jul 2023 • Tharindu Cyril Weerasooriya, Sarah Luger, Saloni Poddar, Ashiqur R. KhudaBukhsh, Christopher M. Homan
Human-annotated data plays a critical role in the fairness of AI systems, including those that deal with life-altering decisions or moderating human-created web/social media content.
2 code implementations • 29 Jan 2023 • Tharindu Cyril Weerasooriya, Sujan Dutta, Tharindu Ranasinghe, Marcos Zampieri, Christopher M. Homan, Ashiqur R. KhudaBukhsh
For (2), we introduce a first-of-its-kind dataset of vicarious offense.
1 code implementation • RANLP 2021 • Saurabh Gaikwad, Tharindu Ranasinghe, Marcos Zampieri, Christopher M. Homan
The widespread presence of offensive language on social media motivated the development of systems capable of recognizing such content automatically.
no code implementations • NLPerspectives (LREC) 2022 • Tharindu Cyril Weerasooriya, Alexander G. Ororbia, Christopher M. Homan
We propose a fully Bayesian framework for learning ground truth labels from noisy annotators.
no code implementations • SEMEVAL 2021 • Abhinandan Desai, Kai North, Marcos Zampieri, Christopher M. Homan
This paper describes team LCP-RIT's submission to the SemEval-2021 Task 1: Lexical Complexity Prediction (LCP).
no code implementations • 31 Mar 2021 • Allahsera Auguste Tapo, Michael Leventhal, Sarah Luger, Christopher M. Homan, Marcos Zampieri
Translating to and from low-resource languages is a challenge for machine translation (MT) systems due to a lack of parallel data.
no code implementations • 31 Mar 2020 • Michael Leventhal, Allahsera Tapo, Sarah Luger, Marcos Zampieri, Christopher M. Homan
We present novel methods for assessing the quality of human-translated aligned texts for learning machine translation models of under-resourced languages.
1 code implementation • 16 Mar 2020 • Tharindu Cyril Weerasooriya, Tong Liu, Christopher M. Homan
Supervised machine learning often requires human-annotated data.
no code implementations • 30 Jan 2019 • Tong Liu, Christopher M. Homan
We present the Twitter Job/Employment Corpus, a collection of tweets annotated by a humans-in-the-loop supervised learning framework that integrates crowdsourcing contributions and expertise on the local community and employment environment.
1 code implementation • 30 Jan 2017 • Tong Liu, Qijin Cheng, Christopher M. Homan, Vincent M. B. Silenzio
Suicide is an important but often misunderstood problem, one that researchers are now seeking to better understand through social media.