no code implementations • ACL 2022 • Ka Wong, Praveen Paritosh
In these instances, the data reliability is under-reported, and a proposed k-rater reliability (kRR) should be used as the correct data reliability for aggregated datasets.
no code implementations • ACL 2021 • Ka Wong, Praveen Paritosh, Lora Aroyo
When collecting annotations and labeled data from humans, a standard practice is to use inter-rater reliability (IRR) as a measure of data goodness (Hallgren, 2012).
no code implementations • 11 Jun 2021 • Ka Wong, Praveen Paritosh, Lora Aroyo
We present a new approach to interpreting IRR that is empirical and contextualized.