1 code implementation • EMNLP (MRL) 2021 • Jongin Kim, Nayoung Choi, Seunghyun Lim, Jungwhan Kim, Soojin Chung, Hyunsoo Woo, Min Song, Jinho D. Choi
This paper presents a English-Korean parallel dataset that collects 381K news articles where 1, 400 of them, comprising 10K sentences, are manually labeled for crosslingual named entity recognition (NER).
no code implementations • CRAC (ACL) 2021 • Sooyoun Han, Sumin Seo, Minji Kang, Jongin Kim, Nayoung Choi, Min Song, Jinho D. Choi
This paper presents a new corpus and annotation guideline for a novel coreference resolution task on fictional texts, and analyzes its unique characteristics.
1 code implementation • 30 Nov 2023 • Jongin Kim, Byeo Rhee Back, Aditya Agrawal, Jiaxi Wu, Veronika J. Wirtz, Traci Hong, Derry Wijaya
This paper introduces a multilingual dataset of COVID-19 vaccine misinformation, consisting of annotated tweets from three middle-income countries: Brazil, Indonesia, and Nigeria.