no code implementations • NAACL (ACL) 2022 • Xiaomeng Pan, Hongfei Wang, Teruaki Oka, Mamoru Komachi
Creation of an ancient Chinese dataset is considered a significant challenge because determining the most appropriate sense in a context is difficult and time-consuming owing to the different usages in ancient and modern Chinese.
no code implementations • LREC 2020 • Teruaki Oka, Yuichi Ishimoto, Yutaka Yagi, Takenori Nakamura, Masayuki Asahara, Kikuo Maekawa, Toshinobu Ogiso, Hanae Koiso, Kumiko Sakoda, Nobuko Kibe
The National Institute for Japanese Language and Linguistics, Japan (NINJAL, Japan), has developed several types of corpora.
no code implementations • WS 2016 • Teruaki Oka, Tomoaki Kono
Moreover, we found that we can locate the uncertain transcriptionsin our corpus and compare them to other transcriptions, by using the alignment probabilities.