no code implementations • COLING 2022 • Abhisek Chakrabarty, Raj Dabre, Chenchen Ding, Hideki Tanaka, Masao Utiyama, Eiichiro Sumita
In this paper we present FeatureBART, a linguistically motivated sequence-to-sequence monolingual pre-training strategy in which syntactic features such as lemma, part-of-speech and dependency labels are incorporated into the span prediction based pre-training framework (BART).
no code implementations • MTSummit 2021 • Keiji Yasuda, Ichiro Yamada, Naoaki Okazaki, Hideki Tanaka, Hidehiro Asaka, Takeshi Anzai, Fumiaki Sugaya
The second technology is machine translation (MT), which enables users to read foreign news articles in their mother language.
no code implementations • AACL (WAT) 2020 • Isao Goto, Hideya Mino, Hitoshi Ito, Kazutaka Kinugawa, Ichiro Yamada, Hideki Tanaka
This paper describes the system of the NHK-NES team for the WAT 2020 Japanese–English newswire task.
no code implementations • 17 Feb 2024 • Hiroyuki Deguchi, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe, Hideki Tanaka, Masao Utiyama
Minimum Bayes risk (MBR) decoding achieved state-of-the-art translation performance by using COMET, a neural metric that has a high correlation with human evaluation.
1 code implementation • 6 Jun 2023 • Zhishen Yang, Raj Dabre, Hideki Tanaka, Naoaki Okazaki
Automating figure caption generation helps move model understandings of scientific documents beyond text and will help authors write informative captions that facilitate communicating scientific findings.
no code implementations • LREC 2020 • Hideya Mino, Hideki Tanaka, Hitoshi Ito, Isao Goto, Ichiro Yamada, Takenobu Tokunaga
The first problem is the quality of parallel corpora.
no code implementations • WS 2019 • Hideya Mino, Hitoshi Ito, Isao Goto, Ichiro Yamada, Hideki Tanaka, Takenobu Tokunaga
The content-equivalent corpus was effective for improving translation quality, and our systems achieved the best human evaluation scores in the newswire translation tasks at WAT 2019.
no code implementations • WS 2017 • Isao Goto, Hideki Tanaka
Despite its promise, neural machine translation (NMT) has a serious problem in that source content may be mistakenly left untranslated.