Search Results for author: Chenhui Chu

Found 79 papers, 17 papers with code

Flexible Visual Grounding

1 code implementation ACL 2022 Yongmin Kim, Chenhui Chu, Sadao Kurohashi

Existing visual grounding datasets are artificially made, where every query regarding an entity must be able to be grounded to a corresponding image region, i. e., answerable.

Visual Grounding

Meta Ensemble for Japanese-Chinese Neural Machine Translation: Kyoto-U+ECNU Participation to WAT 2020

no code implementations AACL (WAT) 2020 Zhuoyuan Mao, Yibin Shen, Chenhui Chu, Sadao Kurohashi, Cheqing Jin

This paper describes the Japanese-Chinese Neural Machine Translation (NMT) system submitted by the joint team of Kyoto University and East China Normal University (Kyoto-U+ECNU) to WAT 2020 (Nakazawa et al., 2020).

Denoising Machine Translation +2

Beyond English-Centric LLMs: What Language Do Multilingual Language Models Think in?

no code implementations20 Aug 2024 Chengzhi Zhong, Fei Cheng, Qianying Liu, Junfeng Jiang, Zhen Wan, Chenhui Chu, Yugo Murawaki, Sadao Kurohashi

We examine the latent language of three typical categories of models for Japanese processing: Llama2, an English-centric model; Swallow, an English-centric model with continued pre-training in Japanese; and LLM-jp, a model pre-trained on balanced English and Japanese corpora.

StyEmp: Stylizing Empathetic Response Generation via Multi-Grained Prefix Encoder and Personality Reinforcement

no code implementations5 Aug 2024 Yahui Fu, Chenhui Chu, Tatsuya Kawahara

Recent approaches for empathetic response generation mainly focus on emotional resonance and user understanding, without considering the system's personality.

Contrastive Learning Empathetic Response Generation +1

Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents

no code implementations23 Mar 2024 Hao Wang, Tang Li, Chenhui Chu, Nengjun Zhu, Rui Wang, Pinpin Zhu

This approach aims to generate relation representations that are more aware of the spatial context and unseen relation in a manner similar to human perception.

Document AI Reading Comprehension +2

MOS-FAD: Improving Fake Audio Detection Via Automatic Mean Opinion Score Prediction

no code implementations24 Jan 2024 Wangjin Zhou, Zhengdong Yang, Chenhui Chu, Sheng Li, Raj Dabre, Yi Zhao, Tatsuya Kawahara

We propose MOS-FAD, where MOS can be leveraged at two key points in FAD: training data selection and model fusion.

FAD

MM-LLMs: Recent Advances in MultiModal Large Language Models

no code implementations24 Jan 2024 Duzhen Zhang, Yahan Yu, Jiahua Dong, Chenxing Li, Dan Su, Chenhui Chu, Dong Yu

In the past year, MultiModal Large Language Models (MM-LLMs) have undergone substantial advancements, augmenting off-the-shelf LLMs to support MM inputs or outputs via cost-effective training strategies.

Decision Making

Bilingual Corpus Mining and Multistage Fine-Tuning for Improving Machine Translation of Lecture Transcripts

1 code implementation7 Nov 2023 Haiyue Song, Raj Dabre, Chenhui Chu, Atsushi Fujita, Sadao Kurohashi

To create the parallel corpora, we propose a dynamic programming based sentence alignment algorithm which leverages the cosine similarity of machine-translated sentences.

Benchmarking Machine Translation +3

Video-Helpful Multimodal Machine Translation

1 code implementation31 Oct 2023 Yihang Li, Shuichiro Shimizu, Chenhui Chu, Sadao Kurohashi, Wei Li

In addition to the extensive training set, EVA contains a video-helpful evaluation set in which subtitles are ambiguous, and videos are guaranteed helpful for disambiguation.

Multimodal Machine Translation Translation

DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading

1 code implementation23 Oct 2023 Hao Wang, Qingxuan Wang, Yue Li, Changqing Wang, Chenhui Chu, Rui Wang

The use of visually-rich documents (VRDs) in various fields has created a demand for Document AI models that can read and comprehend documents like humans, which requires the overcoming of technical, linguistic, and cognitive barriers.

document understanding Reading Comprehension

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning

no code implementations23 Oct 2023 Hao Wang, Xiahua Chen, Rui Wang, Chenhui Chu

Extracting meaningful entities belonging to predefined categories from Visually-rich Form-like Documents (VFDs) is a challenging task.

Variable-length Neural Interlingua Representations for Zero-shot Neural Machine Translation

no code implementations17 May 2023 Zhuoyuan Mao, Haiyue Song, Raj Dabre, Chenhui Chu, Sadao Kurohashi

The language-independency of encoded representations within multilingual neural machine translation (MNMT) models is crucial for their generalization ability on zero-shot translation.

Machine Translation Translation

Towards Speech Dialogue Translation Mediating Speakers of Different Languages

1 code implementation16 May 2023 Shuichiro Shimizu, Chenhui Chu, Sheng Li, Sadao Kurohashi

We present a new task, speech dialogue translation mediating speakers of different languages.

Translation

EMS: Efficient and Effective Massively Multilingual Sentence Embedding Learning

1 code implementation31 May 2022 Zhuoyuan Mao, Chenhui Chu, Sadao Kurohashi

Massively multilingual sentence representation models, e. g., LASER, SBERT-distill, and LaBSE, help significantly improve cross-lingual downstream tasks.

Contrastive Learning Genre classification +7

When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?

no code implementations Findings (NAACL) 2022 Zhuoyuan Mao, Chenhui Chu, Raj Dabre, Haiyue Song, Zhen Wan, Sadao Kurohashi

Meanwhile, the contrastive objective can implicitly utilize automatically learned word alignment, which has not been explored in many-to-many NMT.

Machine Translation NMT +5

Fusion of Self-supervised Learned Models for MOS Prediction

no code implementations11 Apr 2022 Zhengdong Yang, Wangjin Zhou, Chenhui Chu, Sheng Li, Raj Dabre, Raphael Rubino, Yi Zhao

This challenge aims to predict MOS scores of synthetic speech on two tracks, the main track and a more challenging sub-track: out-of-domain (OOD).

Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation

1 code implementation20 Jan 2022 Zhuoyuan Mao, Chenhui Chu, Sadao Kurohashi

In the present study, we propose novel sequence-to-sequence pre-training objectives for low-resource machine translation (NMT): Japanese-specific sequence to sequence (JASS) for language pairs involving Japanese as the source or target language, and English-specific sequence to sequence (ENSS) for language pairs involving English.

Low Resource Neural Machine Translation Low-Resource Neural Machine Translation +2

VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation

1 code implementation LREC 2022 Yihang Li, Shuichiro Shimizu, Weiqi Gu, Chenhui Chu, Sadao Kurohashi

Existing multimodal machine translation (MMT) datasets consist of images and video captions or general subtitles, which rarely contain linguistic ambiguity, making visual information not so effective to generate appropriate translations.

Multimodal Machine Translation Sentence +1

Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers

no code implementations ACL 2021 Jules Samaran, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima

The impressive performances of pre-trained visually grounded language models have motivated a growing body of research investigating what has been learned during the pre-training.

Language Modelling Visual Grounding

Video-guided Machine Translation with Spatial Hierarchical Attention Network

no code implementations ACL 2021 Weiqi Gu, Haiyue Song, Chenhui Chu, Sadao Kurohashi

Video-guided machine translation, as one type of multimodal machine translations, aims to engage video contents as auxiliary information to address the word sense ambiguity problem in machine translation.

Action Detection Machine Translation +2

A Picture May Be Worth a Hundred Words for Visual Question Answering

no code implementations25 Jun 2021 Yusuke Hirota, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima, Ittetsu Taniguchi, Takao Onoye

This paper delves into the effectiveness of textual representations for image understanding in the specific context of VQA.

Data Augmentation Descriptive +3

WRIME: A New Dataset for Emotional Intensity Estimation with Subjective and Objective Annotations

1 code implementation NAACL 2021 Tomoyuki Kajiwara, Chenhui Chu, Noriko Takemura, Yuta Nakashima, Hajime Nagahara

We annotate 17, 000 SNS posts with both the writer{'}s subjective emotional intensity and the reader{'}s objective one to construct a Japanese emotion analysis dataset.

Emotion Recognition

Lightweight Cross-Lingual Sentence Representation Learning

1 code implementation ACL 2021 Zhuoyuan Mao, Prakhar Gupta, Pei Wang, Chenhui Chu, Martin Jaggi, Sadao Kurohashi

Large-scale models for learning fixed-dimensional cross-lingual sentence representations like LASER (Artetxe and Schwenk, 2019b) lead to significant improvement in performance on downstream tasks.

Contrastive Learning Document Classification +5

Understanding the Role of Scene Graphs in Visual Question Answering

no code implementations14 Jan 2021 Vinay Damodaran, Sharanya Chakravarthy, Akshay Kumar, Anjana Umapathy, Teruko Mitamura, Yuta Nakashima, Noa Garcia, Chenhui Chu

Visual Question Answering (VQA) is of tremendous interest to the research community with important applications such as aiding visually impaired users and image-based search.

Graph Generation Question Answering +2

Multilingual Neural Machine Translation

no code implementations COLING 2020 Raj Dabre, Chenhui Chu, Anoop Kunchukuttan

The advent of neural machine translation (NMT) has opened up exciting research in building multilingual translation systems i. e. translation models that can handle more than one language pair.

Machine Translation NMT +2

Double Attention-based Multimodal Neural Machine Translation with Semantic Image Regions

1 code implementation EAMT 2020 YuTing Zhao, Mamoru Komachi, Tomoyuki Kajiwara, Chenhui Chu

In contrast, we propose the application of semantic image regions for MNMT by integrating visual and textual features using two individual attention mechanisms (double attention).

Machine Translation Translation

A Corpus for English-Japanese Multimodal Neural Machine Translation with Comparable Sentences

no code implementations17 Oct 2020 Andrew Merritt, Chenhui Chu, Yuki Arase

Multimodal neural machine translation (NMT) has become an increasingly important area of research over the years because additional modalities, such as image data, can provide more context to textual data.

Image Captioning Machine Translation +3

Lexically Cohesive Neural Machine Translation with Copy Mechanism

no code implementations11 Oct 2020 Vipul Mishra, Chenhui Chu, Yuki Arase

Lexically cohesive translations preserve consistency in word choices in document-level translation.

Machine Translation Translation

Text Classification with Negative Supervision

no code implementations ACL 2020 Sora Ohashi, Junya Takayama, Tomoyuki Kajiwara, Chenhui Chu, Yuki Arase

Advanced pre-trained models for text representation have achieved state-of-the-art performance on various text classification tasks.

General Classification Semantic Similarity +4

Annotation of Adverse Drug Reactions in Patients' Weblogs

no code implementations LREC 2020 Yuki Arase, Tomoyuki Kajiwara, Chenhui Chu

The dataset we present in this paper is unique for the richness of annotated information, including detailed descriptions of drug reactions with full context.

Constructing a Public Meeting Corpus

no code implementations LREC 2020 Koji Tanaka, Chenhui Chu, Haolin Ren, Benjamin Renoust, Yuta Nakashima, Noriko Takemura, Hajime Nagahara, Takao Fujikawa

In this paper, we propose a full pipeline of analysis of a large corpus about a century of public meeting in historical Australian news papers, from construction to visual exploration.

Optical Character Recognition (OCR)

Knowledge-Based Visual Question Answering in Videos

no code implementations17 Apr 2020 Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima

We propose a novel video understanding task by fusing knowledge-based and video question answering.

Question Answering Video Question Answering +2

A Comprehensive Survey of Multilingual Neural Machine Translation

no code implementations4 Jan 2020 Raj Dabre, Chenhui Chu, Anoop Kunchukuttan

We present a survey on multilingual neural machine translation (MNMT), which has gained a lot of traction in the recent years.

Machine Translation NMT +2

Revisiting Simple Domain Adaptation Methods in Unsupervised Neural Machine Translation

no code implementations26 Aug 2019 Haipeng Sun, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita, Tiejun Zhao, Chenhui Chu

However, it has not been well-studied for unsupervised neural machine translation (UNMT), although UNMT has recently achieved remarkable results in several domain-specific language pairs.

Domain Adaptation Machine Translation +1

Multilingual Multi-Domain Adaptation Approaches for Neural Machine Translation

no code implementations19 Jun 2019 Chenhui Chu, Raj Dabre

In this paper, we propose two novel methods for domain adaptation for the attention-only neural machine translation (NMT) model, i. e., the Transformer.

Domain Adaptation Machine Translation +2

Using Natural Language Processing to Develop an Automated Orthodontic Diagnostic System

no code implementations31 May 2019 Tomoyuki Kajiwara, Chihiro Tanikawa, Yuujin Shimizu, Chenhui Chu, Takashi Yamashiro, Hajime Nagahara

We work on the task of automatically designing a treatment plan from the findings included in the medical certificate written by the dentist.

A Brief Survey of Multilingual Neural Machine Translation

no code implementations14 May 2019 Raj Dabre, Chenhui Chu, Anoop Kunchukuttan

We present a survey on multilingual neural machine translation (MNMT), which has gained a lot of traction in the recent years.

Machine Translation Transfer Learning +1

iParaphrasing: Extracting Visually Grounded Paraphrases via an Image

1 code implementation COLING 2018 Chenhui Chu, Mayu Otani, Yuta Nakashima

These extracted VGPs have the potential to improve language and image multimodal tasks such as visual question answering and image captioning.

Image Captioning Question Answering +1

A Survey of Domain Adaptation for Neural Machine Translation

no code implementations COLING 2018 Chenhui Chu, Rui Wang

Neural machine translation (NMT) is a deep learning based approach for machine translation, which yields the state-of-the-art translation performance in scenarios where large-scale parallel corpora are available.

Domain Adaptation Machine Translation +2

Recursive Neural Network Based Preordering for English-to-Japanese Machine Translation

no code implementations ACL 2018 Yuki Kawara, Chenhui Chu, Yuki Arase

Experiments show that the proposed method achieves comparable gain in translation quality to the state-of-the-art method but without a manual feature design.

Machine Translation Translation

An Empirical Comparison of Simple Domain Adaptation Methods for Neural Machine Translation

no code implementations12 Jan 2017 Chenhui Chu, Raj Dabre, Sadao Kurohashi

In this paper, we propose a novel domain adaptation method named "mixed fine tuning" for neural machine translation (NMT).

Domain Adaptation Machine Translation +2

Consistent Word Segmentation, Part-of-Speech Tagging and Dependency Labelling Annotation for Chinese Language

no code implementations COLING 2016 Mo Shen, Wingmui Li, HyunJeong Choe, Chenhui Chu, Daisuke Kawahara, Sadao Kurohashi

In this paper, we propose a new annotation approach to Chinese word segmentation, part-of-speech (POS) tagging and dependency labelling that aims to overcome the two major issues in traditional morphology-based annotation: Inconsistency and data sparsity.

Chinese Word Segmentation Machine Translation +6

Supervised Syntax-based Alignment between English Sentences and Abstract Meaning Representation Graphs

no code implementations7 Jun 2016 Chenhui Chu, Sadao Kurohashi

As alignment links are not given between English sentences and Abstract Meaning Representation (AMR) graphs in the AMR annotation, automatic alignment becomes indispensable for training an AMR parser.

Abstract Meaning Representation AMR Parsing

Parallel Sentence Extraction from Comparable Corpora with Neural Network Features

no code implementations LREC 2016 Chenhui Chu, Raj Dabre, Sadao Kurohashi

Parallel corpora are crucial for machine translation (MT), however they are quite scarce for most language pairs and domains.

Machine Translation Sentence +1

Constructing a Chinese---Japanese Parallel Corpus from Wikipedia

no code implementations LREC 2014 Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi

Using the system, we construct a Chinese―Japanese parallel corpus with more than 126k highly accurate parallel sentences from Wikipedia.

Machine Translation Sentence +1

Cannot find the paper you are looking for? You can Submit a new open access paper.