Search Results for author: Takashi Miyazaki

Found 7 papers, 0 papers with code

A Visually-grounded First-person Dialogue Dataset with Verbal and Non-verbal Responses

no code implementations EMNLP 2020 Hisashi Kamezawa, Noriki Nishida, Nobuyuki Shimizu, Takashi Miyazaki, Hideki Nakayama

The results demonstrate that first-person vision helps neural network models correctly understand human intentions, and that producing non-verbal responses is as challenging as producing verbal responses.

How do people talk about images? A study on open-domain conversations with images.

no code implementations NAACL (ACL) 2022 Yi-Pei Chen, Nobuyuki Shimizu, Takashi Miyazaki, Hideki Nakayama

This paper explores how humans conduct conversations with images by investigating an open-domain image conversation dataset, ImageChat.

Deep Learning Based Multi-modal Addressee Recognition in Visual Scenes with Utterances

no code implementations 12 Sep 2018 Thao Minh Le, Nobuyuki Shimizu, Takashi Miyazaki, Koichi Shinoda

With the widespread use of intelligent systems such as smart speakers, addressee recognition has become a concern in human-computer interaction, as more and more people expect such systems to understand complicated social scenes, including those outdoors, in cafeterias, and in hospitals.
