Search Results for author: Noriyuki Kojima

Found 10 papers, 3 papers with code

A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models

1 code implementation6 Sep 2023 Noriyuki Kojima, Hadar Averbuch-Elor, Yoav Artzi

Key to tasks that require reasoning about natural language in visual contexts is grounding words and phrases to image regions.

Phrase Grounding

Abstract Visual Reasoning with Tangram Shapes

no code implementations29 Nov 2022 Anya Ji, Noriyuki Kojima, Noah Rush, Alane Suhr, Wai Keen Vong, Robert D. Hawkins, Yoav Artzi

We introduce KiloGram, a resource for studying abstract visual reasoning in humans and machines.

Visual Reasoning

Markup-to-Image Diffusion Models with Scheduled Sampling

1 code implementation11 Oct 2022 Yuntian Deng, Noriyuki Kojima, Alexander M. Rush

These experiments each verify the effectiveness of the diffusion process and the use of scheduled sampling to fix generation issues.

Denoising Image Generation +1

Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior

no code implementations10 Aug 2021 Noriyuki Kojima, Alane Suhr, Yoav Artzi

We study continual learning for natural language instruction generation, by observing human users' instruction execution.

Continual Learning

OASIS: A Large-Scale Dataset for Single Image 3D in the Wild

no code implementations CVPR 2020 Weifeng Chen, Shengyi Qian, David Fan, Noriyuki Kojima, Max Hamilton, Jia Deng

Single-view 3D is the task of recovering 3D properties such as depth and surface normals from a single image.

Representing Movie Characters in Dialogues

no code implementations CONLL 2019 Mahmoud Azab, Noriyuki Kojima, Jia Deng, Rada Mihalcea

We introduce a new embedding model to represent movie characters and their interactions in a dialogue by encoding in the same representation the language used by these characters as well as information about the other participants in the dialogue.

Question Answering Relation Classification +1

To Learn or Not to Learn: Analyzing the Role of Learning for Navigation in Virtual Environments

no code implementations26 Jul 2019 Noriyuki Kojima, Jia Deng

In this paper we compare learning-based methods and classical methods for navigation in virtual environments.

Collision Avoidance Management

Speaker Naming in Movies

no code implementations NAACL 2018 Mahmoud Azab, Mingzhe Wang, Max Smith, Noriyuki Kojima, Jia Deng, Rada Mihalcea

We propose a new model for speaker naming in movies that leverages visual, textual, and acoustic modalities in an unified optimization framework.

Cannot find the paper you are looking for? You can Submit a new open access paper.