Search Results for author: Kentaro Takemoto

Found 5 papers, 3 papers with code

Rethinking VLMs and LLMs for Image Classification

no code implementations • 3 Oct 2024 • Avi Cooper, Keizo Kato, Chia-Hsien Shih, Hiroaki Yamane, Kasper Vinken, Kentaro Takemoto, Taro Sunagawa, Hao-Wei Yeh, Jin Yamanaka, Ian Mason, Xavier Boix

Visual Language Models (VLMs) are now increasingly being merged with Large Language Models (LLMs) to enable new capabilities, particularly in terms of improved interactivity and open-ended responsiveness.

Classification image-classification +2

D3: Data Diversity Design for Systematic Generalization in Visual Question Answering

1 code implementation • 15 Sep 2023 • Amir Rahimi, Vanessa D'Amario, Moyuru Yamada, Kentaro Takemoto, Tomotake Sasaki, Xavier Boix

We demonstrate that this result is independent of the similarity between the training and testing data and applies to well-known families of neural network architectures for VQA (i.e., monolithic architectures and neural module networks).

Diversity Question Answering +2

Transformer Module Networks for Systematic Generalization in Visual Question Answering

1 code implementation • 27 Jan 2022 • Moyuru Yamada, Vanessa D'Amario, Kentaro Takemoto, Xavier Boix, Tomotake Sasaki

We reveal that Neural Module Networks (NMNs), i.e., question-specific compositions of modules that each tackle a sub-task, achieve systematic generalization performance better than or similar to that of conventional Transformers, even though NMNs' modules are CNN-based.

Question Answering Systematic Generalization +1
