Search Results for author: Taku Hasegawa

Found 2 papers, 1 paper with code

Scene-Text Aware Image and Text Retrieval with Dual-Encoder

no code implementations • ACL 2022 • Shumpei Miyawaki, Taku Hasegawa, Kyosuke Nishida, Takuma Kato, Jun Suzuki

We tackle the tasks of image and text retrieval using a dual-encoder model in which images and text are encoded independently.

Retrieval, Text Retrieval

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images

1 code implementation • 12 Jan 2023 • Ryota Tanaka, Kyosuke Nishida, Kosuke Nishida, Taku Hasegawa, Itsumi Saito, Kuniko Saito

Visual question answering on document images that contain textual, visual, and layout information, called document VQA, has received much attention recently.

Evidence Selection, Question Answering, +1
