Search Results for author: Shumpei Miyawaki

Found 4 papers, 1 paper with code

Scene-Text Aware Image and Text Retrieval with Dual-Encoder

no code implementations ACL 2022 Shumpei Miyawaki, Taku Hasegawa, Kyosuke Nishida, Takuma Kato, Jun Suzuki

We tackle the tasks of image and text retrieval using a dual-encoder model in which images and text are encoded independently.

Text Retrieval

Empirical Analysis of Large Vision-Language Models against Goal Hijacking via Visual Prompt Injection

no code implementations 7 Aug 2024 Subaru Kimura, Ryota Tanaka, Shumpei Miyawaki, Jun Suzuki, Keisuke Sakaguchi

We explore visual prompt injection (VPI) that maliciously exploits the ability of large vision-language models (LVLMs) to follow instructions drawn onto the input image.

Instruction Following
