no code implementations • 14 Apr 2025 • Ryota Tanaka, Taichi Iki, Taku Hasegawa, Kyosuke Nishida, Kuniko Saito, Jun Suzuki
We aim to develop a retrieval-augmented generation (RAG) framework that answers questions over a corpus of visually-rich documents presented in mixed modalities (e. g., charts, tables) and diverse formats (e. g., PDF, PPTX).
no code implementations • 4 Feb 2025 • Yui Oka, Taku Hasegawa, Kyosuke Nishida, Kuniko Saito
From these insights, we propose a new position representation method that captures multiple scales (i. e., window sizes) by leveraging wavelet transforms without limiting the model's attention field.
1 code implementation • 15 Jan 2025 • Kazutoshi Shinoda, Nobukatsu Hojo, Kyosuke Nishida, Saki Mizuno, Keita Suzuki, Ryo Masumura, Hiroaki Sugiyama, Kuniko Saito
These verbalized thoughts serve as answers to questions designed to assess the mental states of characters within conversations.
no code implementations • 7 Oct 2024 • Kosuke Nishida, Kyosuke Nishida, Kuniko Saito
WeSaR introduces a gate parameter per parameter matrix and adjusts it to the value satisfying the requirements.
1 code implementation • 24 Jan 2024 • Ryota Tanaka, Taichi Iki, Kyosuke Nishida, Kuniko Saito, Jun Suzuki
We study the problem of completing various visual document understanding (VDU) tasks, e. g., question answering and information extraction, on real-world documents through human-written instructions.
1 code implementation • 12 Jan 2023 • Ryota Tanaka, Kyosuke Nishida, Kosuke Nishida, Taku Hasegawa, Itsumi Saito, Kuniko Saito
Visual question answering on document images that contain textual, visual, and layout information, called document VQA, has received much attention recently.
no code implementations • IJCNLP 2017 • Itsumi Saito, Kyosuke Nishida, Kugatsu Sadamitsu, Kuniko Saito, Junji Tomita
Social media texts, such as tweets from Twitter, contain many types of non-standard tokens, and the number of normalization approaches for handling such noisy text has been increasing.
no code implementations • LREC 2012 • Kugatsu Sadamitsu, Kuniko Saito, Kenji Imamura, Yoshihiro Matsuo
This paper proposes a new method of constructing arbitrary class-based related word dictionaries on interactive topic models; we assume that each class is described by a topic.