Search Results for author: Kuniko Saito

Found 11 papers, 3 papers with code

VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents

no code implementations14 Apr 2025 Ryota Tanaka, Taichi Iki, Taku Hasegawa, Kyosuke Nishida, Kuniko Saito, Jun Suzuki

We aim to develop a retrieval-augmented generation (RAG) framework that answers questions over a corpus of visually-rich documents presented in mixed modalities (e. g., charts, tables) and diverse formats (e. g., PDF, PPTX).

Question Answering RAG +3

Wavelet-based Positional Representation for Long Context

no code implementations4 Feb 2025 Yui Oka, Taku Hasegawa, Kyosuke Nishida, Kuniko Saito

From these insights, we propose a new position representation method that captures multiple scales (i. e., window sizes) by leveraging wavelet transforms without limiting the model's attention field.

Position

Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes

no code implementations7 Oct 2024 Kosuke Nishida, Kyosuke Nishida, Kuniko Saito

WeSaR introduces a gate parameter per parameter matrix and adjusts it to the value satisfying the requirements.

InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions

1 code implementation24 Jan 2024 Ryota Tanaka, Taichi Iki, Kyosuke Nishida, Kuniko Saito, Jun Suzuki

We study the problem of completing various visual document understanding (VDU) tasks, e. g., question answering and information extraction, on real-world documents through human-written instructions.

document understanding Question Answering +1

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images

1 code implementation12 Jan 2023 Ryota Tanaka, Kyosuke Nishida, Kosuke Nishida, Taku Hasegawa, Itsumi Saito, Kuniko Saito

Visual question answering on document images that contain textual, visual, and layout information, called document VQA, has received much attention recently.

Evidence Selection Question Answering +1

Automatically Extracting Variant-Normalization Pairs for Japanese Text Normalization

no code implementations IJCNLP 2017 Itsumi Saito, Kyosuke Nishida, Kugatsu Sadamitsu, Kuniko Saito, Junji Tomita

Social media texts, such as tweets from Twitter, contain many types of non-standard tokens, and the number of normalization approaches for handling such noisy text has been increasing.

Machine Translation Morphological Analysis +1

Constructing a Class-Based Lexical Dictionary using Interactive Topic Models

no code implementations LREC 2012 Kugatsu Sadamitsu, Kuniko Saito, Kenji Imamura, Yoshihiro Matsuo

This paper proposes a new method of constructing arbitrary class-based related word dictionaries on interactive topic models; we assume that each class is described by a topic.

Topic Models

Cannot find the paper you are looking for? You can Submit a new open access paper.