Search Results for author: Yasuhisa Fujii

Found 17 papers, 5 papers with code

Sequence-to-Label Script Identification for Multilingual OCR

no code implementations15 Aug 2017 Yasuhisa Fujii, Karel Driesen, Jonathan Baccash, Ash Hurst, Ashok C. Popat

Therefore we reframe line script identification as a sequence-to-label problem and solve it using two components, trained end-toend: Encoder and Summarizer.

Optical Character Recognition (OCR)

Towards Unconstrained End-to-End Text Spotting

no code implementations ICCV 2019 Siyang Qin, Alessandro Bissacco, Michalis Raptis, Yasuhisa Fujii, Ying Xiao

We propose an end-to-end trainable network that can simultaneously detect and recognize text of arbitrary shape, making substantial progress on the open problem of reading scene text of irregular shape.

Instance Segmentation Optical Character Recognition (OCR) +3

Post-OCR Paragraph Recognition by Graph Convolutional Networks

no code implementations29 Jan 2021 Renshen Wang, Yasuhisa Fujii, Ashok C. Popat

We propose a new approach for paragraph recognition in document images by spatial graph convolutional networks (GCN) applied on OCR text boxes.

Clustering Optical Character Recognition (OCR)

Rethinking Text Line Recognition Models

1 code implementation15 Apr 2021 Daniel Hernandez Diaz, Siyang Qin, Reeve Ingle, Yasuhisa Fujii, Alessandro Bissacco

Unlike the more common Transformer-based models, this architecture can handle inputs of arbitrary length, a requirement for universal line recognition.

Ranked #2 on Handwritten Text Recognition on IAM (using extra training data)

Handwritten Text Recognition Language Modelling

Unified Line and Paragraph Detection by Graph Convolutional Networks

no code implementations17 Mar 2022 Shuang Liu, Renshen Wang, Michalis Raptis, Yasuhisa Fujii

We formulate the task of detecting lines and paragraphs in a document into a unified two-level clustering problem.

Clustering Text Detection

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

1 code implementation3 May 2023 Cheng-Yu Hsieh, Chun-Liang Li, Chih-Kuan Yeh, Hootan Nakhost, Yasuhisa Fujii, Alexander Ratner, Ranjay Krishna, Chen-Yu Lee, Tomas Pfister

Third, we reduce both the model size and the amount of data required to outperform LLMs; our finetuned 770M T5 model outperforms the few-shot prompted 540B PaLM model using only 80% of available data on a benchmark, whereas standard finetuning the same T5 model struggles to match even by using 100% of the dataset.

Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation

no code implementations4 May 2023 Renshen Wang, Yasuhisa Fujii, Alessandro Bissacco

Text reading order is a crucial aspect in the output of an OCR engine, with a large impact on downstream tasks.

Optical Character Recognition (OCR)

OCR Language Models with Custom Vocabularies

no code implementations18 Aug 2023 Peter Garst, Reeve Ingle, Yasuhisa Fujii

Language models are useful adjuncts to optical models for producing accurate optical character recognition (OCR) results.

Language Modelling Optical Character Recognition +1

Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis

1 code implementation25 Oct 2023 Shangbang Long, Siyang Qin, Yasuhisa Fujii, Alessandro Bissacco, Michalis Raptis

We propose Hierarchical Text Spotter (HTS), a novel method for the joint task of word-level text spotting and geometric layout analysis.

Text Spotting

Cannot find the paper you are looking for? You can Submit a new open access paper.