Search Results for author: Jingye Chen

Found 12 papers, 7 papers with code

TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering

no code implementations • 28 Nov 2023 • Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

The diffusion model has been proven a powerful generative model in recent years, yet remains a challenge in generating visual text.

Language Modelling Large Language Model +1

Paper
Add Code

Kosmos-2.5: A Multimodal Literate Model

no code implementations • 20 Sep 2023 • Tengchao Lv, Yupan Huang, Jingye Chen, Lei Cui, Shuming Ma, Yaoyao Chang, Shaohan Huang, Wenhui Wang, Li Dong, Weiyao Luo, Shaoxiang Wu, Guoxin Wang, Cha Zhang, Furu Wei

We present Kosmos-2. 5, a multimodal literate model for machine reading of text-intensive images.

Reading Comprehension Text Generation

Paper
Add Code

TextDiffuser: Diffusion Models as Text Painters

no code implementations • NeurIPS 2023 • Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

Diffusion models have gained increasing attention for their impressive generation abilities but currently struggle with rendering accurate and coherent text.

Optical Character Recognition (OCR)

Paper
Add Code

Chinese Character Recognition with Radical-Structured Stroke Trees

no code implementations • 24 Nov 2022 • Haiyang Yu, Jingye Chen, Bin Li, xiangyang xue

In this paper, we represent each Chinese character as a stroke tree, which is organized according to its radical structures, to fully exploit the merits of both radical and stroke levels in a decent way.

Paper
Add Code

XDoc: Unified Pre-training for Cross-Format Document Understanding

1 code implementation • 6 Oct 2022 • Jingye Chen, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei

The surge of pre-training has witnessed the rapid development of document understanding recently.

Ranked #7 on Semantic entity labeling on FUNSD

document understanding Semantic entity labeling

18,321

Paper
Code

Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study

1 code implementation • 30 Dec 2021 • Haiyang Yu, Jingye Chen, Bin Li, jianqi ma, Mengnan Guan, Xixi Xu, Xiaocong Wang, Shaobo Qu, xiangyang xue

The experimental results indicate that the performance of baselines on CTR datasets is not as good as that on English datasets due to the characteristics of Chinese texts that are quite different from the Latin alphabet.

Attribute Benchmarking +1

384

Paper
Code

Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution

1 code implementation • 13 Dec 2021 • Jingye Chen, Haiyang Yu, jianqi ma, Bin Li, xiangyang xue

However, the recognition of low-resolution scene text images remains a challenge.

Image Super-Resolution Scene Text Recognition

309

Paper
Code

MT-TransUNet: Mediating Multi-Task Tokens in Transformers for Skin Lesion Segmentation and Classification

1 code implementation • 3 Dec 2021 • Jingye Chen, Jieneng Chen, Zongwei Zhou, Bin Li, Alan Yuille, Yongyi Lu

However, these approaches formulated skin cancer diagnosis as a simple classification task, dismissing the potential benefit from lesion segmentation.

Classification Computational Efficiency +4

Paper
Code

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

2 code implementations • 21 Sep 2021 • Minghao Li, Tengchao Lv, Jingye Chen, Lei Cui, Yijuan Lu, Dinei Florencio, Cha Zhang, Zhoujun Li, Furu Wei

Text recognition is a long-standing research problem for document digitalization.

Ranked #3 on Handwritten Text Recognition on IAM

Handwritten Text Recognition Language Modelling +4

124,889

Paper
Code

Zero-Shot Chinese Character Recognition with Stroke-Level Decomposition

1 code implementation • 22 Jun 2021 • Jingye Chen, Bin Li, xiangyang xue

Inspired by the fact that humans can generalize to know how to write characters unseen before if they have learned stroke orders of some characters, we propose a stroke-based method by decomposing each character into a sequence of strokes, which are the most basic units of Chinese characters.

309

Paper
Code

Scene Text Telescope: Text-Focused Scene Image Super-Resolution

1 code implementation • CVPR 2021 • Jingye Chen, Bin Li, xiangyang xue

Image super-resolution, which is often regarded as a preprocessing procedure of scene text recognition, aims to recover the realistic features from a low-resolution text image.

Ranked #3 on Optical Character Recognition (OCR) on Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study

Image Super-Resolution Optical Character Recognition (OCR) +2

309

Paper
Code

Towards Brain-inspired System: Deep Recurrent Reinforcement Learning for Simulated Self-driving Agent

no code implementations • 29 Mar 2019 • Jieneng Chen, Jingye Chen, Ruiming Zhang, Xiaobin Hu

Because of the tremendous research that focuses on human brains and reinforcement learning, scientists have investigated how robots can autonomously tackle complex tasks in the form of a self-driving agent control in a human-like way.

Decision Making OpenAI Gym +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.