Search Results for author: Jingye Chen

Found 13 papers, 8 papers with code

LLMs Meet Multimodal Generation and Editing: A Survey

1 code implementation29 May 2024 Yingqing He, Zhaoyang Liu, Jingye Chen, Zeyue Tian, Hongyu Liu, Xiaowei Chi, Runtao Liu, Ruibin Yuan, Yazhou Xing, Wenhai Wang, Jifeng Dai, Yong Zhang, Wei Xue, Qifeng Liu, Yike Guo, Qifeng Chen

With the recent advancement in large language models (LLMs), there is a growing interest in combining LLMs with multimodal learning.

multimodal generation

TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering

no code implementations28 Nov 2023 Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

The diffusion model has been proven a powerful generative model in recent years, yet remains a challenge in generating visual text.

Language Modelling Large Language Model +1

TextDiffuser: Diffusion Models as Text Painters

no code implementations NeurIPS 2023 Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

Diffusion models have gained increasing attention for their impressive generation abilities but currently struggle with rendering accurate and coherent text.

Optical Character Recognition (OCR)

Chinese Character Recognition with Radical-Structured Stroke Trees

no code implementations24 Nov 2022 Haiyang Yu, Jingye Chen, Bin Li, xiangyang xue

In this paper, we represent each Chinese character as a stroke tree, which is organized according to its radical structures, to fully exploit the merits of both radical and stroke levels in a decent way.

Decoder

Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study

1 code implementation30 Dec 2021 Haiyang Yu, Jingye Chen, Bin Li, jianqi ma, Mengnan Guan, Xixi Xu, Xiaocong Wang, Shaobo Qu, xiangyang xue

The experimental results indicate that the performance of baselines on CTR datasets is not as good as that on English datasets due to the characteristics of Chinese texts that are quite different from the Latin alphabet.

Attribute Benchmarking +1

MT-TransUNet: Mediating Multi-Task Tokens in Transformers for Skin Lesion Segmentation and Classification

1 code implementation3 Dec 2021 Jingye Chen, Jieneng Chen, Zongwei Zhou, Bin Li, Alan Yuille, Yongyi Lu

However, these approaches formulated skin cancer diagnosis as a simple classification task, dismissing the potential benefit from lesion segmentation.

Classification Computational Efficiency +4

Zero-Shot Chinese Character Recognition with Stroke-Level Decomposition

1 code implementation22 Jun 2021 Jingye Chen, Bin Li, xiangyang xue

Inspired by the fact that humans can generalize to know how to write characters unseen before if they have learned stroke orders of some characters, we propose a stroke-based method by decomposing each character into a sequence of strokes, which are the most basic units of Chinese characters.

Towards Brain-inspired System: Deep Recurrent Reinforcement Learning for Simulated Self-driving Agent

no code implementations29 Mar 2019 Jieneng Chen, Jingye Chen, Ruiming Zhang, Xiaobin Hu

Because of the tremendous research that focuses on human brains and reinforcement learning, scientists have investigated how robots can autonomously tackle complex tasks in the form of a self-driving agent control in a human-like way.

Decision Making OpenAI Gym +2

Cannot find the paper you are looking for? You can Submit a new open access paper.