Search Results for author: Jiapeng Wang

Found 10 papers, 4 papers with code

DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation

no code implementations8 Mar 2024 Jiapeng Wang, Chengyu Wang, Tingfeng Cao, Jun Huang, Lianwen Jin

We present DiffChat, a novel method to align Large Language Models (LLMs) to "chat" with prompt-as-input Text-to-Image Synthesis (TIS) models (e. g., Stable Diffusion) for interactive image creation.

Image Generation Instruction Following +1

PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction

no code implementations7 Jan 2024 Zening Lin, Jiapeng Wang, Teng Li, Wenhui Liao, Dayi Huang, Longfei Xiong, Lianwen Jin

However, simply concatenating SER and RE serially can lead to severe error propagation, and it fails to handle cases like multi-line entities in real scenarios.

Entity Linking Relation Extraction

Revisiting Scene Text Recognition: A Data Perspective

1 code implementation ICCV 2023 Qing Jiang, Jiapeng Wang, Dezhi Peng, Chongyu Liu, Lianwen Jin

To this end, we consolidate a large-scale real STR dataset, namely Union14M, which comprises 4 million labeled images and 10 million unlabeled images, to assess the performance of STR models in more complex real-world scenarios.

Scene Text Recognition

ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval

no code implementations28 May 2023 Jiapeng Wang, Chengyu Wang, Xiaodan Wang, Jun Huang, Lianwen Jin

Large-scale pre-trained text-image models with dual-encoder architectures (such as CLIP) are typically adopted for various vision-language applications, including text-image retrieval.

Image Retrieval Knowledge Distillation +2

CWP: Instance complexity weighted channel-wise soft masks for network pruning

no code implementations8 Sep 2022 Jiapeng Wang, Ming Ma, Zhenhua Yu

In this paper, we propose a simple yet effective differentiable network pruning method CWP based on instance complexity weighted filter importance scores.

Network Pruning

LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding

2 code implementations ACL 2022 Jiapeng Wang, Lianwen Jin, Kai Ding

LiLT can be pre-trained on the structured documents of a single language and then directly fine-tuned on other languages with the corresponding off-the-shelf monolingual/multilingual pre-trained textual models.

Document Image Classification document understanding +2

MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction

no code implementations24 Jun 2021 Guozhi Tang, Lele Xie, Lianwen Jin, Jiapeng Wang, Jingdong Chen, Zhen Xu, Qianying Wang, Yaqiang Wu, Hui Li

Through key-value matching based on relevancy evaluation, the proposed MatchVIE can bypass the recognitions to various semantics, and simply focuses on the strong relevancy between entities.

Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

1 code implementation24 Jan 2021 Jiapeng Wang, Chongyu Liu, Lianwen Jin, Guozhi Tang, Jiaxin Zhang, Shuaitao Zhang, Qianying Wang, Yaqiang Wu, Mingxiang Cai

Visual information extraction (VIE) has attracted considerable attention recently owing to its various advanced applications such as document understanding, automatic marking and intelligent education.

3D Feature Matching document understanding +2

Joint Layout Analysis, Character Detection and Recognition for Historical Document Digitization

1 code implementation14 Jul 2020 Weihong Ma, Hesuo Zhang, Lianwen Jin, Sihang Wu, Jiapeng Wang, Yongpan Wang

In this framework, two branches named character branch and layout branch are added behind the feature extraction network.

Line Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.