Search Results for author: Jiapeng Wang

Found 10 papers, 4 papers with code

DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation

no code implementations • 8 Mar 2024 • Jiapeng Wang, Chengyu Wang, Tingfeng Cao, Jun Huang, Lianwen Jin

We present DiffChat, a novel method to align Large Language Models (LLMs) to "chat" with prompt-as-input Text-to-Image Synthesis (TIS) models (e. g., Stable Diffusion) for interactive image creation.

Image Generation Instruction Following +1

Paper
Add Code

PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction

no code implementations • 7 Jan 2024 • Zening Lin, Jiapeng Wang, Teng Li, Wenhui Liao, Dayi Huang, Longfei Xiong, Lianwen Jin

However, simply concatenating SER and RE serially can lead to severe error propagation, and it fails to handle cases like multi-line entities in real scenarios.

Entity Linking Relation Extraction

Paper
Add Code

Revisiting Scene Text Recognition: A Data Perspective

1 code implementation • ICCV 2023 • Qing Jiang, Jiapeng Wang, Dezhi Peng, Chongyu Liu, Lianwen Jin

To this end, we consolidate a large-scale real STR dataset, namely Union14M, which comprises 4 million labeled images and 10 million unlabeled images, to assess the performance of STR models in more complex real-world scenarios.

Scene Text Recognition

1,853

Paper
Code

ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval

no code implementations • 28 May 2023 • Jiapeng Wang, Chengyu Wang, Xiaodan Wang, Jun Huang, Lianwen Jin

Large-scale pre-trained text-image models with dual-encoder architectures (such as CLIP) are typically adopted for various vision-language applications, including text-image retrieval.

Image Retrieval Knowledge Distillation +2

Paper
Add Code

CWP: Instance complexity weighted channel-wise soft masks for network pruning

no code implementations • 8 Sep 2022 • Jiapeng Wang, Ming Ma, Zhenhua Yu

In this paper, we propose a simple yet effective differentiable network pruning method CWP based on instance complexity weighted filter importance scores.

Network Pruning

Paper
Add Code

LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding

2 code implementations • ACL 2022 • Jiapeng Wang, Lianwen Jin, Kai Ding

LiLT can be pre-trained on the structured documents of a single language and then directly fine-tuned on other languages with the corresponding off-the-shelf monolingual/multilingual pre-trained textual models.

Ranked #5 on Key Information Extraction on CORD

Document Image Classification document understanding +2

124,889

Paper
Code

MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction

no code implementations • 24 Jun 2021 • Guozhi Tang, Lele Xie, Lianwen Jin, Jiapeng Wang, Jingdong Chen, Zhen Xu, Qianying Wang, Yaqiang Wu, Hui Li

Through key-value matching based on relevancy evaluation, the proposed MatchVIE can bypass the recognitions to various semantics, and simply focuses on the strong relevancy between entities.

Paper
Add Code

Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences

no code implementations • 20 Jun 2021 • Jiapeng Wang, Tianwei Wang, Guozhi Tang, Lianwen Jin, Weihong Ma, Kai Ding, Yichao Huang

Visual information extraction (VIE) has attracted increasing attention in recent years.

Optical Character Recognition Optical Character Recognition (OCR) +2

Paper
Add Code

Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

1 code implementation • 24 Jan 2021 • Jiapeng Wang, Chongyu Liu, Lianwen Jin, Guozhi Tang, Jiaxin Zhang, Shuaitao Zhang, Qianying Wang, Yaqiang Wu, Mingxiang Cai

Visual information extraction (VIE) has attracted considerable attention recently owing to its various advanced applications such as document understanding, automatic marking and intelligent education.

3D Feature Matching document understanding +2

Paper
Code

Joint Layout Analysis, Character Detection and Recognition for Historical Document Digitization

1 code implementation • 14 Jul 2020 • Weihong Ma, Hesuo Zhang, Lianwen Jin, Sihang Wu, Jiapeng Wang, Yongpan Wang

In this framework, two branches named character branch and layout branch are added behind the feature extraction network.

Line Detection

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.