Search Results for author: Zhanzhan Cheng

Found 34 papers, 15 papers with code

Read Extensively, Focus Smartly: A Cross-document Semantic Enhancement Method for Visual Documents NER

no code implementations • COLING 2022 • Jun Zhao, Xin Zhao, WenYu Zhan, Tao Gui, Qi Zhang, Liang Qiao, Zhanzhan Cheng, ShiLiang Pu

To deal with this problem, this work proposes a cross-document semantic enhancement method, which consists of two modules: 1) To prevent distractions from irrelevant regions in the current document, we design a learnable attention mask mechanism, which is used to adaptively filter redundant information in the current document.

NER

Paper
Add Code

Flooding-X: Improving BERT’s Resistance to Adversarial Attacks via Loss-Restricted Fine-Tuning

2 code implementations • ACL 2022 • Qin Liu, Rui Zheng, Bao Rong, Jingyi Liu, Zhihua Liu, Zhanzhan Cheng, Liang Qiao, Tao Gui, Qi Zhang, Xuanjing Huang

Adversarial robustness has attracted much attention recently, and the mainstream solution is adversarial training.

Adversarial Robustness text-classification +1

Paper
Code

Few-Shot Class-Incremental Learning via Class-Aware Bilateral Distillation

1 code implementation • CVPR 2023 • Linglan Zhao, Jing Lu, Yunlu Xu, Zhanzhan Cheng, Dashan Guo, Yi Niu, Xiangzhong Fang

While knowledge distillation, a prevailing technique in CIL, can alleviate the catastrophic forgetting of older classes by regularizing outputs between current and previous model, it fails to consider the overfitting risk of novel classes in FSCIL.

Few-Shot Class-Incremental Learning General Knowledge +3

Paper
Code

HyperMatch: Noise-Tolerant Semi-Supervised Learning via Relaxed Contrastive Constraint

no code implementations • CVPR 2023 • Beitong Zhou, Jing Lu, Kerui Liu, Yunlu Xu, Zhanzhan Cheng, Yi Niu

Recent developments of the application of Contrastive Learning in Semi-Supervised Learning (SSL) have demonstrated significant advancements, as a result of its exceptional ability to learn class-aware cluster representations and the full exploitation of massive unlabeled data.

Contrastive Learning

Paper
Add Code

Distilling Object Detectors With Global Knowledge

1 code implementation • 17 Oct 2022 • Sanli Tang, Zhongyu Zhang, Zhanzhan Cheng, Jing Lu, Yunlu Xu, Yi Niu, Fan He

Then, a robust distilling module (RDM) is applied to construct the global knowledge based on the prototypes and filtrate noisy global and local knowledge by measuring the discrepancy of the representations in two feature spaces.

Knowledge Distillation Object +2

Paper
Code

Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting

1 code implementation • 14 Jul 2022 • Ying Chen, Liang Qiao, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Xi Li

In this paper, to address this problem, we propose a novel cost-efficient Dynamic Low-resolution Distillation (DLD) text spotting framework, which aims to infer images in different small but recognizable resolutions and achieve a better balance between accuracy and efficiency.

Knowledge Distillation Optical Character Recognition (OCR) +1

708

Paper
Code

DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding

1 code implementation • 14 Jul 2022 • Liang Qiao, Hui Jiang, Ying Chen, Can Li, Pengfei Li, Zaisheng Li, Baorui Zou, Dashan Guo, Yingda Xu, Yunlu Xu, Zhanzhan Cheng, Yi Niu

Compared with the previous opensource OCR toolbox, DavarOCR has relatively more complete support for the sub-tasks of the cutting-edge technology of document understanding.

document understanding Optical Character Recognition (OCR)

708

Paper
Code

TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents

no code implementations • 14 Jul 2022 • Zhanzhan Cheng, Peng Zhang, Can Li, Qiao Liang, Yunlu Xu, Pengfei Li, ShiLiang Pu, Yi Niu, Fei Wu

Most existing methods divide this task into two subparts: the text reading part for obtaining the plain text from the original document images and the information extraction part for extracting key contents.

Language Modelling

Paper
Add Code

E2-AEN: End-to-End Incremental Learning with Adaptively Expandable Network

no code implementations • 14 Jul 2022 • Guimei Cao, Zhanzhan Cheng, Yunlu Xu, Duo Li, ShiLiang Pu, Yi Niu, Fei Wu

In this paper, we propose an end-to-end trainable adaptively expandable network named E2-AEN, which dynamically generates lightweight structures for new tasks without any accuracy drop in previous tasks.

Incremental Learning

Paper
Add Code

MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective

2 code implementations • ACL 2022 • Xiao Wang, Shihan Dou, Limao Xiong, Yicheng Zou, Qi Zhang, Tao Gui, Liang Qiao, Zhanzhan Cheng, Xuanjing Huang

NER model has achieved promising performance on standard NER benchmarks.

Ranked #8 on Named Entity Recognition (NER) on WNUT 2017

named-entity-recognition Named Entity Recognition +1

Paper
Code

PMAL: Open Set Recognition via Robust Prototype Mining

no code implementations • 16 Mar 2022 • Jing Lu, Yunxu Xu, Hao Li, Zhanzhan Cheng, Yi Niu

Accordingly, the embedding space can be better optimized to discriminate therein the predefined classes and between known and unknowns.

Open Set Learning

Paper
Add Code

Technical Report for ICCV 2021 Challenge SSLAD-Track3B: Transformers Are Better Continual Learners

no code implementations • 13 Jan 2022 • Duo Li, Guimei Cao, Yunlu Xu, Zhanzhan Cheng, Yi Niu

In the SSLAD-Track 3B challenge on continual learning, we propose the method of COntinual Learning with Transformer (COLT).

Continual Learning Knowledge Distillation +2

Paper
Add Code

STEP: Out-of-Distribution Detection in the Presence of Limited In-Distribution Labeled Data

no code implementations • NeurIPS 2021 • Zhi Zhou, Lan-Zhe Guo, Zhanzhan Cheng, Yu-Feng Li, ShiLiang Pu

However, in many real-world applications, it is desirable to have SSL algorithms that not only classify the samples drawn from the same distribution of labeled data but also detect out-of-distribution (OOD) samples drawn from an unknown distribution.

Out-of-Distribution Detection Out of Distribution (OOD) Detection

Paper
Add Code

A Strong Baseline for Semi-Supervised Incremental Few-Shot Learning

no code implementations • 21 Oct 2021 • Linlan Zhao, Dashan Guo, Yunlu Xu, Liang Qiao, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Xiangzhong Fang

Few-shot learning (FSL) aims to learn models that generalize to novel classes with limited training samples.

Few-Shot Learning

Paper
Add Code

ICDAR 2021 Competition on Scene Video Text Spotting

no code implementations • 26 Jul 2021 • Zhanzhan Cheng, Jing Lu, Baorui Zou, Shuigeng Zhou, Fei Wu

During the competition period (opened on 1st March, 2021 and closed on 11th April, 2021), a total of 24 teams participated in the three proposed tasks with 46 valid submissions, respectively.

Task 2 Text Detection +2

Paper
Add Code

VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations

1 code implementation • 13 May 2021 • Peng Zhang, Can Li, Liang Qiao, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Fei Wu

To address the above limitations, we propose a unified framework VSR for document layout analysis, combining vision, semantics and relations.

Ranked #3 on Document Layout Analysis on PubLayNet val

Document Layout Analysis Relation

708

Paper
Code

Reciprocal Feature Learning via Explicit and Implicit Tasks in Scene Text Recognition

1 code implementation • 13 May 2021 • Hui Jiang, Yunlu Xu, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Wenqi Ren, Fei Wu, Wenming Tan

In this work, we excavate the implicit task, character counting within the traditional text recognition, without additional labor annotation cost.

Optical Character Recognition (OCR) Scene Text Recognition

708

Paper
Code

LGPMA: Complicated Table Structure Recognition with Local and Global Pyramid Mask Alignment

1 code implementation • 13 May 2021 • Liang Qiao, Zaisheng Li, Zhanzhan Cheng, Peng Zhang, ShiLiang Pu, Yi Niu, Wenqi Ren, Wenming Tan, Fei Wu

In this paper, we aim to obtain more reliable aligned bounding boxes by fully utilizing the visual information from both text regions in proposed local features and cell relations in global features.

Ranked #7 on Table Recognition on PubTabNet

Table Recognition

708

Paper
Code

Rethinking Pseudo-labeled Sample Mining for Semi-Supervised Object Detection

no code implementations • 1 Jan 2021 • Duo Li, Sanli Tang, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Wenming Tan, Fei Wu, Xiaokang Yang

However, the impact of the pseudo-labeled samples' quality as well as the mining strategies for high quality training sample have rarely been studied in SSL.

object-detection Object Detection +1

Paper
Add Code

MANGO: A Mask Attention Guided One-Stage Scene Text Spotter

1 code implementation • 8 Dec 2020 • Liang Qiao, Ying Chen, Zhanzhan Cheng, Yunlu Xu, Yi Niu, ShiLiang Pu, Fei Wu

Recently end-to-end scene text spotting has become a popular research topic due to its advantages of global optimization and high maintainability in real applications.

Ranked #6 on Text Spotting on SCUT-CTW1500

Position Text Spotting

708

Paper
Code

Learning a Domain Classifier Bank for Unsupervised Adaptive Object Detection

no code implementations • 6 Jul 2020 • Sanli Tang, Zhanzhan Cheng, ShiLiang Pu, Dashan Guo, Yi Niu, Fei Wu

To tackle this issue, we develop a fine-grained domain alignment approach with a well-designed domain classifier bank that achieves the instance-level alignment respecting to their categories.

Object object-detection +1

Paper
Add Code

Text Recognition in Real Scenarios with a Few Labeled Samples

no code implementations • 22 Jun 2020 • Jinghuang Lin, Zhanzhan Cheng, Fan Bai, Yi Niu, ShiLiang Pu, Shuigeng Zhou

Scene text recognition (STR) is still a hot research topic in computer vision field due to its various applications.

Domain Adaptation Scene Text Recognition

Paper
Add Code

TRIE: End-to-End Text Reading and Information Extraction for Document Understanding

1 code implementation • 27 May 2020 • Peng Zhang, Yunlu Xu, Zhanzhan Cheng, ShiLiang Pu, Jing Lu, Liang Qiao, Yi Niu, Fei Wu

Since real-world ubiquitous documents (e. g., invoices, tickets, resumes and leaflets) contain rich information, automatic document image understanding has become a hot topic.

document understanding

708

Paper
Code

Object-QA: Towards High Reliable Object Quality Assessment

no code implementations • 27 May 2020 • Jing Lu, Baorui Zou, Zhanzhan Cheng, ShiLiang Pu, Shuigeng Zhou, Yi Niu, Fei Wu

In this paper, we define the problem of object quality assessment for the first time and propose an effective approach named Object-QA to assess high-reliable quality scores for object images.

Object Object Recognition +1

Paper
Add Code

SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition

3 code implementations • 27 May 2020 • Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Fei Wu, Futai Zou

Arbitrary text appearance poses a great challenge in scene text recognition tasks.

Color Manipulation Scene Text Recognition

38,418

Paper
Code

Refined Gate: A Simple and Effective Gating Mechanism for Recurrent Units

no code implementations • 26 Feb 2020 • Zhanzhan Cheng, Yunlu Xu, Mingjian Cheng, Yu Qiao, ShiLiang Pu, Yi Niu, Fei Wu

Recurrent neural network (RNN) has been widely studied in sequence learning tasks, while the mainstream models (e. g., LSTM and GRU) rely on the gating mechanism (in control of how information flows between hidden states).

Language Modelling Scene Text Recognition

Paper
Add Code

Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting

1 code implementation • 17 Feb 2020 • Liang Qiao, Sanli Tang, Zhanzhan Cheng, Yunlu Xu, Yi Niu, ShiLiang Pu, Fei Wu

Many approaches have recently been proposed to detect irregular scene text and achieved promising results.

Ranked #8 on Text Spotting on SCUT-CTW1500

Text Detection Text Spotting

708

Paper
Code

Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization

no code implementations • 7 Aug 2019 • Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, Yi Niu, ShiLiang Pu, Fei Wu, Futai Zou

The second module is a specific classifier for mining trivial or incomplete action regions, which is trained on the shared features after erasing the seeded regions activated by SSG.

Action Detection Weakly-supervised Temporal Action Localization +1

Paper
Add Code

REAPS: Towards Better Recognition of Fine-grained Images by Region Attending and Part Sequencing

no code implementations • 6 Aug 2019 • Peng Zhang, Xinyu Zhu, Zhanzhan Cheng, Shuigeng Zhou, Yi Niu

Fine-grained image recognition has been a hot research topic in computer vision due to its various applications.

Fine-Grained Image Recognition

Paper
Add Code

You Only Recognize Once: Towards Fast Video Text Spotting

1 code implementation • 8 Mar 2019 • Zhanzhan Cheng, Jing Lu, Yi Niu, ShiLiang Pu, Fei Wu, Shuigeng Zhou

Video text spotting is still an important research topic due to its various real-applications.

Text Detection Text Spotting

708

Paper
Code

Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection

no code implementations • 19 Nov 2018 • Yunlu Xu, Chengwei Zhang, Zhanzhan Cheng, Jianwen Xie, Yi Niu, ShiLiang Pu, Fei Wu

Finally, we transform the output of recurrent neural network into the corresponding action distribution.

Multiple Action Detection

Paper
Add Code

Edit Probability for Scene Text Recognition

no code implementations • CVPR 2018 • Fan Bai, Zhanzhan Cheng, Yi Niu, ShiLiang Pu, Shuigeng Zhou

The advantage lies in that the training process can focus on the missing, superfluous and unrecognized characters, and thus the impact of the misalignment problem can be alleviated or even overcome.

Scene Text Recognition

Paper
Add Code

AON: Towards Arbitrarily-Oriented Text Recognition

1 code implementation • CVPR 2018 • Zhanzhan Cheng, Yangliu Xu, Fan Bai, Yi Niu, ShiLiang Pu, Shuigeng Zhou

Existing methods on text recognition mainly work with regular (horizontal and frontal) texts and cannot be trivially generalized to handle irregular texts.

Ranked #9 on Scene Text Recognition on ICDAR 2003

Optical Character Recognition Optical Character Recognition (OCR) +1

109

Paper
Code

Focusing Attention: Towards Accurate Text Recognition in Natural Images

no code implementations • ICCV 2017 • Zhanzhan Cheng, Fan Bai, Yunlu Xu, Gang Zheng, ShiLiang Pu, Shuigeng Zhou

FAN consists of two major components: an attention network (AN) that is responsible for recognizing character targets as in the existing methods, and a focusing network (FN) that is responsible for adjusting attention by evaluating whether AN pays attention properly on the target areas in the images.

Scene Text Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.