no code implementations • 8 Apr 2024 • Xiahan Chen, Mingjian Chen, Sanli Tang, Yi Niu, Jiang Zhu
3D object detection based on roadside cameras is an additional way for autonomous driving to alleviate the challenges of occlusion and short perception range from vehicle cameras.
no code implementations • 19 Sep 2023 • Chang Liu, Yi Niu, Mingming Ma, Fu Li, Guangming Shi
The basic principle of the patch-matching based style transfer is to substitute the patches of the content image feature maps by the closest patches from the style image feature maps.
no code implementations • 19 Sep 2023 • Chang Liu, Giulia Rizzoli, Francesco Barbato, Andrea Maracani, Marco Toldo, Umberto Michieli, Yi Niu, Pietro Zanuttigh
Catastrophic forgetting of previous knowledge is a critical issue in continual learning typically handled through various regularization strategies.
1 code implementation • CVPR 2023 • Linglan Zhao, Jing Lu, Yunlu Xu, Zhanzhan Cheng, Dashan Guo, Yi Niu, Xiangzhong Fang
While knowledge distillation, a prevailing technique in CIL, can alleviate the catastrophic forgetting of older classes by regularizing outputs between current and previous model, it fails to consider the overfitting risk of novel classes in FSCIL.
no code implementations • CVPR 2023 • Beitong Zhou, Jing Lu, Kerui Liu, Yunlu Xu, Zhanzhan Cheng, Yi Niu
Recent developments of the application of Contrastive Learning in Semi-Supervised Learning (SSL) have demonstrated significant advancements, as a result of its exceptional ability to learn class-aware cluster representations and the full exploitation of massive unlabeled data.
1 code implementation • 17 Oct 2022 • Sanli Tang, Zhongyu Zhang, Zhanzhan Cheng, Jing Lu, Yunlu Xu, Yi Niu, Fan He
Then, a robust distilling module (RDM) is applied to construct the global knowledge based on the prototypes and filtrate noisy global and local knowledge by measuring the discrepancy of the representations in two feature spaces.
no code implementations • 14 Jul 2022 • Zhanzhan Cheng, Peng Zhang, Can Li, Qiao Liang, Yunlu Xu, Pengfei Li, ShiLiang Pu, Yi Niu, Fei Wu
Most existing methods divide this task into two subparts: the text reading part for obtaining the plain text from the original document images and the information extraction part for extracting key contents.
1 code implementation • 14 Jul 2022 • Ying Chen, Liang Qiao, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Xi Li
In this paper, to address this problem, we propose a novel cost-efficient Dynamic Low-resolution Distillation (DLD) text spotting framework, which aims to infer images in different small but recognizable resolutions and achieve a better balance between accuracy and efficiency.
Knowledge Distillation Optical Character Recognition (OCR) +1
1 code implementation • 14 Jul 2022 • Liang Qiao, Hui Jiang, Ying Chen, Can Li, Pengfei Li, Zaisheng Li, Baorui Zou, Dashan Guo, Yingda Xu, Yunlu Xu, Zhanzhan Cheng, Yi Niu
Compared with the previous opensource OCR toolbox, DavarOCR has relatively more complete support for the sub-tasks of the cutting-edge technology of document understanding.
no code implementations • 14 Jul 2022 • Guimei Cao, Zhanzhan Cheng, Yunlu Xu, Duo Li, ShiLiang Pu, Yi Niu, Fei Wu
In this paper, we propose an end-to-end trainable adaptively expandable network named E2-AEN, which dynamically generates lightweight structures for new tasks without any accuracy drop in previous tasks.
1 code implementation • 12 Apr 2022 • Yang Li, Ji Chen, Fu Li, Boxun Fu, Hao Wu, Youshuo Ji, Yijin Zhou, Yi Niu, Guangming Shi, Wenming Zheng
GMSS has the ability to learn more general representations by integrating multiple self-supervised tasks, including spatial and frequency jigsaw puzzle tasks, and contrastive learning tasks.
no code implementations • 16 Mar 2022 • Jing Lu, Yunxu Xu, Hao Li, Zhanzhan Cheng, Yi Niu
Accordingly, the embedding space can be better optimized to discriminate therein the predefined classes and between known and unknowns.
no code implementations • 13 Jan 2022 • Duo Li, Guimei Cao, Yunlu Xu, Zhanzhan Cheng, Yi Niu
In the SSLAD-Track 3B challenge on continual learning, we propose the method of COntinual Learning with Transformer (COLT).
no code implementations • 21 Oct 2021 • Linlan Zhao, Dashan Guo, Yunlu Xu, Liang Qiao, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Xiangzhong Fang
Few-shot learning (FSL) aims to learn models that generalize to novel classes with limited training samples.
1 code implementation • 13 May 2021 • Peng Zhang, Can Li, Liang Qiao, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Fei Wu
To address the above limitations, we propose a unified framework VSR for document layout analysis, combining vision, semantics and relations.
Ranked #3 on Document Layout Analysis on PubLayNet val
1 code implementation • 13 May 2021 • Liang Qiao, Zaisheng Li, Zhanzhan Cheng, Peng Zhang, ShiLiang Pu, Yi Niu, Wenqi Ren, Wenming Tan, Fei Wu
In this paper, we aim to obtain more reliable aligned bounding boxes by fully utilizing the visual information from both text regions in proposed local features and cell relations in global features.
Ranked #7 on Table Recognition on PubTabNet
1 code implementation • 13 May 2021 • Hui Jiang, Yunlu Xu, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Wenqi Ren, Fei Wu, Wenming Tan
In this work, we excavate the implicit task, character counting within the traditional text recognition, without additional labor annotation cost.
no code implementations • 8 Apr 2021 • Qiyao Wang, Pengfei Li, Li Zhu, Yi Niu
For the text spotting task, we detect the characters on integrated circuit and classify them based on yolov5 detection model.
no code implementations • 1 Jan 2021 • Duo Li, Sanli Tang, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Wenming Tan, Fei Wu, Xiaokang Yang
However, the impact of the pseudo-labeled samples' quality as well as the mining strategies for high quality training sample have rarely been studied in SSL.
1 code implementation • 8 Dec 2020 • Liang Qiao, Ying Chen, Zhanzhan Cheng, Yunlu Xu, Yi Niu, ShiLiang Pu, Fei Wu
Recently end-to-end scene text spotting has become a popular research topic due to its advantages of global optimization and high maintainability in real applications.
Ranked #6 on Text Spotting on SCUT-CTW1500
no code implementations • 16 Nov 2020 • Yang Jiao, Yi Niu, Trac D. Tran, Guangming Shi
In 2D+3D facial expression recognition (FER), existing methods generate multi-view geometry maps to enhance the depth feature representation.
3D Facial Expression Recognition Facial Expression Recognition
no code implementations • 6 Jul 2020 • Sanli Tang, Zhanzhan Cheng, ShiLiang Pu, Dashan Guo, Yi Niu, Fei Wu
To tackle this issue, we develop a fine-grained domain alignment approach with a well-designed domain classifier bank that achieves the instance-level alignment respecting to their categories.
no code implementations • 22 Jun 2020 • Jinghuang Lin, Zhanzhan Cheng, Fan Bai, Yi Niu, ShiLiang Pu, Shuigeng Zhou
Scene text recognition (STR) is still a hot research topic in computer vision field due to its various applications.
no code implementations • 27 May 2020 • Jing Lu, Baorui Zou, Zhanzhan Cheng, ShiLiang Pu, Shuigeng Zhou, Yi Niu, Fei Wu
In this paper, we define the problem of object quality assessment for the first time and propose an effective approach named Object-QA to assess high-reliable quality scores for object images.
1 code implementation • 27 May 2020 • Peng Zhang, Yunlu Xu, Zhanzhan Cheng, ShiLiang Pu, Jing Lu, Liang Qiao, Yi Niu, Fei Wu
Since real-world ubiquitous documents (e. g., invoices, tickets, resumes and leaflets) contain rich information, automatic document image understanding has become a hot topic.
3 code implementations • 27 May 2020 • Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, ShiLiang Pu, Yi Niu, Fei Wu, Futai Zou
Arbitrary text appearance poses a great challenge in scene text recognition tasks.
no code implementations • 26 Feb 2020 • Zhanzhan Cheng, Yunlu Xu, Mingjian Cheng, Yu Qiao, ShiLiang Pu, Yi Niu, Fei Wu
Recurrent neural network (RNN) has been widely studied in sequence learning tasks, while the mainstream models (e. g., LSTM and GRU) rely on the gating mechanism (in control of how information flows between hidden states).
1 code implementation • 17 Feb 2020 • Liang Qiao, Sanli Tang, Zhanzhan Cheng, Yunlu Xu, Yi Niu, ShiLiang Pu, Fei Wu
Many approaches have recently been proposed to detect irregular scene text and achieved promising results.
Ranked #8 on Text Spotting on SCUT-CTW1500
no code implementations • 7 Aug 2019 • Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, Yi Niu, ShiLiang Pu, Fei Wu, Futai Zou
The second module is a specific classifier for mining trivial or incomplete action regions, which is trained on the shared features after erasing the seeded regions activated by SSG.
Action Detection Weakly-supervised Temporal Action Localization +1
no code implementations • 6 Aug 2019 • Peng Zhang, Xinyu Zhu, Zhanzhan Cheng, Shuigeng Zhou, Yi Niu
Fine-grained image recognition has been a hot research topic in computer vision due to its various applications.
1 code implementation • 8 Mar 2019 • Zhanzhan Cheng, Jing Lu, Yi Niu, ShiLiang Pu, Fei Wu, Shuigeng Zhou
Video text spotting is still an important research topic due to its various real-applications.
no code implementations • 19 Nov 2018 • Yunlu Xu, Chengwei Zhang, Zhanzhan Cheng, Jianwen Xie, Yi Niu, ShiLiang Pu, Fei Wu
Finally, we transform the output of recurrent neural network into the corresponding action distribution.
no code implementations • CVPR 2018 • Fan Bai, Zhanzhan Cheng, Yi Niu, ShiLiang Pu, Shuigeng Zhou
The advantage lies in that the training process can focus on the missing, superfluous and unrecognized characters, and thus the impact of the misalignment problem can be alleviated or even overcome.
1 code implementation • CVPR 2018 • Zhanzhan Cheng, Yangliu Xu, Fan Bai, Yi Niu, ShiLiang Pu, Shuigeng Zhou
Existing methods on text recognition mainly work with regular (horizontal and frontal) texts and cannot be trivially generalized to handle irregular texts.
Ranked #9 on Scene Text Recognition on ICDAR 2003
Optical Character Recognition Optical Character Recognition (OCR) +1