1 code implementation • 15 Apr 2024 • Haojian Huang, Xiaozhen Qiao, Zhuo Chen, Haodong Chen, Bingyu Li, Zhe Sun, Mulin Chen, Xuelong Li
Zero-shot learning (ZSL) enables the recognition of novel classes by leveraging semantic knowledge transfer from known to unknown categories.
no code implementations • 25 Feb 2024 • Mulin Chen, Bocheng Wang, Xuelong Li
Graph Convolutional Network (GCN) has exhibited remarkable potential in improving graph-based clustering.
no code implementations • 17 Nov 2023 • Chuang Yang, Kai Zhuang, Mulin Chen, Haozhao Ma, Xu Han, Tao Han, Changxing Guo, Han Han, Bingxuan Zhao, Qi Wang
Following the above issues, we propose a traffic sign interpretation (TSI) task, which aims to interpret global semantic interrelated traffic signs (e. g.,~driving instruction-related texts, symbols, and guide panels) into a natural language for providing accurate instruction support to autonomous or assistant driving.
no code implementations • CVPR 2023 • Weichuang Li, Longhao Zhang, Dong Wang, Bin Zhao, Zhigang Wang, Mulin Chen, Bang Zhang, Zhongjian Wang, Liefeng Bo, Xuelong Li
Talking head generation aims to generate faces that maintain the identity information of the source image and imitate the motion of the driving image.
no code implementations • CVPR 2023 • Yihao Wang, Zhigang Wang, Bin Zhao, Dong Wang, Mulin Chen, Xuelong Li
In contrast, we propose a purely passive method to track a person walking in an invisible room by only observing a relay wall, which is more in line with real application scenarios, e. g., security.
1 code implementation • CVPR 2023 • Haozhe Si, Bin Zhao, Dong Wang, Yunpeng Gao, Mulin Chen, Zhigang Wang, Xuelong Li
We show that our framework circumvents the needs for the depth and AIF image ground-truth, and receives superior predictions, thus closing the gap between the theoretical success of DFD works and their applications in the real world.
no code implementations • 18 Nov 2021 • Chuang Yang, Mulin Chen, Yuan Yuan, Qi Wang, Xuelong Li
It weakens the coupling of texts to shrink-masks, which improves the robustness of detection results.
no code implementations • 12 May 2021 • Chuang Yang, Mulin Chen, Yuan Yuan, Qi Wang
Text detection, the key technology for understanding scene text, has become an attractive research topic.
no code implementations • 24 Apr 2021 • Qi Wang, Yanling Miao, Mulin Chen, Xuelong Li
In order to better handle the high dimensionality problem and preserve the spatial structures, this paper proposes a novel unsupervised approach called spatial-spectral clustering with anchor graph (SSCAG) for HSI data clustering.
no code implementations • 11 Apr 2021 • Qi Wang, Xu Jiang, Mulin Chen, Xuelong Li
In this paper, we focus on the unsupervised multi-view feature selection which tries to handle high dimensional data in the field of multi-view learning.
no code implementations • 11 Apr 2021 • Chuang Yang, Mulin Chen, Yuan Yuan, Qi Wang
Specifically, a new text representation strategy is proposed to represent text contours from a top-down perspective, which can fit highly curved text contours effectively.
no code implementations • 25 Mar 2021 • Yanling Miao, Qi Wang, Mulin Chen, Xuelong Li
Graph-based semi-supervised learning methods, which deal well with the situation of limited labeled data, have shown dominant performance in practical applications.
no code implementations • 24 Mar 2021 • Mulin Chen, Xuelong Li
Considering that the outliers are usually much less than the normal samples, a new entropy loss function is established for matrix factorization, which minimizes the entropy of the residue distribution and allows a few samples to have large approximation errors.
no code implementations • 24 Mar 2021 • Mulin Chen, Maoguo Gong, Xuelong Li
Non-negative Matrix Factorization (NMF) is one of the most popular techniques for data representation and clustering, and has been widely used in machine learning and data analysis.
no code implementations • 17 Mar 2021 • Bo Wei, Mulin Chen, Qi Wang, Xuelong Li
To obtain the accurate supervision information of different channels, the MDSNet employs an auxiliary network called SupervisionNet (SN) to generate abundant supervision maps based on existing groundtruth.
no code implementations • 30 Nov 2020 • Chuang Yang, Mulin Chen, Zhitong Xiong, Yuan Yuan, Qi Wang
Extensive experiments demonstrate the proposed CM is efficient and robust to fit arbitrary-shaped text instances, and also validate the effectiveness of MPF and constraints loss for discriminative text features recognition.