1 code implementation • 12 Sep 2024 • Fuchen Zheng, Quanjun Li, Weixuan Li, Xuhang Chen, Yihang Dong, Guoheng Huang, Chi-Man Pun, Shoujun Zhou
Overall, our results indicate that CMAformer, combined with the feature fusion framework and the new consistency loss, demonstrates strong complementarity in semi-supervised learning ensembles.
1 code implementation • 12 Sep 2024 • Fuchen Zheng, Xinyi Chen, Xuhang Chen, Haolun Li, Xiaojiao Guo, Guoheng Huang, Chi-Man Pun, Shoujun Zhou
To address this limitation, we propose the Adaptive Semantic Segmentation Network (ASSNet), a transformer architecture that effectively integrates local and global features for precise medical image segmentation.
1 code implementation • 2 Sep 2024 • Kangdao Liu, Tianhao Sun, Hao Zeng, Yongshan Zhang, Chi-Man Pun, Chi-Man Vong
Hyperspectral image (HSI) classification involves assigning specific labels to each pixel to identify various land cover categories.
1 code implementation • 31 Aug 2024 • Fuchen Zheng, Xuhang Chen, Weihuang Liu, Haolun Li, Yingtie Lei, Jiahui He, Chi-Man Pun, Shoujun Zhou
First, a Synergistic Multi-Attention (SMA) Transformer block is proposed, which has the benefits of Pixel Attention, Channel Attention, and Spatial Attention for feature enrichment.
1 code implementation • 20 Aug 2024 • Yingtie Lei, Jia Yu, Yihang Dong, Changwei Gong, Ziyang Zhou, Chi-Man Pun
By explicitly incorporating color priors and modeling the physical characteristics of underwater image formation, the proposed DUN model achieves more accurate and reliable enhancement results.
1 code implementation • 17 Jul 2024 • Xiaojiao Guo, Xuhang Chen, Shenghong Luo, Shuqiang Wang, Chi-Man Pun
Specular highlight removal plays a pivotal role in multimedia applications, as it enhances the quality and interpretability of images and videos, ultimately improving the performance of downstream tasks such as content-based retrieval, object recognition, and scene understanding.
1 code implementation • 26 Jun 2024 • Yiguo Jiang, Xuhang Chen, Chi-Man Pun, Shuqiang Wang, Wei Feng
The low-frequency part typically contains illumination information, while the high-frequency part contains detailed content information.
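The low/high-frequency split described in this entry can be illustrated with a minimal sketch. It assumes a simple box blur as the low-pass filter, which is a stand-in chosen for illustration and not necessarily the decomposition the paper uses; the function names `box_blur` and `split_frequencies` are hypothetical.

```python
def box_blur(img, radius=1):
    """Low-pass filter: mean over a (2r+1)x(2r+1) window, clipped at the borders."""
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            ys = range(max(0, y - radius), min(h, y + radius + 1))
            xs = range(max(0, x - radius), min(w, x + radius + 1))
            vals = [img[j][i] for j in ys for i in xs]
            out[y][x] = sum(vals) / len(vals)
    return out


def split_frequencies(img, radius=1):
    """Return (low, high) with low = blurred image and high = img - low."""
    low = box_blur(img, radius)
    high = [[img[y][x] - low[y][x] for x in range(len(img[0]))]
            for y in range(len(img))]
    return low, high


# Toy image (assumed data): a smooth brightness ramp standing in for
# illumination, plus one bright pixel standing in for fine detail.
image = [[10.0 * x for x in range(5)] for _ in range(5)]
image[2][2] += 50.0

low, high = split_frequencies(image)
```

In this toy case the smooth ramp ends up mostly in `low`, while the isolated bright pixel dominates `high`; adding the two parts back together recovers the input exactly.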
1 code implementation • 15 Jun 2024 • Xiaochen Ma, Xuekang Zhu, Lei Su, Bo Du, Zhuohang Jiang, Bingkui Tong, Zeyu Lei, Xinyu Yang, Chi-Man Pun, Jiancheng Lv, Jizhe Zhou
A comprehensive benchmark is yet to be established in the Image Manipulation Detection & Localization (IMDL) field.
1 code implementation • 5 Jun 2024 • Xiuli Bi, Zonglin Yang, Bo Liu, Xiaodong Cun, Chi-Man Pun, Pietro Lio, Bin Xiao
In this work, we suppose that adversarial images are outliers of the natural image manifold and the purification process can be considered as returning them to this manifold.
no code implementations • Proceedings of the AAAI Conference on Artificial Intelligence 2024 • Zewen Zheng, Xuemin Zhang, Yongqiang Mou, Xiang Gao, Chengxin Li, Guoheng Huang, Chi-Man Pun, Xiaochen Yuan
Monocular 3D lane detection is essential for a reliable autonomous driving system and has recently been rapidly developing.
Ranked #1 on 3D Lane Detection on OpenLane
1 code implementation • CVPR 2024 • Weihuang Liu, Xi Shen, Haolun Li, Xiuli Bi, Bo Liu, Chi-Man Pun, Xiaodong Cun
In this work, we introduce a test-time training (TTT) strategy to address the problem.
no code implementations • 2 Feb 2024 • Guanwen Feng, Haoran Cheng, Yunan Li, Zhiyuan Ma, Chaoneng Li, Zhihao Qian, Qiguang Miao, Chi-Man Pun
Additionally, we propose an emotion intensity control method using a fine-grained emotion matrix.
1 code implementation • 30 Dec 2023 • Lianyu Hu, Liqing Gao, Zekang Liu, Chi-Man Pun, Wei Feng
First, the prompts of the vision and language branches in these methods are usually separated or uni-directionally correlated.
1 code implementation • 4 Dec 2023 • Jie Wang, Jiu-Cheng Xie, Xianyan Li, Feng Xu, Chi-Man Pun, Hao Gao
Constructing vivid 3D head avatars for given subjects and realizing a series of animations on them is valuable yet challenging.
1 code implementation • 26 Nov 2023 • Yudian Zheng, Xiaodong Cun, Menghan Xia, Chi-Man Pun
Understanding semantic intricacies and high-level concepts is essential in image sketch generation, and this challenge becomes even more formidable when applied to the domain of videos.
no code implementations • 14 Nov 2023 • Wenyun Li, Chi-Man Pun
Glaucoma is a chronic neurodegenerative condition that can lead to blindness.
1 code implementation • 31 Oct 2023 • Weiwen Chen, Yingtie Lei, Shenghong Luo, Ziyang Zhou, Mingxian Li, Chi-Man Pun
Underwater images often exhibit poor quality, distorted color balance and low contrast due to the complex and intricate interplay of light, water, and objects.
no code implementations • 10 Oct 2023 • Xiaochen Ma, Jizhe Zhou, Xiong Xu, Zhuohang Jiang, Chi-Man Pun
While MAE has demonstrated an impressive understanding of object semantics, PMAE can also compensate for low-level semantics with our proposed enhancements.
no code implementations • 4 Oct 2023 • Xuhang Chen, Chi-Man Pun, Shuqiang Wang
Within this framework, we introduce the Prompt Extraction Block and the Prompt Fusion Block to efficiently encode the cross-modal prompt.
1 code implementation • 13 Sep 2023 • Weiwen Chen, Yingtie Lei, Shenghong Luo, Ziyang Zhou, Mingxian Li, Chi-Man Pun
The STD module employs a traditional thresholding technique and leverages the attention mechanism of the Transformer to gather global information, thereby enabling precise detection of shadow masks.
2 code implementations • ICCV 2023 • Zinuo Li, Xuhang Chen, Chi-Man Pun, Xiaodong Cun
We handle high-resolution document shadow removal directly via a larger-scale real-world dataset and a carefully designed frequency-aware network.
1 code implementation • 26 Aug 2023 • Shenghong Luo, Xuhang Chen, Weiwen Chen, Zinuo Li, Shuqiang Wang, Chi-Man Pun
Vignetting commonly occurs as a degradation in images resulting from factors such as lens design, improper lens hood usage, and limitations in camera sensors.
1 code implementation • 18 Aug 2023 • Lin Yuan, Guoheng Huang, Fenghuan Li, Xiaochen Yuan, Chi-Man Pun, Guo Zhong
This module can construct the interaction between different modalities and capture long-range contextual information based on similarity clusters.
1 code implementation • 16 Aug 2023 • Lianyu Hu, Liqing Gao, Zekang Liu, Chi-Man Pun, Wei Feng
Then these features are fed into a policy network to intelligently select a subsequence to process.
Ranked #8 on Sign Language Recognition on CSL-Daily
1 code implementation • 28 Jul 2023 • Ziyang Zhou, Yingtie Lei, Xuhang Chen, Shenghong Luo, Wenjun Zhang, Chi-Man Pun, Zhen Wang
Shadows in scanned documents pose significant challenges for document analysis and recognition tasks due to their negative impact on visual quality and readability.
2 code implementations • 29 May 2023 • Weihuang Liu, Xi Shen, Chi-Man Pun, Xiaodong Cun
We take inspiration from the widely-used pre-training and then prompt tuning protocols in NLP and propose a new visual prompting model, named Explicit Visual Prompting (EVP).
Ranked #2 on Salient Object Detection on DUT-OMRON
no code implementations • 18 May 2023 • Qiankun Zuo, Hao Tian, Chi-Man Pun, Hongfei Wang, Yudong Zhang, Jin Hong
To be specific, the proposed BIGG framework is based on denoising diffusion probabilistic models (DDPM), where each denoising step is modeled as a generative adversarial network (GAN) to progressively translate the noise and conditional fMRI to effective connectivity.
1 code implementation • 12 May 2023 • Zewen Zheng, Guoheng Huang, Xiaochen Yuan, Chi-Man Pun, Hongrui Liu, Wing-Kuen Ling
In this paper, we introduce a quaternion perspective on correlation learning and propose a novel Quaternion-valued Correlation Learning Network (QCLNet), with the aim to alleviate the computational burden of high-dimensional correlation tensor and explore internal latent interaction between query and support images by leveraging operations defined by the established quaternion algebra.
Ranked #20 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)
no code implementations • 10 Apr 2023 • Wenyun Li, Guo Zhong, Xingyu Lu, Chi-Man Pun
This article proposes a multiview hashing method with learnable parameters to retrieve queried images from a large-scale remote sensing dataset.
1 code implementation • CVPR 2023 • Weihuang Liu, Xi Shen, Chi-Man Pun, Xiaodong Cun
Different from previous visual prompting, which is typically a dataset-level implicit embedding, our key insight is to make the tunable parameters focus on the explicit visual content of each individual image, i.e., the features from frozen patch embeddings and the input's high-frequency components.
Ranked #3 on Salient Object Detection on HKU-IS
1 code implementation • 15 Mar 2023 • Weihuang Liu, Xiaodong Cun, Chi-Man Pun, Menghan Xia, Yong Zhang, Jue Wang
Thanks to the proposed structure, we only need to encode the high-resolution image at a relatively low resolution to capture a larger receptive field.
no code implementations • 11 Mar 2023 • Xuhang Chen, Baiying Lei, Chi-Man Pun, Shuqiang Wang
Brain network analysis is essential for the diagnosis of and intervention in Alzheimer's disease (AD).
1 code implementation • 21 Jan 2023 • Zinuo Li, Xuhang Chen, Shuqiang Wang, Chi-Man Pun
In order to facilitate film-based image stylization research, we construct FilmSet, a large-scale and high-quality film style dataset.
no code implementations • 16 Dec 2022 • Zinuo Li, Xuhang Chen, Chi-Man Pun, Shuqiang Wang
Image enhancement is a technique that is frequently utilized in digital image processing.
1 code implementation • 30 Nov 2022 • Xuhang Chen, Xiaodong Cun, Chi-Man Pun, Shuqiang Wang
Shadow removal improves the visual quality and legibility of digital copies of documents.
no code implementations • 26 Jul 2022 • Wenyun Li, Chi-Man Pun
In addition, most of the existing methods choose to use an $n\times n$ similarity matrix for optimization, which makes the memory and computation unaffordable.
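The memory cost of an $n\times n$ similarity matrix can be made concrete with a short back-of-the-envelope sketch. It assumes dense storage with 4-byte (float32) entries; the function name `similarity_matrix_bytes` and the sample sizes are illustrative assumptions, not figures from the paper.

```python
def similarity_matrix_bytes(n, bytes_per_entry=4):
    """Bytes needed to store a dense n x n matrix (float32 entries assumed)."""
    return n * n * bytes_per_entry


# Illustrative dataset sizes (assumed): memory grows quadratically with n.
for n in (10_000, 100_000, 1_000_000):
    gib = similarity_matrix_bytes(n) / 2**30
    print(f"n={n:>9,}: {gib:,.1f} GiB")
```

Because the cost is quadratic, a tenfold increase in the number of samples multiplies the storage by one hundred, which is why a full pairwise matrix quickly becomes impractical for large datasets.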
no code implementations • 23 Jul 2022 • Lizhen Long, Chi-Man Pun
To solve this problem, we introduce a novel arbitrary style transfer method with structure enhancement by combining the global and local loss.
no code implementations • 27 May 2022 • Jingtang Liang, Chi-Man Pun
Our method attempts to bring together corresponding positive and negative samples by maximizing the mutual information between the foreground and background styles, which desirably makes our harmonization network more robust to discriminate the foreground and background style features when harmonizing composite images.
no code implementations • 21 Mar 2022 • Xiaodong Cun, Zhendong Wang, Chi-Man Pun, Jianzhuang Liu, Wengang Zhou, Xu Jia, Houqiang Li
Color constancy aims to restore the constant colors of a scene under different illuminants.
2 code implementations • 13 Sep 2021 • Jingtang Liang, Xiaodong Cun, Chi-Man Pun, Jue Wang
To this end, we propose a novel spatial-separated curve rendering network (S$^2$CRNet) for efficient and high-resolution image harmonization for the first time.
Ranked #12 on Image Harmonization on iHarmony4
no code implementations • 7 Sep 2021 • Guan-Nan Dong, Chi-Man Pun, Zheng Zhang
To this end, we propose a novel deep collaborative multi-modal learning (DCML) method to integrate the underlying information presented in facial properties in an adaptive manner to strengthen the facial details for effective unsupervised kinship verification.
no code implementations • 7 Sep 2021 • Guan-Nan Dong, Chi-Man Pun, Zheng Zhang
Specifically, we take parents and children as a whole to extract the expressive local and non-local features.
no code implementations • 4 Jan 2021 • Jizhe Zhou, Chi-Man Pun
On the video live streaming dataset we collected, FPVLS obtains satisfying accuracy and real-time efficiency, and contains the over-pixelation problem.
no code implementations • 3 Jan 2021 • Jizhe Zhou, Chi-Man Pun, Yu Tong
With the prevalence of live video streaming, an online pixelation method for privacy-sensitive objects is urgently needed.
no code implementations • 3 Jan 2021 • Jizhe Zhou, Chi-Man Pun, Yu Tong
A larger portion of fake news quotes untampered images from other sources with ulterior motives rather than conducting image forgery.
1 code implementation • 13 Dec 2020 • Xiaodong Cun, Chi-Man Pun
Simultaneously, attacking techniques such as watermark removal, which help increase the robustness of watermarks, are also attracting attention from the community.
1 code implementation • ECCV 2020 • Xiaodong Cun, Chi-Man Pun
In detail, we learn the defocus blur from ground truth and the depth distilled from a well-trained depth estimation network at the same time.
5 code implementations • 20 Nov 2019 • Xiaodong Cun, Chi-Man Pun, Cheng Shi
With the help of novel masks or scenes, we enhance the current datasets using synthesized shadow images.
Ranked #4 on Shadow Removal on ISTD
1 code implementation • 15 Jul 2019 • Xiaodong Cun, Chi-Man Pun
Thus, we address the problem of Image Harmonization: Given a spliced image and the mask of the spliced region, we try to harmonize the "style" of the pasted region with the background (non-spliced region).
Ranked #5 on Image Harmonization on HAdobe5k(1024$\times$1024)
no code implementations • 12 Apr 2019 • Yatie Xiao, Chi-Man Pun
Deep neural networks are easily fooled into making high-confidence predictions on adversarial samples.
no code implementations • 26 Mar 2019 • Jizhe Zhou, Chi-Man Pun, YingYu Wang
This paper introduces an algorithm to protect the privacy of individuals in streaming video data by blurring faces such that the faces cannot be reliably recognized.
no code implementations • 1 Feb 2019 • Yatie Xiao, Chi-Man Pun
Deep Neural Networks have achieved remarkable success in computer vision, natural language processing, and audio tasks.
no code implementations • 13 Jan 2019 • Yatie Xiao, Chi-Man Pun, Jizhe Zhou
We focus our attention on the problem of generating gradient-based adversarial perturbations in the image classification domain.
no code implementations • 17 Nov 2017 • Xiaodong Cun, Feng Xu, Chi-Man Pun, Hao Gao
In this paper, we focus on a more challenging and ill-posed problem that is to synthesize novel viewpoints from one single input image.