no code implementations • 17 Nov 2017 • Xiaodong Cun, Feng Xu, Chi-Man Pun, Hao Gao
In this paper, we focus on a more challenging and ill-posed problem: synthesizing novel viewpoints from a single input image.
no code implementations • 13 Jan 2019 • Yatie Xiao, Chi-Man Pun, Jizhe Zhou
We focus our attention on the problem of generating gradient-based adversarial perturbations in the image classification domain.
no code implementations • 1 Feb 2019 • Yatie Xiao, Chi-Man Pun
Deep Neural Networks have achieved remarkable success in computer vision, natural language processing, and audio tasks.
no code implementations • 26 Mar 2019 • Jizhe Zhou, Chi-Man Pun, YingYu Wang
This paper introduces an algorithm to protect the privacy of individuals in streaming video data by blurring faces so that they cannot be reliably recognized.
no code implementations • 12 Apr 2019 • Yatie Xiao, Chi-Man Pun
Deep neural networks are easily fooled into making high-confidence predictions for adversarial samples.
no code implementations • 4 Jan 2021 • Jizhe Zhou, Chi-Man Pun
On the live video streaming dataset we collected, FPVLS achieves satisfactory accuracy and real-time efficiency, and curbs the over-pixelation problem.
no code implementations • 3 Jan 2021 • Jizhe Zhou, Chi-Man Pun, Yu tong
A larger portion of fake news quotes untampered images from other sources with ulterior motives rather than conducting image forgery.
no code implementations • 3 Jan 2021 • Jizhe Zhou, Chi-Man Pun, Yu tong
With the prevalence of live video streaming, establishing an online pixelation method for privacy-sensitive objects has become urgent.
no code implementations • 7 Sep 2021 • Guan-Nan Dong, Chi-Man Pun, Zheng Zhang
To this end, we propose a novel deep collaborative multi-modal learning (DCML) method that adaptively integrates the underlying information in facial properties to strengthen facial details for effective unsupervised kinship verification.
no code implementations • 7 Sep 2021 • Guan-Nan Dong, Chi-Man Pun, Zheng Zhang
Specifically, we take parents and children as a whole to extract the expressive local and non-local features.
no code implementations • 21 Mar 2022 • Xiaodong Cun, Zhendong Wang, Chi-Man Pun, Jianzhuang Liu, Wengang Zhou, Xu Jia, Houqiang Li
Color constancy aims to restore the constant colors of a scene under different illuminants.
no code implementations • 27 May 2022 • Jingtang Liang, Chi-Man Pun
Our method attempts to bring together corresponding positive and negative samples by maximizing the mutual information between the foreground and background styles, which makes our harmonization network more robust at discriminating foreground and background style features when harmonizing composite images.
no code implementations • 23 Jul 2022 • Lizhen Long, Chi-Man Pun
To solve this problem, we introduce a novel arbitrary style transfer method with structure enhancement that combines global and local losses.
no code implementations • 26 Jul 2022 • Wenyun Li, Chi-Man Pun
In addition, most of the existing methods choose to use an $n\times n$ similarity matrix for optimization, which makes the memory and computation unaffordable.
no code implementations • 16 Dec 2022 • Zinuo Li, Xuhang Chen, Chi-Man Pun, Shuqiang Wang
Image enhancement is a technique that is frequently utilized in digital image processing.
no code implementations • 11 Mar 2023 • Xuhang Chen, Baiying Lei, Chi-Man Pun, Shuqiang Wang
Brain network analysis is essential for the diagnosis of, and intervention in, Alzheimer's disease (AD).
no code implementations • 10 Apr 2023 • Wenyun Li, Guo Zhong, Xingyu Lu, Chi-Man Pun
This article proposes a multiview hashing method with learnable parameters to retrieve queried images from a large-scale remote sensing dataset.
no code implementations • 18 May 2023 • Qiankun Zuo, Chi-Man Pun, Yudong Zhang, Hongfei Wang, Jin Hong
In this paper, a novel Multi-resolution Spatiotemporal Enhanced Transformer Denoising (MSETD) network with an adversarially functional diffusion model is proposed to map functional magnetic resonance imaging (fMRI) into effective connectivity for mild cognitive impairment (MCI) analysis.
no code implementations • 28 Jul 2023 • Shenghong Luo, Ruifeng Xu, Xuhang Chen, Zinuo Li, Chi-Man Pun, Shuqiang Wang
In this study, we propose DocDeshadower, a multi-frequency Transformer-based model built on a Laplacian pyramid.
no code implementations • 4 Oct 2023 • Xuhang Chen, Chi-Man Pun, Shuqiang Wang
Within this framework, we introduce the Prompt Extraction Block and the Prompt Fusion Block to efficiently encode the cross-modal prompt.
no code implementations • 10 Oct 2023 • Xiaochen Ma, Jizhe Zhou, Xiong Xu, Zhuohang Jiang, Chi-Man Pun
While MAE has demonstrated an impressive understanding of object semantics, PMAE can also compensate for low-level semantics with our proposed enhancements.
no code implementations • 14 Nov 2023 • Wenyun Li, Chi-Man Pun
Glaucoma is a chronic neurodegenerative condition that can lead to blindness.
no code implementations • 2 Feb 2024 • Guanwen Feng, Haoran Cheng, Yunan Li, Zhiyuan Ma, Chaoneng Li, Zhihao Qian, Qiguang Miao, Chi-Man Pun
Additionally, we propose an emotion intensity control method using a fine-grained emotion matrix.
no code implementations • 7 Mar 2024 • Weihuang Liu, Xi Shen, Haolun Li, Xiuli Bi, Bo Liu, Chi-Man Pun, Xiaodong Cun
In this work, we introduce a test-time training (TTT) strategy to address the problem.
1 code implementation • 18 Aug 2023 • Lin Yuan, Guoheng Huang, Fenghuan Li, Xiaochen Yuan, Chi-Man Pun, Guo Zhong
This module can construct the interaction between different modalities and capture long-range contextual information based on similarity clusters.
1 code implementation • 31 Oct 2023 • Weiwen Chen, Yingtie Lei, Shenghong Luo, Ziyang Zhou, Mingxian Li, Chi-Man Pun
Underwater images often exhibit poor quality, distorted color balance and low contrast due to the complex and intricate interplay of light, water, and objects.
1 code implementation • 13 Sep 2023 • Weiwen Chen, Yingtie Lei, Shenghong Luo, Ziyang Zhou, Mingxian Li, Chi-Man Pun
The STD module employs a traditional thresholding technique and leverages the attention mechanism of the Transformer to gather global information, thereby enabling precise detection of shadow masks.
1 code implementation • 30 Dec 2023 • Lianyu Hu, Liqing Gao, Zekang Liu, Chi-Man Pun, Wei Feng
First, the prompts of the vision and language branches in these methods are usually separated or uni-directionally correlated.
1 code implementation • 12 May 2023 • Zewen Zheng, Guoheng Huang, Xiaochen Yuan, Chi-Man Pun, Hongrui Liu, Wing-Kuen Ling
In this paper, we introduce a quaternion perspective on correlation learning and propose a novel Quaternion-valued Correlation Learning Network (QCLNet), which aims to alleviate the computational burden of the high-dimensional correlation tensor and to explore the internal latent interaction between query and support images by leveraging operations defined by the established quaternion algebra.
Ranked #19 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)
1 code implementation • 16 Aug 2023 • Lianyu Hu, Liqing Gao, Zekang Liu, Chi-Man Pun, Wei Feng
Then these features are fed into a policy network to intelligently select a subsequence to process.
Ranked #7 on Sign Language Recognition on CSL-Daily
1 code implementation • 26 Aug 2023 • Shenghong Luo, Xuhang Chen, Weiwen Chen, Zinuo Li, Shuqiang Wang, Chi-Man Pun
Vignetting commonly occurs as a degradation in images resulting from factors such as lens design, improper lens hood usage, and limitations in camera sensors.
1 code implementation • 30 Nov 2022 • Xuhang Chen, Xiaodong Cun, Chi-Man Pun, Shuqiang Wang
Shadow removal improves the visual quality and legibility of digital copies of documents.
2 code implementations • 13 Sep 2021 • Jingtang Liang, Xiaodong Cun, Chi-Man Pun, Jue Wang
To this end, we propose a novel spatial-separated curve rendering network (S$^2$CRNet) for efficient and high-resolution image harmonization for the first time.
Ranked #12 on Image Harmonization on iHarmony4
1 code implementation • 21 Jan 2023 • Zinuo Li, Xuhang Chen, Shuqiang Wang, Chi-Man Pun
In order to facilitate film-based image stylization research, we construct FilmSet, a large-scale and high-quality film style dataset.
1 code implementation • 15 Jul 2019 • Xiaodong Cun, Chi-Man Pun
Thus, we address the problem of Image Harmonization: Given a spliced image and the mask of the spliced region, we try to harmonize the "style" of the pasted region with the background (non-spliced region).
Ranked #5 on Image Harmonization on HAdobe5k(1024$\times$1024)
1 code implementation • ECCV 2020 • Xiaodong Cun, Chi-Man Pun
In detail, we learn the defocus blur from ground truth and the depth distilled from a well-trained depth estimation network at the same time.
1 code implementation • 15 Mar 2023 • Weihuang Liu, Xiaodong Cun, Chi-Man Pun, Menghan Xia, Yong Zhang, Jue Wang
Thanks to the proposed structure, we only need to encode the high-resolution image at a relatively low resolution to capture a larger receptive field.
1 code implementation • CVPR 2023 • Weihuang Liu, Xi Shen, Chi-Man Pun, Xiaodong Cun
Unlike previous visual prompting, which is typically a dataset-level implicit embedding, our key insight is to make the tunable parameters focus on the explicit visual content of each individual image, i.e., the features from frozen patch embeddings and the input's high-frequency components.
Ranked #1 on Salient Object Detection on DUT-OMRON
2 code implementations • 29 May 2023 • Weihuang Liu, Xi Shen, Chi-Man Pun, Xiaodong Cun
We take inspiration from the widely-used pre-training and then prompt tuning protocols in NLP and propose a new visual prompting model, named Explicit Visual Prompting (EVP).
Ranked #1 on Salient Object Detection on HKU-IS
1 code implementation • 26 Nov 2023 • Yudian Zheng, Xiaodong Cun, Menghan Xia, Chi-Man Pun
Understanding semantic intricacies and high-level concepts is essential in image sketch generation, and this challenge becomes even more formidable when applied to the domain of videos.
2 code implementations • ICCV 2023 • Zinuo Li, Xuhang Chen, Chi-Man Pun, Xiaodong Cun
We handle high-resolution document shadow removal directly via a larger-scale real-world dataset and a carefully designed frequency-aware network.
1 code implementation • 13 Dec 2020 • Xiaodong Cun, Chi-Man Pun
Simultaneously, to increase the robustness of watermarks, attack techniques such as watermark removal have also attracted attention from the community.
1 code implementation • 4 Dec 2023 • Jie Wang, Jiu-Cheng Xie, Xianyan Li, Feng Xu, Chi-Man Pun, Hao Gao
Constructing vivid 3D head avatars for given subjects and realizing a series of animations on them is valuable yet challenging.
5 code implementations • 20 Nov 2019 • Xiaodong Cun, Chi-Man Pun, Cheng Shi
With the help of novel masks or scenes, we enhance the current datasets using synthesized shadow images.
Ranked #2 on Shadow Removal on ISTD