Search Results for author: Chi-Man Pun

Found 44 papers, 20 papers with code

Depth Assisted Full Resolution Network for Single Image-based View Synthesis

no code implementations • 17 Nov 2017 • Xiaodong Cun, Feng Xu, Chi-Man Pun, Hao Gao

In this paper, we focus on a more challenging and ill-posed problem that is to synthesize novel viewpoints from one single input image.

Depth Estimation

Paper
Add Code

Generating Adversarial Perturbation with Root Mean Square Gradient

no code implementations • 13 Jan 2019 • Yatie Xiao, Chi-Man Pun, Jizhe Zhou

We focus our attention on the problem of generating adversarial perturbations based on the gradient in image classification domain

Classification General Classification +1

Paper
Add Code

Adaptive Gradient for Adversarial Perturbations Generation

no code implementations • 1 Feb 2019 • Yatie Xiao, Chi-Man Pun

Deep Neural Networks have achieved remarkable success in computer vision, natural language processing, and audio tasks.

Image Classification

Paper
Add Code

Pixelation is NOT Done in Videos Yet

no code implementations • 26 Mar 2019 • Jizhe Zhou, Chi-Man Pun, YingYu Wang

This paper introduces an algorithm to protect the privacy of individuals in streaming video data by blurring faces such that face cannot be reliably recognized.

Clustering Face Detection +2

Paper
Add Code

Generating Minimal Adversarial Perturbations with Integrated Adaptive Gradients

no code implementations • 12 Apr 2019 • Yatie Xiao, Chi-Man Pun

Deep neural networks are easily fooled high confidence predictions for adversarial samples

Image Classification

Paper
Add Code

Personal Privacy Protection via Irrelevant Faces Tracking and Pixelation in Video Live Streaming

no code implementations • 4 Jan 2021 • Jizhe Zhou, Chi-Man Pun

On the video live streaming dataset we collected, FPVLS obtains satisfying accuracy, real-time efficiency, and contains the over-pixelation problems.

Clustering Face Detection

Paper
Add Code

News Image Steganography: A Novel Architecture Facilitates the Fake News Identification

no code implementations • 3 Jan 2021 • Jizhe Zhou, Chi-Man Pun, Yu tong

A larger portion of fake news quotes untampered images from other sources with ulterior motives rather than conducting image forgery.

Extractive Summarization Image Steganography

Paper
Add Code

Privacy-sensitive Objects Pixelation for Live Video Streaming

no code implementations • 3 Jan 2021 • Jizhe Zhou, Chi-Man Pun, Yu tong

With the prevailing of live video streaming, establishing an online pixelation method for privacy-sensitive objects is an urgency.

Clustering

Paper
Add Code

Deep Collaborative Multi-Modal Learning for Unsupervised Kinship Estimation

no code implementations • 7 Sep 2021 • Guan-Nan Dong, Chi-Man Pun, Zheng Zhang

To this end, we propose a novel deep collaborative multi-modal learning (DCML) to integrate the underlying information presented in facial properties in an adaptive manner to strengthen the facial details for effective unsupervised kinship verification.

Face Recognition Kinship Verification

Paper
Add Code

Kinship Verification Based on Cross-Generation Feature Interaction Learning

no code implementations • 7 Sep 2021 • Guan-Nan Dong, Chi-Man Pun, Zheng Zhang

Specifically, we take parents and children as a whole to extract the expressive local and non-local features.

Kinship Verification

Paper
Add Code

Learning Enriched Illuminants for Cross and Single Sensor Color Constancy

no code implementations • 21 Mar 2022 • Xiaodong Cun, Zhendong Wang, Chi-Man Pun, Jianzhuang Liu, Wengang Zhou, Xu Jia, Houqiang Li

Color constancy aims to restore the constant colors of a scene under different illuminants.

Color Constancy

Paper
Add Code

Image Harmonization with Region-wise Contrastive Learning

no code implementations • 27 May 2022 • Jingtang Liang, Chi-Man Pun

Our method attempts to bring together corresponding positive and negative samples by maximizing the mutual information between the foreground and background styles, which desirably makes our harmonization network more robust to discriminate the foreground and background style features when harmonizing composite images.

Contrastive Learning Image Harmonization

Paper
Add Code

Arbitrary Style Transfer with Structure Enhancement by Combining the Global and Local Loss

no code implementations • 23 Jul 2022 • Lizhen Long, Chi-Man Pun

To solve this problem, we introduce a novel arbitrary style transfer method with structure enhancement by combining the global and local loss.

Classification Style Transfer

Paper
Add Code

Asymmetric Scalable Cross-modal Hashing

no code implementations • 26 Jul 2022 • Wenyun Li, Chi-Man Pun

In addition, most of the existing methods choose to use an $n\times n$ similarity matrix for optimization, which makes the memory and computation unaffordable.

Retrieval

Paper
Add Code

WavEnhancer: Unifying Wavelet and Transformer for Image Enhancement

no code implementations • 16 Dec 2022 • Zinuo Li, Xuhang Chen, Chi-Man Pun, Shuqiang Wang

Image enhancement is a technique that frequently utilized in digital image processing.

Image Enhancement

Paper
Add Code

Brain Diffuser: An End-to-End Brain Image to Brain Network Pipeline

no code implementations • 11 Mar 2023 • Xuhang Chen, Baiying Lei, Chi-Man Pun, Shuqiang Wang

Brain network analysis is essential for diagnosing and intervention for Alzheimer's disease (AD).

Paper
Add Code

Locality Preserving Multiview Graph Hashing for Large Scale Remote Sensing Image Search

no code implementations • 10 Apr 2023 • Wenyun Li, Guo Zhong, Xingyu Lu, Chi-Man Pun

This article proposes a multiview hashing with learnable parameters to retrieve the queried images for a large-scale remote sensing dataset.

Image Retrieval

Paper
Add Code

Multi-resolution Spatiotemporal Enhanced Transformer Denoising with Functional Diffusive GANs for Constructing Brain Effective Connectivity in MCI analysis

no code implementations • 18 May 2023 • Qiankun Zuo, Chi-Man Pun, Yudong Zhang, Hongfei Wang, Jin Hong

In this paper, a novel Multi-resolution Spatiotemporal Enhanced Transformer Denoising (MSETD) network with an adversarially functional diffusion model is proposed to map functional magnetic resonance imaging (fMRI) into effective connectivity for mild cognitive impairment (MCI) analysis.

Denoising Time Series

Paper
Add Code

DocDeshadower: Frequency-aware Transformer for Document Shadow Removal

no code implementations • 28 Jul 2023 • Shenghong Luo, Ruifeng Xu, Xuhang Chen, Zinuo Li, Chi-Man Pun, Shuqiang Wang

In this study, we propose the DocDeshadower, a multi-frequency Transformer-based model built on Laplacian Pyramid.

Document Shadow Removal

Paper
Add Code

MedPrompt: Cross-Modal Prompting for Multi-Task Medical Image Translation

no code implementations • 4 Oct 2023 • Xuhang Chen, Chi-Man Pun, Shuqiang Wang

Within this framework, we introduce the Prompt Extraction Block and the Prompt Fusion Block to efficiently encode the cross-modal prompt.

Translation

Paper
Add Code

Perceptual MAE for Image Manipulation Localization: A High-level Vision Learner Focusing on Low-level Features

no code implementations • 10 Oct 2023 • Xiaochen Ma, Jizhe Zhou, Xiong Xu, Zhuohang Jiang, Chi-Man Pun

While MAE has demonstrated an impressive understanding of object semantics, PMAE can also compensate for low-level semantics with our proposed enhancements.

Image Manipulation Image Manipulation Localization

Paper
Add Code

ELF: An End-to-end Local and Global Multimodal Fusion Framework for Glaucoma Grading

no code implementations • 14 Nov 2023 • Wenyun Li, Chi-Man Pun

Glaucoma is a chronic neurodegenerative condition that can lead to blindness.

Paper
Add Code

EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation

no code implementations • 2 Feb 2024 • Guanwen Feng, Haoran Cheng, Yunan Li, Zhiyuan Ma, Chaoneng Li, Zhihao Qian, Qiguang Miao, Chi-Man Pun

Additionally, we propose an emotion intensity control method using a fine-grained emotion matrix.

Attribute Talking Face Generation

Paper
Add Code

Depth-aware Test-Time Training for Zero-shot Video Object Segmentation

no code implementations • 7 Mar 2024 • Weihuang Liu, Xi Shen, Haolun Li, Xiuli Bi, Bo Liu, Chi-Man Pun, Xiaodong Cun

In this work, we introduce a test-time training (TTT) strategy to address the problem.

Depth Estimation Depth Prediction +4

Paper
Add Code

RBA-GCN: Relational Bilevel Aggregation Graph Convolutional Network for Emotion Recognition

1 code implementation • 18 Aug 2023 • Lin Yuan, Guoheng Huang, Fenghuan Li, Xiaochen Yuan, Chi-Man Pun, Guo Zhong

This module can construct the interaction between different modalities and capture long-range contextual information based on similarity clusters.

Emotion Recognition in Conversation Graph Generation

Paper
Code

UWFormer: Underwater Image Enhancement via a Semi-Supervised Multi-Scale Transformer

1 code implementation • 31 Oct 2023 • Weiwen Chen, Yingtie Lei, Shenghong Luo, Ziyang Zhou, Mingxian Li, Chi-Man Pun

Underwater images often exhibit poor quality, distorted color balance and low contrast due to the complex and intricate interplay of light, water, and objects.

Image Enhancement

Paper
Code

ShaDocFormer: A Shadow-Attentive Threshold Detector With Cascaded Fusion Refiner for Document Shadow Removal

1 code implementation • 13 Sep 2023 • Weiwen Chen, Yingtie Lei, Shenghong Luo, Ziyang Zhou, Mingxian Li, Chi-Man Pun

The STD module employs a traditional thresholding technique and leverages the attention mechanism of the Transformer to gather global information, thereby enabling precise detection of shadow masks.

Document Shadow Removal

Paper
Code

COMMA: Co-Articulated Multi-Modal Learning

1 code implementation • 30 Dec 2023 • Lianyu Hu, Liqing Gao, Zekang Liu, Chi-Man Pun, Wei Feng

First, the prompts of the vision and language branches in these methods are usually separated or uni-directionally correlated.

Prompt Engineering

Paper
Code

Quaternion-valued Correlation Learning for Few-Shot Semantic Segmentation

1 code implementation • 12 May 2023 • Zewen Zheng, Guoheng Huang, Xiaochen Yuan, Chi-Man Pun, Hongrui Liu, Wing-Kuen Ling

In this paper, we introduce a quaternion perspective on correlation learning and propose a novel Quaternion-valued Correlation Learning Network (QCLNet), with the aim to alleviate the computational burden of high-dimensional correlation tensor and explore internal latent interaction between query and support images by leveraging operations defined by the established quaternion algebra.

Ranked #19 on Few-Shot Semantic Segmentation on COCO-20i (5-shot)

Few-Shot Semantic Segmentation Semantic Segmentation

Paper
Code

AdaBrowse: Adaptive Video Browser for Efficient Continuous Sign Language Recognition

1 code implementation • 16 Aug 2023 • Lianyu Hu, Liqing Gao, Zekang Liu, Chi-Man Pun, Wei Feng

Then these features are fed into a policy network to intelligently select a subsequence to process.

Ranked #7 on Sign Language Recognition on CSL-Daily

Sentence Sign Language Recognition

Paper
Code

Devignet: High-Resolution Vignetting Removal via a Dual Aggregated Fusion Transformer With Adaptive Channel Expansion

1 code implementation • 26 Aug 2023 • Shenghong Luo, Xuhang Chen, Weiwen Chen, Zinuo Li, Shuqiang Wang, Chi-Man Pun

Vignetting commonly occurs as a degradation in images resulting from factors such as lens design, improper lens hood usage, and limitations in camera sensors.

Vignetting Removal

Paper
Code

ShaDocNet: Learning Spatial-Aware Tokens in Transformer for Document Shadow Removal

1 code implementation • 30 Nov 2022 • Xuhang Chen, Xiaodong Cun, Chi-Man Pun, Shuqiang Wang

Shadow removal improves the visual quality and legibility of digital copies of documents.

Document Shadow Removal Image Shadow Removal +1

Paper
Code

Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization

2 code implementations • 13 Sep 2021 • Jingtang Liang, Xiaodong Cun, Chi-Man Pun, Jue Wang

To this end, we propose a novel spatial-separated curve rendering network(S$^2$CRNet) for efficient and high-resolution image harmonization for the first time.

Ranked #12 on Image Harmonization on iHarmony4

Image Harmonization Image-to-Image Translation +2

Paper
Code

A Large-scale Film Style Dataset for Learning Multi-frequency Driven Film Enhancement

1 code implementation • 21 Jan 2023 • Zinuo Li, Xuhang Chen, Shuqiang Wang, Chi-Man Pun

In order to facilitate film-based image stylization research, we construct FilmSet, a large-scale and high-quality film style dataset.

Film Simulation Image Stylization

Paper
Code

Improving the Harmony of the Composite Image by Spatial-Separated Attention Module

1 code implementation • 15 Jul 2019 • Xiaodong Cun, Chi-Man Pun

Thus, we address the problem of Image Harmonization: Given a spliced image and the mask of the spliced region, we try to harmonize the "style" of the pasted region with the background (non-spliced region).

Ranked #5 on Image Harmonization on HAdobe5k(1024$\times$1024)

Image Harmonization

Paper
Code

Defocus Blur Detection via Depth Distillation

1 code implementation • ECCV 2020 • Xiaodong Cun, Chi-Man Pun

In detail, we learn the defocus blur from ground truth and the depth distilled from a well-trained depth estimation network at the same time.

Defocus Blur Detection Depth Estimation +1

Paper
Code

CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying

1 code implementation • 15 Mar 2023 • Weihuang Liu, Xiaodong Cun, Chi-Man Pun, Menghan Xia, Yong Zhang, Jue Wang

Thanks to the proposed structure, we only encode the high-resolution image in a relatively low resolution for larger reception field capturing.

Image Inpainting Vocal Bursts Intensity Prediction

Paper
Code

Explicit Visual Prompting for Low-Level Structure Segmentations

1 code implementation • CVPR 2023 • Weihuang Liu, Xi Shen, Chi-Man Pun, Xiaodong Cun

Different from the previous visual prompting which is typically a dataset-level implicit embedding, our key insight is to enforce the tunable parameters focusing on the explicit visual content from each individual image, i. e., the features from frozen patch embeddings and the input's high-frequency components.

Ranked #1 on Salient Object Detection on DUT-OMRON

Camouflaged Object Segmentation Defocus Blur Detection +5

164

Paper
Code

Explicit Visual Prompting for Universal Foreground Segmentations

2 code implementations • 29 May 2023 • Weihuang Liu, Xi Shen, Chi-Man Pun, Xiaodong Cun

We take inspiration from the widely-used pre-training and then prompt tuning protocols in NLP and propose a new visual prompting model, named Explicit Visual Prompting (EVP).

Ranked #1 on Salient Object Detection on HKU-IS

Camouflaged Object Segmentation Defocus Blur Detection +5

164

Paper
Code

Sketch Video Synthesis

1 code implementation • 26 Nov 2023 • Yudian Zheng, Xiaodong Cun, Menghan Xia, Chi-Man Pun

Understanding semantic intricacies and high-level concepts is essential in image sketch generation, and this challenge becomes even more formidable when applied to the domain of videos.

Video Editing

179

Paper
Code