Search Results for author: Chi-Man Pun

Found 44 papers, 20 papers with code

Depth Assisted Full Resolution Network for Single Image-based View Synthesis

no code implementations17 Nov 2017 Xiaodong Cun, Feng Xu, Chi-Man Pun, Hao Gao

In this paper, we focus on a more challenging and ill-posed problem that is to synthesize novel viewpoints from one single input image.

Depth Estimation

Generating Adversarial Perturbation with Root Mean Square Gradient

no code implementations13 Jan 2019 Yatie Xiao, Chi-Man Pun, Jizhe Zhou

We focus our attention on the problem of generating adversarial perturbations based on the gradient in image classification domain

Classification General Classification +1

Adaptive Gradient for Adversarial Perturbations Generation

no code implementations1 Feb 2019 Yatie Xiao, Chi-Man Pun

Deep Neural Networks have achieved remarkable success in computer vision, natural language processing, and audio tasks.

Image Classification

Pixelation is NOT Done in Videos Yet

no code implementations26 Mar 2019 Jizhe Zhou, Chi-Man Pun, YingYu Wang

This paper introduces an algorithm to protect the privacy of individuals in streaming video data by blurring faces such that face cannot be reliably recognized.

Clustering Face Detection +2

Generating Minimal Adversarial Perturbations with Integrated Adaptive Gradients

no code implementations12 Apr 2019 Yatie Xiao, Chi-Man Pun

Deep neural networks are easily fooled high confidence predictions for adversarial samples

Image Classification

Improving the Harmony of the Composite Image by Spatial-Separated Attention Module

1 code implementation15 Jul 2019 Xiaodong Cun, Chi-Man Pun

Thus, we address the problem of Image Harmonization: Given a spliced image and the mask of the spliced region, we try to harmonize the "style" of the pasted region with the background (non-spliced region).

Image Harmonization

Defocus Blur Detection via Depth Distillation

1 code implementation ECCV 2020 Xiaodong Cun, Chi-Man Pun

In detail, we learn the defocus blur from ground truth and the depth distilled from a well-trained depth estimation network at the same time.

Defocus Blur Detection Depth Estimation +1

Split then Refine: Stacked Attention-guided ResUNets for Blind Single Image Visible Watermark Removal

1 code implementation13 Dec 2020 Xiaodong Cun, Chi-Man Pun

Simultaneously, to increase the robustness of watermark, attacking technique, such as watermark removal, also gets the attention from the community.

News Image Steganography: A Novel Architecture Facilitates the Fake News Identification

no code implementations3 Jan 2021 Jizhe Zhou, Chi-Man Pun, Yu tong

A larger portion of fake news quotes untampered images from other sources with ulterior motives rather than conducting image forgery.

Extractive Summarization Image Steganography

Privacy-sensitive Objects Pixelation for Live Video Streaming

no code implementations3 Jan 2021 Jizhe Zhou, Chi-Man Pun, Yu tong

With the prevailing of live video streaming, establishing an online pixelation method for privacy-sensitive objects is an urgency.

Clustering

Personal Privacy Protection via Irrelevant Faces Tracking and Pixelation in Video Live Streaming

no code implementations4 Jan 2021 Jizhe Zhou, Chi-Man Pun

On the video live streaming dataset we collected, FPVLS obtains satisfying accuracy, real-time efficiency, and contains the over-pixelation problems.

Clustering Face Detection

Kinship Verification Based on Cross-Generation Feature Interaction Learning

no code implementations7 Sep 2021 Guan-Nan Dong, Chi-Man Pun, Zheng Zhang

Specifically, we take parents and children as a whole to extract the expressive local and non-local features.

Kinship Verification

Deep Collaborative Multi-Modal Learning for Unsupervised Kinship Estimation

no code implementations7 Sep 2021 Guan-Nan Dong, Chi-Man Pun, Zheng Zhang

To this end, we propose a novel deep collaborative multi-modal learning (DCML) to integrate the underlying information presented in facial properties in an adaptive manner to strengthen the facial details for effective unsupervised kinship verification.

Face Recognition Kinship Verification

Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization

2 code implementations13 Sep 2021 Jingtang Liang, Xiaodong Cun, Chi-Man Pun, Jue Wang

To this end, we propose a novel spatial-separated curve rendering network(S$^2$CRNet) for efficient and high-resolution image harmonization for the first time.

Image Harmonization Image-to-Image Translation +2

Image Harmonization with Region-wise Contrastive Learning

no code implementations27 May 2022 Jingtang Liang, Chi-Man Pun

Our method attempts to bring together corresponding positive and negative samples by maximizing the mutual information between the foreground and background styles, which desirably makes our harmonization network more robust to discriminate the foreground and background style features when harmonizing composite images.

Contrastive Learning Image Harmonization

Arbitrary Style Transfer with Structure Enhancement by Combining the Global and Local Loss

no code implementations23 Jul 2022 Lizhen Long, Chi-Man Pun

To solve this problem, we introduce a novel arbitrary style transfer method with structure enhancement by combining the global and local loss.

Classification Style Transfer

Asymmetric Scalable Cross-modal Hashing

no code implementations26 Jul 2022 Wenyun Li, Chi-Man Pun

In addition, most of the existing methods choose to use an $n\times n$ similarity matrix for optimization, which makes the memory and computation unaffordable.

Retrieval

WavEnhancer: Unifying Wavelet and Transformer for Image Enhancement

no code implementations16 Dec 2022 Zinuo Li, Xuhang Chen, Chi-Man Pun, Shuqiang Wang

Image enhancement is a technique that frequently utilized in digital image processing.

Image Enhancement

A Large-scale Film Style Dataset for Learning Multi-frequency Driven Film Enhancement

1 code implementation21 Jan 2023 Zinuo Li, Xuhang Chen, Shuqiang Wang, Chi-Man Pun

In order to facilitate film-based image stylization research, we construct FilmSet, a large-scale and high-quality film style dataset.

Film Simulation Image Stylization

Brain Diffuser: An End-to-End Brain Image to Brain Network Pipeline

no code implementations11 Mar 2023 Xuhang Chen, Baiying Lei, Chi-Man Pun, Shuqiang Wang

Brain network analysis is essential for diagnosing and intervention for Alzheimer's disease (AD).

CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying

1 code implementation15 Mar 2023 Weihuang Liu, Xiaodong Cun, Chi-Man Pun, Menghan Xia, Yong Zhang, Jue Wang

Thanks to the proposed structure, we only encode the high-resolution image in a relatively low resolution for larger reception field capturing.

Image Inpainting Vocal Bursts Intensity Prediction

Explicit Visual Prompting for Low-Level Structure Segmentations

1 code implementation CVPR 2023 Weihuang Liu, Xi Shen, Chi-Man Pun, Xiaodong Cun

Different from the previous visual prompting which is typically a dataset-level implicit embedding, our key insight is to enforce the tunable parameters focusing on the explicit visual content from each individual image, i. e., the features from frozen patch embeddings and the input's high-frequency components.

Camouflaged Object Segmentation Defocus Blur Detection +5

Locality Preserving Multiview Graph Hashing for Large Scale Remote Sensing Image Search

no code implementations10 Apr 2023 Wenyun Li, Guo Zhong, Xingyu Lu, Chi-Man Pun

This article proposes a multiview hashing with learnable parameters to retrieve the queried images for a large-scale remote sensing dataset.

Image Retrieval

Quaternion-valued Correlation Learning for Few-Shot Semantic Segmentation

1 code implementation12 May 2023 Zewen Zheng, Guoheng Huang, Xiaochen Yuan, Chi-Man Pun, Hongrui Liu, Wing-Kuen Ling

In this paper, we introduce a quaternion perspective on correlation learning and propose a novel Quaternion-valued Correlation Learning Network (QCLNet), with the aim to alleviate the computational burden of high-dimensional correlation tensor and explore internal latent interaction between query and support images by leveraging operations defined by the established quaternion algebra.

Few-Shot Semantic Segmentation Semantic Segmentation

Multi-resolution Spatiotemporal Enhanced Transformer Denoising with Functional Diffusive GANs for Constructing Brain Effective Connectivity in MCI analysis

no code implementations18 May 2023 Qiankun Zuo, Chi-Man Pun, Yudong Zhang, Hongfei Wang, Jin Hong

In this paper, a novel Multi-resolution Spatiotemporal Enhanced Transformer Denoising (MSETD) network with an adversarially functional diffusion model is proposed to map functional magnetic resonance imaging (fMRI) into effective connectivity for mild cognitive impairment (MCI) analysis.

Denoising Time Series

Explicit Visual Prompting for Universal Foreground Segmentations

2 code implementations29 May 2023 Weihuang Liu, Xi Shen, Chi-Man Pun, Xiaodong Cun

We take inspiration from the widely-used pre-training and then prompt tuning protocols in NLP and propose a new visual prompting model, named Explicit Visual Prompting (EVP).

Camouflaged Object Segmentation Defocus Blur Detection +5

DocDeshadower: Frequency-aware Transformer for Document Shadow Removal

no code implementations28 Jul 2023 Shenghong Luo, Ruifeng Xu, Xuhang Chen, Zinuo Li, Chi-Man Pun, Shuqiang Wang

In this study, we propose the DocDeshadower, a multi-frequency Transformer-based model built on Laplacian Pyramid.

Document Shadow Removal

RBA-GCN: Relational Bilevel Aggregation Graph Convolutional Network for Emotion Recognition

1 code implementation18 Aug 2023 Lin Yuan, Guoheng Huang, Fenghuan Li, Xiaochen Yuan, Chi-Man Pun, Guo Zhong

This module can construct the interaction between different modalities and capture long-range contextual information based on similarity clusters.

Emotion Recognition in Conversation Graph Generation

Devignet: High-Resolution Vignetting Removal via a Dual Aggregated Fusion Transformer With Adaptive Channel Expansion

1 code implementation26 Aug 2023 Shenghong Luo, Xuhang Chen, Weiwen Chen, Zinuo Li, Shuqiang Wang, Chi-Man Pun

Vignetting commonly occurs as a degradation in images resulting from factors such as lens design, improper lens hood usage, and limitations in camera sensors.

Vignetting Removal

ShaDocFormer: A Shadow-Attentive Threshold Detector With Cascaded Fusion Refiner for Document Shadow Removal

1 code implementation13 Sep 2023 Weiwen Chen, Yingtie Lei, Shenghong Luo, Ziyang Zhou, Mingxian Li, Chi-Man Pun

The STD module employs a traditional thresholding technique and leverages the attention mechanism of the Transformer to gather global information, thereby enabling precise detection of shadow masks.

Document Shadow Removal

MedPrompt: Cross-Modal Prompting for Multi-Task Medical Image Translation

no code implementations4 Oct 2023 Xuhang Chen, Chi-Man Pun, Shuqiang Wang

Within this framework, we introduce the Prompt Extraction Block and the Prompt Fusion Block to efficiently encode the cross-modal prompt.

Translation

Perceptual MAE for Image Manipulation Localization: A High-level Vision Learner Focusing on Low-level Features

no code implementations10 Oct 2023 Xiaochen Ma, Jizhe Zhou, Xiong Xu, Zhuohang Jiang, Chi-Man Pun

While MAE has demonstrated an impressive understanding of object semantics, PMAE can also compensate for low-level semantics with our proposed enhancements.

Image Manipulation Image Manipulation Localization

UWFormer: Underwater Image Enhancement via a Semi-Supervised Multi-Scale Transformer

1 code implementation31 Oct 2023 Weiwen Chen, Yingtie Lei, Shenghong Luo, Ziyang Zhou, Mingxian Li, Chi-Man Pun

Underwater images often exhibit poor quality, distorted color balance and low contrast due to the complex and intricate interplay of light, water, and objects.

Image Enhancement

ELF: An End-to-end Local and Global Multimodal Fusion Framework for Glaucoma Grading

no code implementations14 Nov 2023 Wenyun Li, Chi-Man Pun

Glaucoma is a chronic neurodegenerative condition that can lead to blindness.

Sketch Video Synthesis

1 code implementation26 Nov 2023 Yudian Zheng, Xiaodong Cun, Menghan Xia, Chi-Man Pun

Understanding semantic intricacies and high-level concepts is essential in image sketch generation, and this challenge becomes even more formidable when applied to the domain of videos.

Video Editing

GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation

1 code implementation4 Dec 2023 Jie Wang, Jiu-Cheng Xie, Xianyan Li, Feng Xu, Chi-Man Pun, Hao Gao

Constructing vivid 3D head avatars for given subjects and realizing a series of animations on them is valuable yet challenging.

Novel View Synthesis

COMMA: Co-Articulated Multi-Modal Learning

1 code implementation30 Dec 2023 Lianyu Hu, Liqing Gao, Zekang Liu, Chi-Man Pun, Wei Feng

First, the prompts of the vision and language branches in these methods are usually separated or uni-directionally correlated.

Prompt Engineering

Cannot find the paper you are looking for? You can Submit a new open access paper.