Search Results for author: Zhibo Chen

Found 114 papers, 37 papers with code

Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis?

no code implementations10 Mar 2024 Hanxin Zhu, Tianyu He, Xin Li, Bingchen Li, Zhibo Chen

Neural Radiance Field (NeRF) has achieved superior performance for novel view synthesis by modeling the scene with a Multi-Layer Perception (MLP) and a volume rendering procedure, however, when fewer known views are given (i. e., few-shot view synthesis), the model is prone to overfit the given views.

Novel View Synthesis

SeD: Semantic-Aware Discriminator for Image Super-Resolution

no code implementations29 Feb 2024 Bingchen Li, Xin Li, Hanxin Zhu, Yeying Jin, Ruoyu Feng, Zhizheng Zhang, Zhibo Chen

In particular, one discriminator is utilized to enable the SR network to learn the distribution of real-world high-quality images in an adversarial training manner.

Image Super-Resolution

CMC: Few-shot Novel View Synthesis via Cross-view Multiplane Consistency

no code implementations26 Feb 2024 Hanxin Zhu, Tianyu He, Zhibo Chen

Furthermore, to regularize the unseen target views, we constrain the rendered colors and depths from different input views to be the same.

Novel View Synthesis

KVQ: Kwai Video Quality Assessment for Short-form Videos

no code implementations11 Feb 2024 Yiting Lu, Xin Li, Yajing Pei, Kun Yuan, Qizhi Xie, Yunpeng Qu, Ming Sun, Chao Zhou, Zhibo Chen

Short-form UGC video platforms, like Kwai and TikTok, have been an emerging and irreplaceable mainstream media form, thriving on user-friendly engagement, and kaleidoscope creation, etc.

Video Quality Assessment Visual Question Answering (VQA)

Conditional Neural Video Coding with Spatial-Temporal Super-Resolution

no code implementations25 Jan 2024 Henan Wang, Xiaohan Pan, Runsen Feng, Zongyu Guo, Zhibo Chen

This document is an expanded version of a one-page abstract originally presented at the 2024 Data Compression Conference.

Data Compression Image Compression +2

High-Fidelity Diffusion-based Image Editing

no code implementations25 Dec 2023 Chen Hou, Guoqiang Wei, Zhibo Chen

Diffusion models have attained remarkable success in the domains of image generation and editing.

Denoising Image Reconstruction +1

Diagnosis-oriented Medical Image Compression with Efficient Transfer Learning

no code implementations20 Oct 2023 Guangqi Xie, Xin Li, Xiaohan Pan, Zhibo Chen

Remote medical diagnosis has emerged as a critical and indispensable technique in practical medical systems, where medical data are required to be efficiently compressed and transmitted for diagnosis by either professional doctors or intelligent diagnosis devices.

Coronary Artery Segmentation Image Compression +2

FreqAlign: Excavating Perception-oriented Transferability for Blind Image Quality Assessment from A Frequency Perspective

no code implementations29 Sep 2023 Xin Li, Yiting Lu, Zhibo Chen

Based on this, we propose to improve the perception-oriented transferability of BIQA by performing feature frequency decomposition and selecting the frequency components that contained the most transferable perception knowledge for alignment.

Blind Image Quality Assessment Unsupervised Domain Adaptation

CCEdit: Creative and Controllable Video Editing via Diffusion Models

no code implementations28 Sep 2023 Ruoyu Feng, Wenming Weng, Yanhui Wang, Yuhui Yuan, Jianmin Bao, Chong Luo, Zhibo Chen, Baining Guo

The versatility of our framework is demonstrated through a diverse range of choices in both structure representations and personalized T2I models, as well as the option to provide the edited key frame.

Text-to-Image Generation Video Editing

GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph

1 code implementation NeurIPS 2023 Xin Li, Dongze Lian, Zhihe Lu, Jiawang Bai, Zhibo Chen, Xinchao Wang

To mitigate that, we propose an effective adapter-style tuning strategy, dubbed GraphAdapter, which performs the textual adapter by explicitly modeling the dual-modality structure knowledge (i. e., the correlation of different semantics/classes in textual and visual modalities) with a dual knowledge graph.

Transfer Learning

Diffusion Models for Image Restoration and Enhancement -- A Comprehensive Survey

1 code implementation18 Aug 2023 Xin Li, Yulin Ren, Xin Jin, Cuiling Lan, Xingrui Wang, Wenjun Zeng, Xinchao Wang, Zhibo Chen

Image restoration (IR) has been an indispensable and challenging task in the low-level vision field, which strives to improve the subjective quality of images distorted by various forms of degradation.

Deblurring Image Restoration +2

Compression with Bayesian Implicit Neural Representations

1 code implementation NeurIPS 2023 Zongyu Guo, Gergely Flamich, Jiajun He, Zhibo Chen, José Miguel Hernández-Lobato

Many common types of data can be represented as functions that map coordinates to signal values, such as pixel locations to RGB values in the case of an image.

Quantization

NVTC: Nonlinear Vector Transform Coding

1 code implementation CVPR 2023 Runsen Feng, Zongyu Guo, Weiping Li, Zhibo Chen

In theory, vector quantization (VQ) is always better than scalar quantization (SQ) in terms of rate-distortion (R-D) performance.

Image Compression Quantization

Exploring the Rate-Distortion-Complexity Optimization in Neural Image Compression

no code implementations12 May 2023 Yixin Gao, Runsen Feng, Zongyu Guo, Zhibo Chen

By quantifying the decoding complexity as a factor in the optimization goal, we are now able to precisely control the RDC trade-off and then demonstrate how the rate-distortion performance of neural image codecs could adapt to various complexity demands.

Image Compression

Prompt-ICM: A Unified Framework towards Image Coding for Machines with Task-driven Prompts

no code implementations4 May 2023 Ruoyu Feng, Jinming Liu, Xin Jin, Xiaohan Pan, Heming Sun, Zhibo Chen

For ICM, developing a unified codec to reduce information redundancy while empowering the compressed features to support various vision tasks is very important, which inevitably faces two core challenges: 1) How should the compression strategy be adjusted based on the downstream tasks?

Semantically Structured Image Compression via Irregular Group-Based Decoupling

no code implementations ICCV 2023 Ruoyu Feng, Yixin Gao, Xin Jin, Runsen Feng, Zhibo Chen

Nevertheless, they divide the input image into multiple rectangular regions according to semantics and ignore avoiding information interaction among them, causing waste of bitrate and distorted reconstruction of region boundaries.

Image Compression

Inpaint Anything: Segment Anything Meets Image Inpainting

1 code implementation13 Apr 2023 Tao Yu, Runseng Feng, Ruoyu Feng, Jinming Liu, Xin Jin, Wenjun Zeng, Zhibo Chen

We are also very willing to help everyone share and promote new projects based on our Inpaint Anything (IA).

Image Inpainting

Learning Distortion Invariant Representation for Image Restoration from A Causality Perspective

2 code implementations CVPR 2023 Xin Li, Bingchen Li, Xin Jin, Cuiling Lan, Zhibo Chen

In this paper, we are the first to propose a novel training strategy for image restoration from the causality perspective, to improve the generalization ability of DNNs for unknown degradations.

counterfactual Image Restoration +2

Versatile Neural Processes for Learning Implicit Neural Representations

1 code implementation21 Jan 2023 Zongyu Guo, Cuiling Lan, Zhizheng Zhang, Yan Lu, Zhibo Chen

In this paper, we propose an efficient NP framework dubbed Versatile Neural Processes (VNP), which largely increases the capability of approximating functions.

Semantic-aware Message Broadcasting for Efficient Unsupervised Domain Adaptation

1 code implementation6 Dec 2022 Xin Li, Cuiling Lan, Guoqiang Wei, Zhibo Chen

In this way, our message broadcasting encourages the group tokens to learn more informative and diverse information for effective domain alignment.

Pseudo Label Unsupervised Domain Adaptation

Task Residual for Tuning Vision-Language Models

1 code implementation CVPR 2023 Tao Yu, Zhihe Lu, Xin Jin, Zhibo Chen, Xinchao Wang

Large-scale vision-language models (VLMs) pre-trained on billion-level data have learned general visual representations and broad visual concepts.

Transfer Learning

MiNL: Micro-images based Neural Representation for Light Fields

no code implementations17 Sep 2022 Hanxin Zhu, Henan Wang, Zhibo Chen

Unlike explicit representation that represents light fields as Sub-Aperture Images (SAIs) based arrays or Micro-Images (MIs) based lenslet images, implicit representation treats light fields as neural networks, which is inherently a continuous representation in contrast to discrete explicit representation.

Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation

no code implementations24 Aug 2022 Guangqi Xie, Xin Li, Shiqi Lin, Li Zhang, Kai Zhang, Yue Li, Zhibo Chen

In this paper, we take a step forward to video semantic compression and propose the Hierarchical Reinforcement Learning based task-driven Video Semantic Coding, named as HRLVSC.

Hierarchical Reinforcement Learning reinforcement-learning +3

Learned Lossless JPEG Transcoding via Joint Lossy and Residual Compression

no code implementations24 Aug 2022 Xiaoshuai Fan, Xin Li, Zhibo Chen

Our proposed transcoding architecture shows significant superiority in the compression of JPEG images thanks to the collaboration of learned lossy transform coding and residual entropy coding.

Image Compression

HST: Hierarchical Swin Transformer for Compressed Image Super-resolution

3 code implementations21 Aug 2022 Bingchen Li, Xin Li, Yiting Lu, Sen Liu, Ruoyu Feng, Zhibo Chen

Compressed Image Super-resolution has achieved great attention in recent years, where images are degraded with compression artifacts and low-resolution artifacts.

Compressed Image Super-resolution Image Super-Resolution

StyleAM: Perception-Oriented Unsupervised Domain Adaption for Non-reference Image Quality Assessment

no code implementations29 Jul 2022 Yiting Lu, Xin Li, Jianzhao Liu, Zhibo Chen

Specifically, we find a more compact and reliable space i. e., feature style space for perception-oriented UDA based on an interesting/amazing observation, that the feature style (i. e., the mean and variance) of the deep layer in DNNs is exactly associated with the quality score in NR-IQA.

Image Quality Assessment NR-IQA +1

Source-free Unsupervised Domain Adaptation for Blind Image Quality Assessment

no code implementations17 Jul 2022 Jianzhao Liu, Xin Li, Shukun An, Zhibo Chen

Thanks to the development of unsupervised domain adaptation (UDA), some works attempt to transfer the knowledge from a label-sufficient source domain to a label-free target domain under domain shift with UDA.

Blind Image Quality Assessment Unsupervised Domain Adaptation

RTN: Reinforced Transformer Network for Coronary CT Angiography Vessel-level Image Quality Assessment

no code implementations13 Jul 2022 Yiting Lu, Jun Fu, Xin Li, Wei Zhou, Sen Liu, Xinxin Zhang, Congfu Jia, Ying Liu, Zhibo Chen

Therefore, we propose a Progressive Reinforcement learning based Instance Discarding module (termed as PRID) to progressively remove quality-irrelevant/negative instances for CCTA VIQA.

Image Quality Assessment Multiple Instance Learning

Image Coding for Machines with Omnipotent Feature Learning

no code implementations5 Jul 2022 Ruoyu Feng, Xin Jin, Zongyu Guo, Runsen Feng, Yixin Gao, Tianyu He, Zhizheng Zhang, Simeng Sun, Zhibo Chen

Learning a kind of feature that is both general (for AI tasks) and compact (for compression) is pivotal for its success.

Self-Supervised Learning

SwinIQA: Learned Swin Distance for Compressed Image Quality Assessment

1 code implementation9 May 2022 Jianzhao Liu, Xin Li, Yanding Peng, Tao Yu, Zhibo Chen

In this paper, we design a full-reference image quality assessment metric SwinIQA to measure the perceptual quality of compressed images in a learned Swin distance space.

Compressed Image Quality Assessment Image Compression +1

Deep Frequency Filtering for Domain Generalization

no code implementations CVPR 2023 Shiqi Lin, Zhizheng Zhang, Zhipeng Huang, Yan Lu, Cuiling Lan, Peng Chu, Quanzeng You, Jiang Wang, Zicheng Liu, Amey Parulkar, Viraj Navkal, Zhibo Chen

Improving the generalization ability of Deep Neural Networks (DNNs) is critical for their practical uses, which has been a longstanding challenge.

Domain Generalization Retrieval

Active Token Mixer

2 code implementations11 Mar 2022 Guoqiang Wei, Zhizheng Zhang, Cuiling Lan, Yan Lu, Zhibo Chen

In this work, we propose an innovative token-mixer, dubbed Active Token Mixer (ATM), to actively incorporate flexible contextual information distributed across different channels from other tokens into the given query token.

Image Classification Instance Segmentation +2

Mask-based Latent Reconstruction for Reinforcement Learning

1 code implementation28 Jan 2022 Tao Yu, Zhizheng Zhang, Cuiling Lan, Yan Lu, Zhibo Chen

For deep reinforcement learning (RL) from pixels, learning effective state representations is crucial for achieving high performance.

reinforcement-learning Reinforcement Learning (RL) +1

Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI Tasks

no code implementations25 Jan 2022 Xin Jin, Ruoyu Feng, Simeng Sun, Runsen Feng, Tianyu He, Zhibo Chen

Traditional media coding schemes typically encode image/video into a semantic-unknown binary stream, which fails to directly support downstream intelligent tasks at the bitstream level.

Action Recognition Object +8

Confounder Identification-free Causal Visual Feature Learning

no code implementations26 Nov 2021 Xin Li, Zhizheng Zhang, Guoqiang Wei, Cuiling Lan, Wenjun Zeng, Xin Jin, Zhibo Chen

In this paper, we propose a novel Confounder Identification-free Causal Visual Feature Learning (CICF) method, which obviates the need for identifying confounders.

Domain Generalization Meta-Learning

A Close Look at Few-shot Real Image Super-resolution from the Distortion Relation Perspective

no code implementations25 Nov 2021 Xin Li, Xin Jin, Jun Fu, Xiaoyuan Yu, Bei Tong, Zhibo Chen

Under this brand-new scenario, we propose Distortion Relation guided Transfer Learning (DRTL) for the few-shot RealSR by transferring the rich restoration knowledge from auxiliary distortions (i. e., synthetic distortions) to the target RealSR under the guidance of distortion relation.

Image Restoration Image Super-Resolution +4

Meta Clustering Learning for Large-scale Unsupervised Person Re-identification

no code implementations19 Nov 2021 Xin Jin, Tianyu He, Xu Shen, Tongliang Liu, Xinchao Wang, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua

Unsupervised Person Re-identification (U-ReID) with pseudo labeling recently reaches a competitive performance compared to fully-supervised ReID methods based on modern clustering algorithms.

Clustering Unsupervised Person Re-Identification

Texture-enhanced Light Field Super-resolution with Spatio-Angular Decomposition Kernels

no code implementations7 Nov 2021 Zexi Hu, Xiaoming Chen, Henry Wing Fung Yeung, Yuk Ying Chung, Zhibo Chen

Despite the recent progress in light field super-resolution (LFSR) achieved by convolutional neural networks, the correlation information of light field (LF) images has not been sufficiently studied and exploited due to the complexity of 4D LF data.

Material Recognition Super-Resolution

Unleash the Potential of Adaptation Models via Dynamic Domain Labels

no code implementations29 Sep 2021 Xin Jin, Tianyu He, Xu Shen, Songhua Wu, Tongliang Liu, Xinchao Wang, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua

In this paper, we propose an embarrassing simple yet highly effective adversarial domain adaptation (ADA) method for effectively training models for alignment.

Domain Adaptation Memorization

ToAlign: Task-oriented Alignment for Unsupervised Domain Adaptation

1 code implementation NeurIPS 2021 Guoqiang Wei, Cuiling Lan, Wenjun Zeng, Zhizheng Zhang, Zhibo Chen

Unsupervised domain adaptive classifcation intends to improve the classifcation performance on unlabeled target domain.

Unsupervised Domain Adaptation

Task-driven Semantic Coding via Reinforcement Learning

1 code implementation7 Jun 2021 Xin Li, Jun Shi, Zhibo Chen

However, the traditional hybrid coding framework cannot be optimized in an end-to-end manner, which makes task-driven semantic fidelity metric unable to be automatically integrated into the rate-distortion optimization process.

Face Detection License Plate Detection +4

Image Super-Resolution Quality Assessment: Structural Fidelity Versus Statistical Naturalness

1 code implementation15 May 2021 Wei Zhou, Zhou Wang, Zhibo Chen

In this paper, we assess the quality of SISR generated images in a two-dimensional (2D) space of structural fidelity versus statistical naturalness.

Generative Adversarial Network Image Quality Assessment +1

Bayesian Graph Convolutional Network for Traffic Prediction

no code implementations1 Apr 2021 Jun Fu, Wei Zhou, Zhibo Chen

Under this framework, the graph structure is viewed as a random realization from a parametric generative model, and its posterior is inferred using the observed topology of the road network and traffic data.

Traffic Prediction

Cloth-Changing Person Re-identification from A Single Image with Gait Prediction and Regularization

1 code implementation CVPR 2022 Xin Jin, Tianyu He, Kecheng Zheng, Zhiheng Yin, Xu Shen, Zhen Huang, Ruoyu Feng, Jianqiang Huang, Xian-Sheng Hua, Zhibo Chen

Specifically, we introduce Gait recognition as an auxiliary task to drive the Image ReID model to learn cloth-agnostic representations by leveraging personal unique and cloth-independent gait information, we name this framework as GI-ReID.

Cloth-Changing Person Re-Identification Computational Efficiency +1

Disentanglement-based Cross-Domain Feature Augmentation for Effective Unsupervised Domain Adaptive Person Re-identification

no code implementations25 Mar 2021 Zhizheng Zhang, Cuiling Lan, Wenjun Zeng, Quanzeng You, Zicheng Liu, Kecheng Zheng, Zhibo Chen

Each recomposed feature, obtained based on the domain-invariant feature (which enables a reliable inheritance of identity) and an enhancement from a domain specific feature (which enables the approximation of real distributions), is thus an "ideal" augmentation.

Disentanglement Domain Adaptive Person Re-Identification +2

MetaAlign: Coordinating Domain Alignment and Classification for Unsupervised Domain Adaptation

1 code implementation CVPR 2021 Guoqiang Wei, Cuiling Lan, Wenjun Zeng, Zhibo Chen

For unsupervised domain adaptation (UDA), to alleviate the effect of domain shift, many approaches align the source and target domains in the feature space by adversarial learning or by explicitly aligning their statistics.

Classification General Classification +5

Re-energizing Domain Discriminator with Sample Relabeling for Adversarial Domain Adaptation

no code implementations ICCV 2021 Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen

Many unsupervised domain adaptation (UDA) methods exploit domain adversarial training to align the features to reduce domain gap, where a feature extractor is trained to fool a domain discriminator in order to have aligned feature distributions.

Unsupervised Domain Adaptation

Local Patch AutoAugment with Multi-Agent Collaboration

2 code implementations20 Mar 2021 Shiqi Lin, Tao Yu, Ruoyu Feng, Xin Li, Xin Jin, Zhibo Chen

We formulate it as a multi-agent reinforcement learning (MARL) problem, where each agent learns an augmentation policy for each patch based on its content together with the semantics of the whole image.

Data Augmentation Fine-Grained Image Recognition +2

Dense Interaction Learning for Video-based Person Re-identification

no code implementations ICCV 2021 Tianyu He, Xin Jin, Xu Shen, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua

The CNN encoder is responsible for efficiently extracting discriminative spatial features while the DI decoder is designed to densely model spatial-temporal inherent interaction across frames.

Video-Based Person Re-Identification

No-Reference Quality Assessment for 360-degree Images by Analysis of Multi-frequency Information and Local-global Naturalness

no code implementations22 Feb 2021 Wei Zhou, Jiahua Xu, Qiuping Jiang, Zhibo Chen

To our knowledge, the proposed model is the first no-reference quality assessment method for 360-degreee images that combines multi-frequency information and image naturalness.

Image Quality Assessment

Image-to-Image Translation: Methods and Applications

no code implementations21 Jan 2021 Yingxue Pang, Jianxin Lin, Tao Qin, Zhibo Chen

Image-to-image translation (I2I) aims to transfer images from a source domain to a target domain while preserving the content representations.

Image-to-Image Translation Pose Estimation +2

Style Normalization and Restitution for Domain Generalization and Adaptation

1 code implementation3 Jan 2021 Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen

In this paper, we design a novel Style Normalization and Restitution module (SNR) to simultaneously ensure both high generalization and discrimination capability of the networks.

Disentanglement Domain Generalization +4

Learned Block-based Hybrid Image Compression

no code implementations17 Dec 2020 Yaojun Wu, Xin Li, Zhizheng Zhang, Xin Jin, Zhibo Chen

Recent works on learned image compression perform encoding and decoding processes in a full-resolution manner, resulting in two problems when deployed for practical applications.

Blocking Image Compression +2

Learning Omni-frequency Region-adaptive Representations for Real Image Super-Resolution

no code implementations11 Dec 2020 Xin Li, Xin Jin, Tao Yu, Yingxue Pang, Simeng Sun, Zhizheng Zhang, Zhibo Chen

Traditional single image super-resolution (SISR) methods that focus on solving single and uniform degradation (i. e., bicubic down-sampling), typically suffer from poor performance when applied into real-world low-resolution (LR) images due to the complicated realistic degradations.

Image Super-Resolution

Deep Multi-Scale Features Learning for Distorted Image Quality Assessment

no code implementations1 Dec 2020 Wei Zhou, Zhibo Chen

In this paper, motivated by the human visual system (HVS) combining multi-scale features for perception, we propose to use pyramid features learning to build a DNN with hierarchical multi-scale features for distorted image quality prediction.

Image Quality Assessment

Causal Contextual Prediction for Learned Image Compression

no code implementations19 Nov 2020 Zongyu Guo, Zhizheng Zhang, Runsen Feng, Zhibo Chen

In this paper, we propose the concept of separate entropy coding to leverage a serial decoding process for causal contextual entropy prediction in the latent space.

Image Compression MS-SSIM +1

Improving Machine Reading Comprehension with Single-choice Decision and Transfer Learning

no code implementations6 Nov 2020 Yufan Jiang, Shuangzhi Wu, Jing Gong, Yahui Cheng, Peng Meng, Weiliang Lin, Zhibo Chen, Mu Li

In addition, by transferring knowledge from other kinds of MRC tasks, our model achieves a new state-of-the-art results in both single and ensemble settings.

AutoML Binary Classification +2

Bayesian Spatio-Temporal Graph Convolutional Network for Traffic Forecasting

no code implementations15 Oct 2020 Jun Fu, Wei Zhou, Zhibo Chen

The graph structure in our network is learned from the physical topology of the road network and traffic data in an end-to-end manner, which discovers a more accurate description of the relationship among traffic flows.

Traffic Prediction

Uncertainty-Aware Few-Shot Image Classification

no code implementations9 Oct 2020 Zhizheng Zhang, Cuiling Lan, Wenjun Zeng, Zhibo Chen, Shih-Fu Chang

In this work, we propose Uncertainty-Aware Few-Shot framework for image classification by modeling uncertainty of the similarities of query-support pairs and performing uncertainty-aware optimization.

Classification Few-Shot Image Classification +3

FAN: Frequency Aggregation Network for Real Image Super-resolution

no code implementations30 Sep 2020 Yingxue Pang, Xin Li, Xin Jin, Yaojun Wu, Jianzhao Liu, Sen Liu, Zhibo Chen

Specifically, we extract different frequencies of the LR image and pass them to a channel attention-grouped residual dense network (CA-GRDB) individually to output corresponding feature maps.

Image Super-Resolution SSIM

LIRA: Lifelong Image Restoration from Unknown Blended Distortions

no code implementations ECCV 2020 Jianzhao Liu, Jianxin Lin, Xin Li, Wei Zhou, Sen Liu, Zhibo Chen

Most existing image restoration networks are designed in a disposable way and catastrophically forget previously learned distortions when trained on a new distortion removal task.

Image Restoration SSIM

Feature Alignment and Restoration for Domain Generalization and Adaptation

no code implementations22 Jun 2020 Xin Jin, Cuiling Lan, Wen-Jun Zeng, Zhibo Chen

To ensure high discrimination, we propose a Feature Restoration (FR) operation to distill task-relevant features from the residual information and use them to compensate for the aligned features.

Disentanglement Domain Generalization +1

Beyond Triplet Loss: Meta Prototypical N-tuple Loss for Person Re-identification

no code implementations8 Jun 2020 Zhizheng Zhang, Cuiling Lan, Wen-Jun Zeng, Zhibo Chen, Shih-Fu Chang

There is a lack of loss design which enables the joint optimization of multiple instances (of multiple classes) within per-query optimization for person ReID.

Classification General Classification +3

Residual Squeeze-and-Excitation Network for Fast Image Deraining

1 code implementation1 Jun 2020 Jun Fu, Jianfeng Xu, Kazuyuki Tasaka, Zhibo Chen

Image deraining is an important image processing task as rain streaks not only severely degrade the visual quality of images but also significantly affect the performance of high-level vision tasks.

Rain Removal

Global Distance-distributions Separation for Unsupervised Person Re-identification

no code implementations ECCV 2020 Xin Jin, Cuiling Lan, Wen-Jun Zeng, Zhibo Chen

To address this problem, we introduce a global distance-distributions separation (GDS) constraint over the two distributions to encourage the clear separation of positive and negative samples from a global view.

Domain Adaptation POS +1

Multi-scale Grouped Dense Network for VVC Intra Coding

no code implementations16 May 2020 Xin Li, Simeng Sun, Zhizheng Zhang, Zhibo Chen

Versatile Video Coding (H. 266/VVC) standard achieves better image quality when keeping the same bits than any other conventional image codec, such as BPG, JPEG, and etc.

Generative Adversarial Network

Blind Quality Assessment for Image Superresolution Using Deep Two-Stream Convolutional Networks

no code implementations13 Apr 2020 Wei Zhou, Qiuping Jiang, Yuwang Wang, Zhibo Chen, Weiping Li

Numerous image superresolution (SR) algorithms have been proposed for reconstructing high-resolution (HR) images from input images with lower spatial resolutions.

Image Quality Assessment

TuiGAN: Learning Versatile Image-to-Image Translation with Two Unpaired Images

1 code implementation ECCV 2020 Jianxin Lin, Yingxue Pang, Yingce Xia, Zhibo Chen, Jiebo Luo

With TuiGAN, an image is translated in a coarse-to-fine manner where the generated image is gradually refined from global structures to local details.

Translation Unsupervised Image-To-Image Translation +1

Multi-Granularity Reference-Aided Attentive Feature Aggregation for Video-based Person Re-identification

no code implementations CVPR 2020 Zhizheng Zhang, Cuiling Lan, Wen-Jun Zeng, Zhibo Chen

In this paper, we propose an attentive feature aggregation module, namely Multi-Granularity Reference-aided Attentive Feature Aggregation (MG-RAFA), to delicately aggregate spatio-temporal features into a discriminative video-level feature representation.

Video-Based Person Re-Identification

Prior-enlightened and Motion-robust Video Deblurring

no code implementations25 Mar 2020 Ya Zhou, Jianfeng Xu, Kazuyuki Tasaka, Zhibo Chen, Weiping Li

Various blur distortions in video will cause negative impact on both human viewing and video-based applications, which makes motion-robust deblurring methods urgently needed.

Deblurring

SASL: Saliency-Adaptive Sparsity Learning for Neural Network Acceleration

no code implementations12 Mar 2020 Jun Shi, Jianfeng Xu, Kazuyuki Tasaka, Zhibo Chen

Accelerating the inference speed of CNNs is critical to their deployment in real-world applications.

Uncertainty-Aware Multi-Shot Knowledge Distillation for Image-Based Object Re-Identification

no code implementations15 Jan 2020 Xin Jin, Cuiling Lan, Wen-Jun Zeng, Zhibo Chen

To the best of our knowledge, we are the first to make use of multi-shots of an object in a teacher-student learning manner for effectively boosting the single image based re-id.

Knowledge Distillation Object

Generative Memorize-Then-Recall framework for low bit-rate Surveillance Video Compression

no code implementations30 Dec 2019 Yaojun Wu, Tianyu He, Zhibo Chen

In this paper, we figure out this issue by disentangling surveillance video into the structure of a global spatio-temporal feature (memory) for Group of Picture (GoP) and skeleton for each frame (clue).

Generative Adversarial Network Motion Compensation +1

Region Normalization for Image Inpainting

1 code implementation23 Nov 2019 Tao Yu, Zongyu Guo, Xin Jin, Shilin Wu, Zhibo Chen, Weiping Li, Zhizheng Zhang, Sen Liu

In this work, we show that the mean and variance shifts caused by full-spatial FN limit the image inpainting network training and we propose a spatial region-wise normalization named Region Normalization (RN) to overcome the limitation.

Image Inpainting

Reinforced Bit Allocation under Task-Driven Semantic Distortion Metrics

no code implementations16 Oct 2019 Jun Shi, Zhibo Chen

Rapid growing intelligent applications require optimized bit allocation in image/video coding to support specific task-driven scenarios such as detection, classification, segmentation, etc.

General Classification Quantization +1

Tensor Oriented No-Reference Light Field Image Quality Assessment

no code implementations5 Sep 2019 Wei Zhou, Likun Shi, Zhibo Chen, Jinglin Zhang

Light field image (LFI) quality assessment is becoming more and more important, which helps to better guide the acquisition, processing and application of immersive media.

Image Quality Assessment

Binocular Rivalry Oriented Predictive Auto-Encoding Network for Blind Stereoscopic Image Quality Measurement

1 code implementation4 Sep 2019 Jiahua Xu, Wei Zhou, Zhibo Chen, Suiyi Ling, Patrick Le Callet

Stereoscopic image quality measurement (SIQM) has become increasingly important for guiding stereo image processing and commutation systems due to the widespread usage of 3D contents.

Multimedia Image and Video Processing

No-Reference Light Field Image Quality Assessment Based on Spatial-Angular Measurement

no code implementations17 Aug 2019 Likun Shi, Wei Zhou, Zhibo Chen, Jinglin Zhang

In this paper, we propose a No-Reference Light Field image Quality Assessment (NR-LFQA) scheme, where the main idea is to quantify the LFI quality degradation through evaluating the spatial quality and angular consistency.

Image Quality Assessment

Progressive Image Inpainting with Full-Resolution Residual Network

2 code implementations24 Jul 2019 Zongyu Guo, Zhibo Chen, Tao Yu, Jiale Chen, Sen Liu

Recently, learning-based algorithms for image inpainting achieve remarkable progress dealing with squared or irregular holes.

Image Inpainting

Stereoscopic Omnidirectional Image Quality Assessment Based on Predictive Coding Theory

no code implementations12 Jun 2019 Zhibo Chen, Jiahua Xu, Chaoyi Lin, Wei Zhou

In this paper, based on the predictive coding theory of the human vision system (HVS), we propose a stereoscopic omnidirectional image quality evaluator (SOIQE) to cope with the characteristics of 3D 360-degree images.

Image Quality Assessment

A Coarse-to-Fine Framework for Learned Color Enhancement with Non-Local Attention

no code implementations8 Jun 2019 Chaowei Shan, Zhizheng Zhang, Zhibo Chen

For current learned methods in this field, global harmonious perception and local details are hard to be well-considered in a single model simultaneously.

Learning to Transfer: Unsupervised Meta Domain Translation

1 code implementation1 Jun 2019 Jianxin Lin, Yijun Wang, Tianyu He, Zhibo Chen

Unsupervised domain translation has recently achieved impressive performance with Generative Adversarial Network (GAN) and sufficient (unpaired) training data.

Generative Adversarial Network Meta-Learning +1

Semantics-Aligned Representation Learning for Person Re-identification

1 code implementation30 May 2019 Xin Jin, Cuiling Lan, Wen-Jun Zeng, Guoqiang Wei, Zhibo Chen

Specifically, we build a Semantics Aligning Network (SAN) which consists of a base network as encoder (SA-Enc) for re-ID, and a decoder (SA-Dec) for reconstructing/regressing the densely semantics aligned full texture image.

Person Re-Identification Representation Learning +1

Image-to-Image Translation with Multi-Path Consistency Regularization

no code implementations29 May 2019 Jianxin Lin, Yingce Xia, Yijun Wang, Tao Qin, Zhibo Chen

In this work, we introduce a new kind of loss, multi-path consistency loss, which evaluates the differences between direct translation $\mathcal{D}_s\to\mathcal{D}_t$ and indirect translation $\mathcal{D}_s\to\mathcal{D}_a\to\mathcal{D}_t$ with $\mathcal{D}_a$ as an auxiliary domain, to regularize training.

Face to Face Translation Image-to-Image Translation +1

Towards Accurate One-Stage Object Detection with AP-Loss

1 code implementation CVPR 2019 Kean Chen, Jianguo Li, Weiyao Lin, John See, Ji Wang, Ling-Yu Duan, Zhibo Chen, Changwei He, Junni Zou

For this purpose, we develop a novel optimization algorithm, which seamlessly combines the error-driven update scheme in perceptron learning and backpropagation algorithm in deep networks.

Classification General Classification +3

Relation-Aware Global Attention for Person Re-identification

1 code implementation CVPR 2020 Zhizheng Zhang, Cuiling Lan, Wen-Jun Zeng, Xin Jin, Zhibo Chen

For person re-identification (re-id), attention mechanisms have become attractive as they aim at strengthening discriminative features and suppressing irrelevant ones, which matches well the key of re-id, i. e., discriminative feature learning.

Clustering Image Classification +3

Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments

1 code implementation3 Mar 2019 Zhizheng Zhang, Jiale Chen, Zhibo Chen, Weiping Li

Not limited to the control tasks in computationally complex environments, AE-DDPG also achieves higher rewards and 2- to 4-fold improvement in sample efficiency on average compared to other variants of DDPG in MuJoCo environments.

Continuous Control Reinforcement Learning (RL)

View Invariant 3D Human Pose Estimation

no code implementations30 Jan 2019 Guoqiang Wei, Cuiling Lan, Wen-Jun Zeng, Zhibo Chen

The diversity of capturing viewpoints and the flexibility of the human poses, however, remain some significant challenges.

3D Human Pose Estimation 3D Pose Estimation

Learning based Facial Image Compression with Semantic Fidelity Metric

no code implementations25 Dec 2018 Zhibo Chen, Tianyu He

The experimental results verify the framework's efficiency by demonstrating performance improvement of 71. 41%, 48. 28% and 52. 67% bitrate saving separately over JPEG2000, WebP and neural network-based codecs under the same face verification accuracy distortion metric.

Face Recognition Face Verification +2

Sequential Gating Ensemble Network for Noise Robust Multi-Scale Face Restoration

1 code implementation19 Dec 2018 Zhibo Chen, Jianxin Lin, Tiankuang Zhou, Feng Wu

The SGU sequentially takes information from two different levels as inputs and decides the output based on one active input.

Ensemble Learning Image Restoration

Unsupervised Single Image Deraining with Self-supervised Constraints

no code implementations21 Nov 2018 Xin Jin, Zhibo Chen, Jianxin Lin, Zhikai Chen, Wei Zhou

Most existing single image deraining methods require learning supervised models from a large set of paired synthetic training data, which limits their generality, scalability and practicality in real-world multimedia applications.

Benchmarking Generative Adversarial Network +1

DeepIR: A Deep Semantics Driven Framework for Image Retargeting

no code implementations19 Nov 2018 Jianxin Lin, Tiankuang Zhou, Zhibo Chen

We present \emph{Deep Image Retargeting} (\emph{DeepIR}), a coarse-to-fine framework for content-aware image retargeting.

Image Retargeting

Distribution Discrepancy Maximization for Image Privacy Preserving

no code implementations18 Nov 2018 Sen Liu, Jianxin Lin, Zhibo Chen

Accordingly, we introduce a collaborative training scheme: a discriminator $D$ is trained to discriminate the reconstructed image from the encrypted image, and an encryption model $G_e$ is required to generate these two kinds of images to maximize the recognition rate of $D$, leading to the same training objective for both $D$ and $G_e$.

Privacy Preserving

Multi-Scale Face Restoration with Sequential Gating Ensemble Network

no code implementations6 May 2018 Jianxin Lin, Tiankuang Zhou, Zhibo Chen

Experiment results demonstrate that our SGEN is more effective at multi-scale human face restoration with more image details and less noise than state-of-the-art image restoration models.

Ensemble Learning Face Recognition +1

Conditional Image-to-Image Translation

no code implementations CVPR 2018 Jianxin Lin, Yingce Xia, Tao Qin, Zhibo Chen, Tie-Yan Liu

In this paper, we study a new problem, conditional image-to-image translation, which is to translate an image from the source domain to the target domain conditioned on a given image in the target domain.

Image-to-Image Translation Translation

Learning for Video Compression

no code implementations26 Apr 2018 Zhibo Chen, Tianyu He, Xin Jin, Feng Wu

One key challenge to learning-based video compression is that motion predictive coding, a very effective tool for video compression, can hardly be trained into a neural network.

Multimedia Image and Video Processing

Cannot find the paper you are looking for? You can Submit a new open access paper.