Search Results for author: Wen Gao

Found 93 papers, 30 papers with code

ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation

3 code implementations • 23 Dec 2021 • Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, dianhai yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang

A unified framework named ERNIE 3. 0 was recently proposed for pre-training large-scale knowledge enhanced models and trained a model with 10 billion parameters.

Language Modelling

11,411

Paper
Code

Instance-Aware Dynamic Neural Network Quantization

4 code implementations • CVPR 2022 • Zhenhua Liu, Yunhe Wang, Kai Han, Siwei Ma, Wen Gao

However, natural images are of huge diversity with abundant content and using such a universal quantization configuration for all samples is not an optimal strategy.

Quantization

1,111

Paper
Code

Person Transfer GAN to Bridge Domain Gap for Person Re-Identification

25 code implementations • CVPR 2018 • Longhui Wei, Shiliang Zhang, Wen Gao, Qi Tian

Although the performance of person Re-Identification (ReID) has been significantly boosted, many challenging issues in real scenarios have not been fully investigated, e. g., the complex scenes and lighting variations, viewpoint and pose changes, and the large number of identities in a camera network.

Ranked #11 on Unsupervised Person Re-Identification on DukeMTMC-reID (Rank-10 metric)

Generative Adversarial Network Person Re-Identification +1

463

Paper
Code

Pre-Trained Image Processing Transformer

6 code implementations • CVPR 2021 • Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, Wen Gao

To maximally excavate the capability of transformer, we present to utilize the well-known ImageNet benchmark for generating a large amount of corrupted image pairs.

Ranked #1 on Single Image Deraining on Rain100L (using extra training data)

Color Image Denoising Contrastive Learning +2

403

Paper
Code

HiFaceGAN: Face Renovation via Collaborative Suppression and Replenishment

5 code implementations • 11 May 2020 • Lingbo Yang, Chang Liu, Pan Wang, Shanshe Wang, Peiran Ren, Siwei Ma, Wen Gao

Existing face restoration researches typically relies on either the degradation prior or explicit guidance labels for training, which often results in limited generalization ability over real-world images with heterogeneous degradations and rich background contents.

Ranked #1 on Image Super-Resolution on FFHQ 256 x 256 - 4x upscaling

Blind Face Restoration Face Hallucination +3

282

Paper
Code

Implicit Subspace Prior Learning for Dual-Blind Face Restoration

1 code implementation • 12 Oct 2020 • Lingbo Yang, Pan Wang, Zhanning Gao, Shanshe Wang, Peiran Ren, Siwei Ma, Wen Gao

Face restoration is an inherently ill-posed problem, where additional prior constraints are typically considered crucial for mitigating such pathology.

Blind Face Restoration

282

Paper
Code

Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey

1 code implementation • 20 Feb 2023 • Xiao Wang, Guangyao Chen, Guangwu Qian, Pengcheng Gao, Xiao-Yong Wei, YaoWei Wang, Yonghong Tian, Wen Gao

We also give visualization and analysis of the model parameters and results on representative downstream tasks.

250

Paper
Code

P-STMO: Pre-Trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation

1 code implementation • 15 Mar 2022 • Wenkang Shan, Zhenhua Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

In Stage II, the pre-trained encoder is loaded to STMO model and fine-tuned.

Ranked #10 on Monocular 3D Human Pose Estimation on Human3.6M

Denoising Monocular 3D Human Pose Estimation

138

Paper
Code

Diffusion-Based 3D Human Pose Estimation with Multi-Hypothesis Aggregation

1 code implementation • ICCV 2023 • Wenkang Shan, Zhenhua Liu, Xinfeng Zhang, Zhao Wang, Kai Han, Shanshe Wang, Siwei Ma, Wen Gao

On the other hand, JPMA is proposed to assemble multiple hypotheses generated by D3DP into a single 3D pose for practical use.

Ranked #2 on Multi-Hypotheses 3D Human Pose Estimation on Human3.6M

3D Pose Estimation Monocular 3D Human Pose Estimation +1

128

Paper
Code

Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation

1 code implementation • 29 Jul 2021 • Wenkang Shan, Haopeng Lu, Shanshe Wang, Xinfeng Zhang, Wen Gao

To alleviate these two problems, we propose a relative information encoding method that yields positional and temporal enhanced representations.

Ranked #13 on Monocular 3D Human Pose Estimation on Human3.6M

Monocular 3D Human Pose Estimation

Paper
Code

A Survey on Temporal Knowledge Graph Completion: Taxonomy, Progress, and Prospects

1 code implementation • 4 Aug 2023 • Jiapu Wang, Boyue Wang, Meikang Qiu, Shirui Pan, Bo Xiong, Heng Liu, Linhao Luo, Tengfei Liu, Yongli Hu, BaoCai Yin, Wen Gao

Temporal characteristics are prominently evident in a substantial volume of knowledge, which underscores the pivotal role of Temporal Knowledge Graphs (TKGs) in both academia and industry.

Missing Elements Temporal Knowledge Graph Completion

Paper
Code

Towards End-to-End Image Compression and Analysis with Transformers

1 code implementation • 17 Dec 2021 • Yuanchao Bai, Xu Yang, Xianming Liu, Junjun Jiang, YaoWei Wang, Xiangyang Ji, Wen Gao

Meanwhile, we propose a feature aggregation module to fuse the compressed features with the selected intermediate features of the Transformer, and feed the aggregated features to a deconvolutional neural network for image reconstruction.

Classification Image Classification +3

Paper
Code

MAU: A Motion-Aware Unit for Video Prediction and Beyond

1 code implementation • NeurIPS 2021 • Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Yan Ye, Xiang Xinguang, Wen Gao

The attention module aims to learn an attention map based on the correlations between the current spatial state and the historical spatial states.

Ranked #18 on Video Prediction on Moving MNIST

Action Recognition Video Prediction

Paper
Code

Group-based Sparse Representation for Image Restoration

1 code implementation • 14 May 2014 • Jian Zhang, Debin Zhao, Wen Gao

In this paper, instead of using patch as the basic unit of sparse representation, we exploit the concept of group as the basic unit of sparse representation, which is composed of nonlocal patches with similar structures, and establish a novel sparse representation modeling of natural images, called group-based sparse representation (GSR).

Compressive Sensing Deblurring +4

Paper
Code

SACNN: Self-Attention Convolutional Neural Network for Low-Dose CT Denoising With Self-Supervised Perceptual Loss Network

1 code implementation • IEEE Transactions on Medical Imaging 2020 • Meng Li, William Hsu, Xiaodong Xie, Jason Cong, Wen Gao

We combine these two methods and demonstrate their effectiveness on both CNN-based neural networks and WGAN-based neural networks with comprehensive experiments.

Computed Tomography (CT) Denoising +1

Paper
Code

Direct Speech-to-image Translation

1 code implementation • 7 Apr 2020 • Jiguo Li, Xinfeng Zhang, Chuanmin Jia, Jizheng Xu, Li Zhang, Yue Wang, Siwei Ma, Wen Gao

In this paper, we attempt to translate the speech signals into the image signals without the transcription stage.

Multimedia Sound Audio and Speech Processing

Paper
Code

Universal Adversarial Perturbations Generative Network for Speaker Recognition

1 code implementation • 7 Apr 2020 • Jiguo Li, Xinfeng Zhang, Chuanmin Jia, Jizheng Xu, Li Zhang, Yue Wang, Siwei Ma, Wen Gao

Attacking deep learning based biometric systems has drawn more and more attention with the wide deployment of fingerprint/face/speaker recognition systems, given the fact that the neural networks are vulnerable to the adversarial examples, which have been intentionally perturbed to remain almost imperceptible for human.

Speaker Recognition

Paper
Code

Iterative Network for Image Super-Resolution

1 code implementation • 20 May 2020 • Yuqing Liu, Shiqi Wang, Jian Zhang, Shanshe Wang, Siwei Ma, Wen Gao

A novel iterative super-resolution network (ISRN) is proposed on top of the iterative optimization.

Image Super-Resolution SSIM

Paper
Code

Segatron: Segment-Aware Transformer for Language Modeling and Understanding

1 code implementation • 30 Apr 2020 • He Bai, Peng Shi, Jimmy Lin, Yuqing Xie, Luchen Tan, Kun Xiong, Wen Gao, Ming Li

To verify this, we propose a segment-aware Transformer (Segatron), by replacing the original token position encoding with a combined position encoding of paragraph, sentence, and token.

Ranked #20 on Language Modelling on WikiText-103

Language Modelling Masked Language Modeling +3

Paper
Code

STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction

1 code implementation • CVPR 2022 • Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

In this paper, we propose a Spatiotemporal Residual Predictive Model (STRPM) for high-resolution video prediction.

4k Video Prediction +1

Paper
Code

STIP: A SpatioTemporal Information-Preserving and Perception-Augmented Model for High-Resolution Video Prediction

1 code implementation • 9 Jun 2022 • Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

To solve the information loss problem, the proposed model aims to preserve the spatiotemporal information for videos during the feature extraction and the state transitions, respectively.

Video Prediction

Paper
Code

Learning to fool the speaker recognition

1 code implementation • 7 Apr 2020 • Jiguo Li, Xinfeng Zhang, Jizheng Xu, Li Zhang, Yue Wang, Siwei Ma, Wen Gao

Due to the widespread deployment of fingerprint/face/speaker recognition systems, attacking deep learning based biometric systems has drawn more and more attention.

Audio and Speech Processing Cryptography and Security Sound

Paper
Code

Region-adaptive Texture Enhancement for Detailed Person Image Synthesis

1 code implementation • 26 May 2020 • Lingbo Yang, Pan Wang, Xinfeng Zhang, Shanshe Wang, Zhanning Gao, Peiran Ren, Xuansong Xie, Siwei Ma, Wen Gao

The ability to produce convincing textural details is essential for the fidelity of synthesized person images.

Ranked #4 on Pose Transfer on Deep-Fashion

Pose Transfer

Paper
Code

Lightweight super resolution network for point cloud geometry compression

1 code implementation • 2 Nov 2023 • Wei zhang, Dingquan Li, Ge Li, Wen Gao

This paper presents an approach for compressing point cloud geometry by leveraging a lightweight super-resolution network.

Point cloud reconstruction Point Cloud Super Resolution +1

Paper
Code

Deep Lossy Plus Residual Coding for Lossless and Near-lossless Image Compression

1 code implementation • 11 Sep 2022 • Yuanchao Bai, Xianming Liu, Kai Wang, Xiangyang Ji, Xiaolin Wu, Wen Gao

In the lossless mode, the DLPR coding system first performs lossy compression and then lossless coding of residuals.

Image Compression

Paper
Code

Perceptual Video Coding for Machines via Satisfied Machine Ratio Modeling

1 code implementation • 13 Nov 2022 • Qi Zhang, Shanshe Wang, Xinfeng Zhang, Chuanmin Jia, Zhao Wang, Siwei Ma, Wen Gao

Each score is derived from machine perceptual differences between original and compressed images.

Image Classification object-detection +3

Paper
Code

Textural-Perceptual Joint Learning for No-Reference Super-Resolution Image Quality Assessment

1 code implementation • 27 May 2022 • Yuqing Liu, Qi Jia, Shanshe Wang, Siwei Ma, Wen Gao

Image super-resolution (SR) has been widely investigated in recent years.

Image Quality Assessment Image Super-Resolution

Paper
Code

Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation with GPT2

1 code implementation • ACL 2021 • He Bai, Peng Shi, Jimmy Lin, Luchen Tan, Kun Xiong, Wen Gao, Jie Liu, Ming Li

Experimental results show that the Chinese GPT2 can generate better essay endings with \eop.

Language Modelling Story Generation

Paper
Code

Learning Weighting Map for Bit-Depth Expansion within a Rational Range

1 code implementation • 26 Apr 2022 • Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

Existing BDE methods have no unified solution for various BDE situations, and directly learn a mapping for each pixel from LBD image to the desired value in HBD image, which may change the given high-order bits and lead to a huge deviation from the ground truth.

SSIM

Paper
Code

Globally Variance-Constrained Sparse Representation and Its Application in Image Set Coding

no code implementations • 17 Aug 2016 • Xiang Zhang, Jiarui Sun, Siwei Ma, Zhouchen Lin, Jian Zhang, Shiqi Wang, Wen Gao

Therefore, introducing an accurate rate-constraint in sparse coding and dictionary learning becomes meaningful, which has not been fully exploited in the context of sparse representation.

Data Compression Dictionary Learning

Paper
Add Code

Graph-Based Blind Image Deblurring From a Single Photograph

no code implementations • 22 Feb 2018 • Yuanchao Bai, Gene Cheung, Xian-Ming Liu, Wen Gao

We leverage the new graph spectral interpretation for RGTV to design an efficient algorithm that solves for the skeleton image and the blur kernel alternately.

Blind Image Deblurring Image Deblurring

Paper
Add Code

Blind Image Deblurring via Reweighted Graph Total Variation

no code implementations • 24 Dec 2017 • Yuanchao Bai, Gene Cheung, Xian-Ming Liu, Wen Gao

The problem can be solved in two parts: i) estimate a blur kernel from the blurry image, and ii) given estimated blur kernel, de-convolve blurry input to restore the target image.

Blind Image Deblurring Image Deblurring

Paper
Add Code

LVreID: Person Re-Identification with Long Sequence Videos

no code implementations • 20 Dec 2017 • Jianing Li, Shiliang Zhang, Jingdong Wang, Wen Gao, Qi Tian

This paper mainly establishes a large-scale Long sequence Video database for person re-IDentification (LVreID).

Person Re-Identification

Paper
Add Code

AI Oriented Large-Scale Video Management for Smart City: Technologies, Standards and Beyond

no code implementations • 5 Dec 2017 • Ling-Yu Duan, Yihang Lou, Shiqi Wang, Wen Gao, Yong Rui

To practically facilitate deep neural network models in the large-scale video analysis, there are still unprecedented challenges for the large-scale video data management.

Management

Paper
Add Code

A Bio-Inspired Multi-Exposure Fusion Framework for Low-light Image Enhancement

no code implementations • 2 Nov 2017 • Zhenqiang Ying, Ge Li, Wen Gao

Inspired by human visual system, we design a multi-exposure fusion framework for low-light image enhancement.

Low-Light Image Enhancement

Paper
Add Code

Pose-driven Deep Convolutional Model for Person Re-identification

no code implementations • ICCV 2017 • Chi Su, Jianing Li, Shiliang Zhang, Junliang Xing, Wen Gao, Qi Tian

Our deep architecture explicitly leverages the human part cues to alleviate the pose variations and learn robust feature representations from both the global image and different local parts.

Ranked #105 on Person Re-Identification on Market-1501

Person Re-Identification

Paper
Add Code

GLAD: Global-Local-Alignment Descriptor for Pedestrian Retrieval

no code implementations • 13 Sep 2017 • Longhui Wei, Shiliang Zhang, Hantao Yao, Wen Gao, Qi Tian

Targeting to solve these problems, this work proposes a Global-Local-Alignment Descriptor (GLAD) and an efficient indexing and retrieval framework, respectively.

Ranked #93 on Person Re-Identification on Market-1501

Person Re-Identification Representation Learning +1

Paper
Add Code

Performance Guaranteed Network Acceleration via High-Order Residual Quantization

no code implementations • ICCV 2017 • Zefan Li, Bingbing Ni, Wenjun Zhang, Xiaokang Yang, Wen Gao

Input binarization has shown to be an effective way for network acceleration.

Binarization Quantization +1

Paper
Add Code

Beyond Monte Carlo Tree Search: Playing Go with Deep Alternative Neural Network and Long-Term Evaluation

no code implementations • 13 Jun 2017 • Jinzhuo Wang, Wenmin Wang, Ronggang Wang, Wen Gao

We show such setting can preserve more contexts of local features and its evolutions which are beneficial for move prediction.

Paper
Add Code

An Attention-Driven Approach of No-Reference Image Quality Assessment

no code implementations • 12 Dec 2016 • Diqi Chen, Yizhou Wang, Tianfu Wu, Wen Gao

The model learning is implemented by a reinforcement strategy, in which the rewards of both tasks guide the learning of the optimal sampling policy to acquire the "task-informative" image regions so that the predictions can be made accurately and efficiently (in terms of the sampling steps).

Multi-Task Learning No-Reference Image Quality Assessment +2

Paper
Add Code

Compact Descriptors for Video Analysis: the Emerging MPEG Standard

no code implementations • 26 Apr 2017 • Ling-Yu Duan, Vijay Chandrasekhar, Shiqi Wang, Yihang Lou, Jie Lin, Yan Bai, Tiejun Huang, Alex ChiChung Kot, Wen Gao

This paper provides an overview of the on-going compact descriptors for video analysis standard (CDVA) from the ISO/IEC moving pictures experts group (MPEG).

Paper
Add Code

Learning a collaborative multiscale dictionary based on robust empirical mode decomposition

no code implementations • 4 Apr 2017 • Rui Chen, Huizhu Jia, Xiaodong Xie, Wen Gao

The multiscale dictionary is considered as the product of oscillating dictionary and tolerance dictionary.

Clustering Dictionary Learning +1

Paper
Add Code

Local Patch Encoding-Based Method for Single Image Super-Resolution

no code implementations • 12 Mar 2017 • Yang Zhao, Ronggang Wang, Wei Jia, Jianchao Yang, Wenmin Wang, Wen Gao

The proposed method consists of a learning stage and a reconstructing stage.

Dictionary Learning Image Super-Resolution

Paper
Add Code

Correlation Preserving Sparse Coding Over Multi-level Dictionaries for Image Denoising

no code implementations • 23 Dec 2016 • Rui Chen, Huizhu Jia, Xiaodong Xie, Wen Gao

In this letter, we propose a novel image denoising method based on correlation preserving sparse coding.

Image Denoising

Paper
Add Code

Blind restoration for non-uniform aerial images using non-local Retinex model and shearlet-based higher-order regularization

no code implementations • 23 Dec 2016 • Rui Chen, Huizhu Jia, Xiaodong Xie, Wen Gao

Aerial images are often degraded by space-varying motion blur and simultaneous uneven illumination.

Deblurring

Paper
Add Code

Deep Attributes Driven Multi-Camera Person Re-identification

no code implementations • 11 May 2016 • Chi Su, Shiliang Zhang, Junliang Xing, Wen Gao, Qi Tian

And we propose a semi-supervised attribute learning framework which progressively boosts the accuracy of attributes only using a limited number of labeled data.

Attribute Metric Learning +1

Paper
Add Code

Maximal Sparsity with Deep Networks?

no code implementations • NeurIPS 2016 • Bo Xin, Yizhou Wang, Wen Gao, David Wipf

The iterations of many sparse estimation algorithms are comprised of a fixed linear filter cascaded with a thresholding nonlinearity, which collectively resemble a typical neural network layer.

Paper
Add Code

Cross-pose Face Recognition by Canonical Correlation Analysis

no code implementations • 29 Jul 2015 • Annan Li, Shiguang Shan, Xilin Chen, Bingpeng Ma, Shuicheng Yan, Wen Gao

We argue that one of the diffculties in this problem is the severe misalignment in face images or feature vectors with different poses.

Face Recognition

Paper
Add Code

Background Subtraction via Generalized Fused Lasso Foreground Modeling

no code implementations • CVPR 2015 • Bo Xin, Yuan Tian, Yizhou Wang, Wen Gao

Background Subtraction (BS) is one of the key steps in video analysis.

Foreground Segmentation

Paper
Add Code

Stable Feature Selection from Brain sMRI

no code implementations • 25 Mar 2015 • Bo Xin, Lingjing Hu, Yizhou Wang, Wen Gao

Neuroimage analysis usually involves learning thousands or even millions of variables using only a limited number of samples.

feature selection

Paper
Add Code

Robust Estimation of 3D Human Poses from a Single Image

no code implementations • CVPR 2014 • Chunyu Wang, Yizhou Wang, Zhouchen Lin, Alan L. Yuille, Wen Gao

We address the challenges in three ways: (i) We represent a 3D pose as a linear combination of a sparse set of bases learned from 3D human skeletons.

Ranked #27 on 3D Human Pose Estimation on HumanEva-I

3D Human Pose Estimation 3D Pose Estimation +2

Paper
Add Code

Image Restoration Using Joint Statistical Modeling in Space-Transform Domain

no code implementations • 11 May 2014 • Jian Zhang, Debin Zhao, Ruiqin Xiong, Siwei Ma, Wen Gao

This paper presents a novel strategy for high-fidelity image restoration by characterizing both local smoothness and nonlocal self-similarity of natural images in a unified statistical manner.

Deblurring Image Deblurring +3

Paper
Add Code

Image Compressive Sensing Recovery Using Adaptively Learned Sparsifying Basis via L0 Minimization

no code implementations • 30 Apr 2014 • Jian Zhang, Chen Zhao, Debin Zhao, Wen Gao

From many fewer acquired measurements than suggested by the Nyquist sampling theory, compressive sensing (CS) theory demonstrates that, a signal can be reconstructed with high probability when it exhibits sparsity in some domain.

Blocking Compressive Sensing

Paper
Add Code

Structural Group Sparse Representation for Image Compressive Sensing Recovery

no code implementations • 29 Apr 2014 • Jian Zhang, Debin Zhao, Feng Jiang, Wen Gao

Compressive Sensing (CS) theory shows that a signal can be decoded from many fewer measurements than suggested by the Nyquist sampling theory, when the signal is sparse in some domain.

Compressive Sensing

Paper
Add Code

RAM: A Region-Aware Deep Model for Vehicle Re-Identification

no code implementations • 25 Jun 2018 • Xiaobin Liu, Shiliang Zhang, Qingming Huang, Wen Gao

Specifically, in addition to extracting global features, RAM also extracts features from a series of local regions.

Vehicle Re-Identification

Paper
Add Code

Computed Tomography Image Enhancement using 3D Convolutional Neural Network

no code implementations • 18 Jul 2018 • Meng Li, Shiwen Shen, Wen Gao, William Hsu, Jason Cong

Computed tomography (CT) is increasingly being used for cancer screening, such as early detection of lung cancer.

Computed Tomography (CT) Image Enhancement +1

Paper
Add Code

Attention Driven Person Re-identification

no code implementations • 13 Oct 2018 • Fan Yang, Ke Yan, Shijian Lu, Huizhu Jia, Xiaodong Xie, Wen Gao

Person re-identification (ReID) is a challenging task due to arbitrary human pose variations, background clutters, etc.

Person Re-Identification

Paper
Add Code

Deep Alternative Neural Network: Exploring Contexts as Early as Possible for Action Recognition

no code implementations • NeurIPS 2016 • Jinzhuo Wang, Wenmin Wang, Xiongtao Chen, Ronggang Wang, Wen Gao

This paper instead explores contexts as early as possible and leverages their evolutions for action recognition.

Action Recognition Optical Flow Estimation +1

Paper
Add Code

Depth-Aware Stereo Video Retargeting

no code implementations • CVPR 2018 • Bing Li, Chia-Wen Lin, Boxin Shi, Tiejun Huang, Wen Gao, C. -C. Jay Kuo

As compared with traditional video retargeting, stereo video retargeting poses new challenges because stereo video contains the depth information of salient objects and its time dynamics.

Paper
Add Code

Image Denoising via Adaptive Soft-Thresholding Based on Non-Local Samples

no code implementations • CVPR 2015 • Hangfan Liu, Ruiqin Xiong, Jian Zhang, Wen Gao

To estimate the expectation and variance parameters for the transform bands of a particular patch, we exploit the non-local correlation of image and collect a set of similar patches as data samples to form the distribution.

Image Denoising

Paper
Add Code

Multi-Task Learning With Low Rank Attribute Embedding for Person Re-Identification

no code implementations • ICCV 2015 • Chi Su, Fan Yang, Shiliang Zhang, Qi Tian, Larry S. Davis, Wen Gao

Since attributes are generally correlated, we introduce a low rank attribute embedding into the MTL formulation to embed original binary attributes to a continuous attribute space, where incorrect and incomplete attributes are rectified and recovered to better describe people.

Attribute Multi-Task Learning +1

Paper
Add Code

Scalable Facial Image Compression with Deep Feature Reconstruction

no code implementations • 14 Mar 2019 • Shurun Wang, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

In this paper, we propose a scalable image compression scheme, including the base layer for feature representation and enhancement layer for texture representation.

Image Compression

Paper
Add Code

Self-critical n-step Training for Image Captioning

no code implementations • CVPR 2019 • Junlong Gao, Shiqi Wang, Shanshe Wang, Siwei Ma, Wen Gao

Existing methods for image captioning are usually trained by cross entropy loss, which leads to exposure bias and the inconsistency between the optimizing function and evaluation metrics.

Image Captioning

Paper
Add Code

Masked Non-Autoregressive Image Captioning

no code implementations • 3 Jun 2019 • Junlong Gao, Xi Meng, Shiqi Wang, Xia Li, Shanshe Wang, Siwei Ma, Wen Gao

Existing captioning models often adopt the encoder-decoder architecture, where the decoder uses autoregressive decoding to generate captions, such that each token is generated sequentially given the preceding generated tokens.

Image Captioning Machine Translation +1

Paper
Add Code

Single Image Blind Deblurring Using Multi-Scale Latent Structure Prior

no code implementations • 11 Jun 2019 • Yuanchao Bai, Huizhu Jia, Ming Jiang, Xian-Ming Liu, Xiaodong Xie, Wen Gao

Blind image deblurring is a challenging problem in computer vision, which aims to restore both the blur kernel and the latent sharp image from only a blurry observation.

Blind Image Deblurring Image Deblurring +3

Paper
Add Code

FedHealth: A Federated Transfer Learning Framework for Wearable Healthcare

no code implementations • 22 Jul 2019 • Yiqiang Chen, Jindong Wang, Chaohui Yu, Wen Gao, Xin Qin

It is able to achieve accurate and personalized healthcare without compromising privacy and security.

Activity Recognition Federated Learning +2

Paper
Add Code

Towards Digital Retina in Smart Cities: A Model Generation, Utilization and Communication Paradigm

1 code implementation • 31 Jul 2019 • Yihang Lou, Ling-Yu Duan, Yong Luo, Ziqian Chen, Tongliang Liu, Shiqi Wang, Wen Gao

The digital retina in smart cities is to select what the City Eye tells the City Brain, and convert the acquired visual data from front-end visual sensors to features in an intelligent sensing manner.

Paper
Code

Global-Local Temporal Representations For Video Person Re-Identification

no code implementations • ICCV 2019 • Jianing Li, Jingdong Wang, Qi Tian, Wen Gao, Shiliang Zhang

The long-term relations are captured by a temporal self-attention model to alleviate the occlusions and noises in video sequences.

Metric Learning Re-Ranking +1

Paper
Add Code

Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics

no code implementations • 10 Jan 2020 • Ling-Yu Duan, Jiaying Liu, Wenhan Yang, Tiejun Huang, Wen Gao

Meanwhile, we systematically review state-of-the-art techniques in video compression and feature compression from the unique perspective of MPEG standardization, which provides the academic and industrial evidence to realize the collaborative compression of video and feature streams in a broad range of AI applications.

Feature Compression Video Compression

Paper
Add Code

Rectified Meta-Learning from Noisy Labels for Robust Image-based Plant Disease Diagnosis

no code implementations • 17 Mar 2020 • Ruifeng Shi, Deming Zhai, Xian-Ming Liu, Junjun Jiang, Wen Gao

However, the performance of CNN-based classification approach depends on a large amount of high-quality manually labeled training data, which are inevitably introduced noise on labels in practice, leading to model overfitting and performance degradation.

General Classification Image Classification +1

Paper
Add Code

Towards Analysis-friendly Face Representation with Scalable Feature and Texture Compression

no code implementations • 21 Apr 2020 • Shurun Wang, Shiqi Wang, Wenhan Yang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

In particular, we study the feature and texture compression in a scalable coding framework, where the base layer serves as the deep learning feature and enhancement layer targets to perfectly reconstruct the texture.

Image Compression

Paper
Add Code

Location-Aware Feature Selection Text Detection Network

no code implementations • 23 Apr 2020 • Zengyuan Guo, Zilin Wang, Zhihui Wang, Wanli Ouyang, Haojie Li, Wen Gao

However, they are behind in accuracy comparing with recent segmentation-based text detectors.

feature selection regression +2

Paper
Add Code

Towards Fine-grained Human Pose Transfer with Detail Replenishing Network

no code implementations • 26 May 2020 • Lingbo Yang, Pan Wang, Chang Liu, Zhanning Gao, Peiran Ren, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Xian-Sheng Hua, Wen Gao

Human pose transfer (HPT) is an emerging research topic with huge potential in fashion design, media production, online advertising and virtual reality.

Pose Transfer Retrieval

Paper
Add Code

Sequential Hierarchical Learning with Distribution Transformation for Image Super-Resolution

no code implementations • 19 Jul 2020 • Yuqing Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

Based on the observation, in this paper, we build a sequential hierarchical learning super-resolution network (SHSR) for effective image SR.

Ranked #9 on Image Super-Resolution on Manga109 - 3x upscaling

Image Restoration Image Super-Resolution +1

Paper
Add Code

Intrinsic Temporal Regularization for High-resolution Human Video Synthesis

no code implementations • 11 Dec 2020 • Lingbo Yang, Zhanning Gao, Peiran Ren, Siwei Ma, Wen Gao

Temporal consistency is crucial for extending image processing pipelines to the video domain, which is often enforced with flow-based warping error over adjacent frames.

Motion Estimation Vocal Bursts Intensity Prediction

Paper
Add Code

Recent Standard Development Activities on Video Coding for Machines

no code implementations • 26 May 2021 • Wen Gao, Shan Liu, Xiaozhong Xu, Manouchehr Rafie, Yuan Zhang, Igor Curcio

Specifically, we will first provide an overview of the MPEG VCM group including use cases, requirements, processing pipelines, plan for potential VCM standards, followed by the evaluation framework including machine-vision tasks, dataset, evaluation metrics, and anchor generation.

object-detection Object Detection

Paper
Add Code

Progressive Stage-wise Learning for Unsupervised Feature Representation Enhancement

no code implementations • CVPR 2021 • Zefan Li, Chenxi Liu, Alan Yuille, Bingbing Ni, Wenjun Zhang, Wen Gao

For a given unsupervised task, we design multilevel tasks and define different learning stages for the deep network.

Paper
Add Code

Rate Distortion Characteristic Modeling for Neural Image Compression

no code implementations • 24 Jun 2021 • Chuanmin Jia, Ziqing Ge, Shanshe Wang, Siwei Ma, Wen Gao

End-to-end optimized neural image compression (NIC) has obtained superior lossy compression performance recently.

Image Compression

Paper
Add Code

Post-Training Quantization for Vision Transformer

no code implementations • NeurIPS 2021 • Zhenhua Liu, Yunhe Wang, Kai Han, Siwei Ma, Wen Gao

Recently, transformer has achieved remarkable performance on a variety of computer vision applications.

Quantization

Paper
Add Code

Knowledge Transfer via Student-Teacher Collaboration

no code implementations • 25 Sep 2019 • Tianxiao Gao, Ruiqin Xiong, Zhenhua Liu, Siwei Ma, Feng Wu, Tiejun Huang, Wen Gao

One way to compress these heavy models is knowledge transfer (KT), in which a light student network is trained through absorbing the knowledge from a powerful teacher network.

Transfer Learning

Paper
Add Code

Cross-SRN: Structure-Preserving Super-Resolution Network with Cross Convolution

no code implementations • 5 Jan 2022 • Yuqing Liu, Qi Jia, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

It is challenging to restore low-resolution (LR) images to super-resolution (SR) images with correct and clear details.

Super-Resolution

Paper
Add Code

Gradient Correction beyond Gradient Descent

no code implementations • 16 Mar 2022 • Zefan Li, Bingbing Ni, Teng Li, Wenjun Zhang, Wen Gao

GCGD consists of two plug-in modules: 1) inspired by the idea of gradient prediction, we propose a \textbf{GC-W} module for weight gradient correction; 2) based on Neural ODE, we propose a \textbf{GC-ODE} module for hidden states gradient correction.

Paper
Add Code

STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond

no code implementations • 20 Apr 2022 • Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

In this paper, we propose a SpatioTemporal-Aware Unit (STAU) for video prediction and beyond by exploring the significant spatiotemporal correlations in videos.

Action Recognition object-detection +2

Paper
Add Code

Hierarchical Similarity Learning for Aliasing Suppression Image Super-Resolution

no code implementations • 7 Jun 2022 • Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

As a highly ill-posed issue, single image super-resolution (SISR) has been widely investigated in recent years.

Image Super-Resolution

Paper
Add Code

Towards Hybrid-Optimization Video Coding

no code implementations • 12 Jul 2022 • Shuai Huo, Dong Liu, Li Li, Siwei Ma, Feng Wu, Wen Gao

Our idea is to provide multiple discrete starting points in the global space and optimize the local optimum around each point by numerical algorithm efficiently.

Paper
Add Code

Cross Modal Compression: Towards Human-comprehensible Semantic Compression

no code implementations • 6 Sep 2022 • Jiguo Li, Chuanmin Jia, Xinfeng Zhang, Siwei Ma, Wen Gao

With the recent advances in cross modal translation and generation, in this paper, we propose the cross modal compression~(CMC), a semantic compression framework for visual data, to transform the high redundant visual data~(such as image, video, etc.)

Feature Compression Video Compression

Paper
Add Code

Learning to Compress Unmanned Aerial Vehicle (UAV) Captured Video: Benchmark and Analysis

no code implementations • 15 Jan 2023 • Chuanmin Jia, Feng Ye, Huifang Sun, Siwei Ma, Wen Gao

During the past decade, the Unmanned-Aerial-Vehicles (UAVs) have attracted increasing attention due to their flexible, extensive, and dynamic space-sensing capabilities.

Video Compression

Paper
Add Code

SpikeCodec: An End-to-end Learned Compression Framework for Spiking Camera

no code implementations • 25 Jun 2023 • Kexiang Feng, Chuanmin Jia, Siwei Ma, Wen Gao

Recently, the bio-inspired spike camera with continuous motion recording capability has attracted tremendous attention due to its ultra high temporal resolution imaging characteristic.

Data Compression

Paper
Add Code

CSSL-RHA: Contrastive Self-Supervised Learning for Robust Handwriting Authentication

no code implementations • 18 Jul 2023 • Jingyao Wang, Luntian Mou, Changwen Zheng, Wen Gao

In this paper, we propose a novel Contrastive Self-Supervised Learning framework for Robust Handwriting Authentication (CSSL-RHA) to address these issues.

Self-Supervised Learning

Paper
Add Code

Intelligence-Endogenous Management Platform for Computing and Network Convergence

no code implementations • 7 Aug 2023 • Zicong Hong, Xiaoyu Qiu, Jian Lin, Wuhui Chen, Yue Yu, Hui Wang, Song Guo, Wen Gao

Therefore, in this article, we present the concept of an intelligence-endogenous management platform for CNCs called \emph{CNC brain} based on artificial intelligence technologies.

Management Scheduling

Paper
Add Code

AI Alignment: A Comprehensive Survey

no code implementations • 30 Oct 2023 • Jiaming Ji, Tianyi Qiu, Boyuan Chen, Borong Zhang, Hantao Lou, Kaile Wang, Yawen Duan, Zhonghao He, Jiayi Zhou, Zhaowei Zhang, Fanzhi Zeng, Kwan Yee Ng, Juntao Dai, Xuehai Pan, Aidan O'Gara, Yingshan Lei, Hua Xu, Brian Tse, Jie Fu, Stephen Mcaleer, Yaodong Yang, Yizhou Wang, Song-Chun Zhu, Yike Guo, Wen Gao

The former aims to make AI systems aligned via alignment training, while the latter aims to gain evidence about the systems' alignment and govern them appropriately to avoid exacerbating misalignment risks.

Paper
Add Code

SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field

no code implementations • 26 Feb 2024 • Zetian Song, Wenhong Duan, Yuhuai Zhang, Shiqi Wang, Siwei Ma, Wen Gao

Representing the Neural Radiance Field (NeRF) with the explicit voxel grid (EVG) is a promising direction for improving NeRFs.

Image Compression Neural Network Compression +1

Paper
Add Code

IME: Integrating Multi-curvature Shared and Specific Embedding for Temporal Knowledge Graph Completion

no code implementations • 28 Mar 2024 • Jiapu Wang, Zheng Cui, Boyue Wang, Shirui Pan, Junbin Gao, BaoCai Yin, Wen Gao

However, existing Temporal Knowledge Graph Completion (TKGC) methods either model TKGs in a single space or neglect the heterogeneity of different curvature spaces, thus constraining their capacity to capture these intricate geometric structures.

Temporal Knowledge Graph Completion

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.