no code implementations • 23 Dec 2024 • Daxin Li, Yuanchao Bai, Kai Wang, Junjun Jiang, Xianming Liu, Wen Gao
To address this challenge, we explore the connection between the Minimum Description Length (MDL) principle and Parameter-Efficient Transfer Learning (PETL), leading to the development of a novel content-adaptive approach for learned lossless image compression, dubbed CALLIC.
no code implementations • 23 Dec 2024 • Qi Zhang, Shanshe Wang, Xinfeng Zhang, Siwei Ma, Jingshan Pan, Wen Gao
It is meaningful to predict the perceptual quality of compressed images for both humans and machines, which guides the optimization for compression.
no code implementations • 19 Aug 2024 • Jingyao Wang, Luntian Mou, Changwen Zheng, Wen Gao
Freeform handwriting authentication verifies a person's identity from their writing style and habits in messy handwriting data.
1 code implementation • 9 Jul 2024 • Yang Liu, Weixing Chen, Yongjie Bai, Xiaodan Liang, Guanbin Li, Wen Gao, Liang Lin
In this survey, we give a comprehensive exploration of the latest advancements in Embodied AI.
1 code implementation • 24 Jun 2024 • Ziguang Li, Chao Huang, Xuliang Wang, Haibo Hu, Cole Wyeth, Dongbo Bu, Quan Yu, Wen Gao, Xingwu Liu, Ming Li
The better a large model understands the data, the better LMCompress compresses.
1 code implementation • 20 Jun 2024 • Sibo Wang, Xiangkui Cao, Jie Zhang, Zheng Yuan, Shiguang Shan, Xilin Chen, Wen Gao
The emergence of Large Vision-Language Models (LVLMs) marks significant strides towards achieving general artificial intelligence.
no code implementations • 2 May 2024 • Daxin Li, Yuanchao Bai, Kai Wang, Junjun Jiang, Xianming Liu, Wen Gao
To further expedite the network inference, we introduce context cache optimization to GroupedMixer, which caches attention activation values in cross-group token-mixers and avoids complex and duplicated computation.
no code implementations • 21 Apr 2024 • Jiyong Ma, Wen Gao, Chunli Wang
In this paper, a novel approach to sign language recognition based on state tying in each of data streams is presented.
no code implementations • 28 Mar 2024 • Jiapu Wang, Zheng Cui, Boyue Wang, Shirui Pan, Junbin Gao, BaoCai Yin, Wen Gao
However, existing Temporal Knowledge Graph Completion (TKGC) methods either model TKGs in a single space or neglect the heterogeneity of different curvature spaces, thus constraining their capacity to capture these intricate geometric structures.
no code implementations • 26 Feb 2024 • Zetian Song, Wenhong Duan, Yuhuai Zhang, Shiqi Wang, Siwei Ma, Wen Gao
Representing the Neural Radiance Field (NeRF) with the explicit voxel grid (EVG) is a promising direction for improving NeRFs.
1 code implementation • 2 Nov 2023 • Wei zhang, Dingquan Li, Ge Li, Wen Gao
This paper presents an approach for compressing point cloud geometry by leveraging a lightweight super-resolution network.
no code implementations • 30 Oct 2023 • Jiaming Ji, Tianyi Qiu, Boyuan Chen, Borong Zhang, Hantao Lou, Kaile Wang, Yawen Duan, Zhonghao He, Jiayi Zhou, Zhaowei Zhang, Fanzhi Zeng, Kwan Yee Ng, Juntao Dai, Xuehai Pan, Aidan O'Gara, Yingshan Lei, Hua Xu, Brian Tse, Jie Fu, Stephen Mcaleer, Yaodong Yang, Yizhou Wang, Song-Chun Zhu, Yike Guo, Wen Gao
The former aims to make AI systems aligned via alignment training, while the latter aims to gain evidence about the systems' alignment and govern them appropriately to avoid exacerbating misalignment risks.
no code implementations • 7 Aug 2023 • Zicong Hong, Xiaoyu Qiu, Jian Lin, Wuhui Chen, Yue Yu, Hui Wang, Song Guo, Wen Gao
Therefore, in this article, we present the concept of an intelligence-endogenous management platform for CNCs called \emph{CNC brain} based on artificial intelligence technologies.
1 code implementation • 4 Aug 2023 • Jiapu Wang, Boyue Wang, Meikang Qiu, Shirui Pan, Bo Xiong, Heng Liu, Linhao Luo, Tengfei Liu, Yongli Hu, BaoCai Yin, Wen Gao
Temporal characteristics are prominently evident in a substantial volume of knowledge, which underscores the pivotal role of Temporal Knowledge Graphs (TKGs) in both academia and industry.
no code implementations • 18 Jul 2023 • Jingyao Wang, Luntian Mou, Changwen Zheng, Wen Gao
In this paper, we propose a novel Contrastive Self-Supervised Learning framework for Robust Handwriting Authentication (CSSL-RHA) to address these issues.
no code implementations • 25 Jun 2023 • Kexiang Feng, Chuanmin Jia, Siwei Ma, Wen Gao
Recently, the bio-inspired spike camera with continuous motion recording capability has attracted tremendous attention due to its ultra high temporal resolution imaging characteristic.
1 code implementation • ICCV 2023 • Wenkang Shan, Zhenhua Liu, Xinfeng Zhang, Zhao Wang, Kai Han, Shanshe Wang, Siwei Ma, Wen Gao
On the other hand, JPMA is proposed to assemble multiple hypotheses generated by D3DP into a single 3D pose for practical use.
1 code implementation • 20 Feb 2023 • Xiao Wang, Guangyao Chen, Guangwu Qian, Pengcheng Gao, Xiao-Yong Wei, YaoWei Wang, Yonghong Tian, Wen Gao
We also give visualization and analysis of the model parameters and results on representative downstream tasks.
no code implementations • 15 Jan 2023 • Chuanmin Jia, Feng Ye, Huifang Sun, Siwei Ma, Wen Gao
During the past decade, the Unmanned-Aerial-Vehicles (UAVs) have attracted increasing attention due to their flexible, extensive, and dynamic space-sensing capabilities.
1 code implementation • 13 Nov 2022 • Qi Zhang, Shanshe Wang, Xinfeng Zhang, Chuanmin Jia, Zhao Wang, Siwei Ma, Wen Gao
Each score is derived from machine perceptual differences between original and compressed images.
1 code implementation • 11 Sep 2022 • Yuanchao Bai, Xianming Liu, Kai Wang, Xiangyang Ji, Xiaolin Wu, Wen Gao
In the lossless mode, the DLPR coding system first performs lossy compression and then lossless coding of residuals.
no code implementations • 6 Sep 2022 • Jiguo Li, Chuanmin Jia, Xinfeng Zhang, Siwei Ma, Wen Gao
With the recent advances in cross modal translation and generation, in this paper, we propose the cross modal compression~(CMC), a semantic compression framework for visual data, to transform the high redundant visual data~(such as image, video, etc.)
no code implementations • 12 Jul 2022 • Shuai Huo, Dong Liu, Li Li, Siwei Ma, Feng Wu, Wen Gao
Our idea is to provide multiple discrete starting points in the global space and optimize the local optimum around each point by numerical algorithm efficiently.
1 code implementation • 9 Jun 2022 • Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
To solve the information loss problem, the proposed model aims to preserve the spatiotemporal information for videos during the feature extraction and the state transitions, respectively.
no code implementations • 7 Jun 2022 • Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao
As a highly ill-posed issue, single image super-resolution (SISR) has been widely investigated in recent years.
1 code implementation • 27 May 2022 • Yuqing Liu, Qi Jia, Shanshe Wang, Siwei Ma, Wen Gao
Image super-resolution (SR) has been widely investigated in recent years.
1 code implementation • 26 Apr 2022 • Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao
Existing BDE methods have no unified solution for various BDE situations, and directly learn a mapping for each pixel from LBD image to the desired value in HBD image, which may change the given high-order bits and lead to a huge deviation from the ground truth.
no code implementations • 20 Apr 2022 • Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
In this paper, we propose a SpatioTemporal-Aware Unit (STAU) for video prediction and beyond by exploring the significant spatiotemporal correlations in videos.
1 code implementation • CVPR 2022 • Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
In this paper, we propose a Spatiotemporal Residual Predictive Model (STRPM) for high-resolution video prediction.
no code implementations • 16 Mar 2022 • Zefan Li, Bingbing Ni, Teng Li, Wenjun Zhang, Wen Gao
GCGD consists of two plug-in modules: 1) inspired by the idea of gradient prediction, we propose a \textbf{GC-W} module for weight gradient correction; 2) based on Neural ODE, we propose a \textbf{GC-ODE} module for hidden states gradient correction.
1 code implementation • 15 Mar 2022 • Wenkang Shan, Zhenhua Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
In Stage II, the pre-trained encoder is loaded to STMO model and fine-tuned.
Ranked #11 on Monocular 3D Human Pose Estimation on Human3.6M
no code implementations • 5 Jan 2022 • Yuqing Liu, Qi Jia, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao
It is challenging to restore low-resolution (LR) images to super-resolution (SR) images with correct and clear details.
4 code implementations • CVPR 2022 • Zhenhua Liu, Yunhe Wang, Kai Han, Siwei Ma, Wen Gao
However, natural images are of huge diversity with abundant content and using such a universal quantization configuration for all samples is not an optimal strategy.
3 code implementations • 23 Dec 2021 • Shuohuan Wang, Yu Sun, Yang Xiang, Zhihua Wu, Siyu Ding, Weibao Gong, Shikun Feng, Junyuan Shang, Yanbin Zhao, Chao Pang, Jiaxiang Liu, Xuyi Chen, Yuxiang Lu, Weixin Liu, Xi Wang, Yangfan Bai, Qiuliang Chen, Li Zhao, Shiyong Li, Peng Sun, dianhai yu, Yanjun Ma, Hao Tian, Hua Wu, Tian Wu, Wei Zeng, Ge Li, Wen Gao, Haifeng Wang
A unified framework named ERNIE 3. 0 was recently proposed for pre-training large-scale knowledge enhanced models and trained a model with 10 billion parameters.
1 code implementation • 17 Dec 2021 • Yuanchao Bai, Xu Yang, Xianming Liu, Junjun Jiang, YaoWei Wang, Xiangyang Ji, Wen Gao
Meanwhile, we propose a feature aggregation module to fuse the compressed features with the selected intermediate features of the Transformer, and feed the aggregated features to a deconvolutional neural network for image reconstruction.
1 code implementation • NeurIPS 2021 • Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Yan Ye, Xiang Xinguang, Wen Gao
The attention module aims to learn an attention map based on the correlations between the current spatial state and the historical spatial states.
Ranked #19 on Video Prediction on Moving MNIST
1 code implementation • 29 Jul 2021 • Wenkang Shan, Haopeng Lu, Shanshe Wang, Xinfeng Zhang, Wen Gao
To alleviate these two problems, we propose a relative information encoding method that yields positional and temporal enhanced representations.
Ranked #14 on Monocular 3D Human Pose Estimation on Human3.6M
no code implementations • NeurIPS 2021 • Zhenhua Liu, Yunhe Wang, Kai Han, Siwei Ma, Wen Gao
Recently, transformer has achieved remarkable performance on a variety of computer vision applications.
no code implementations • 24 Jun 2021 • Chuanmin Jia, Ziqing Ge, Shanshe Wang, Siwei Ma, Wen Gao
End-to-end optimized neural image compression (NIC) has obtained superior lossy compression performance recently.
no code implementations • CVPR 2021 • Zefan Li, Chenxi Liu, Alan Yuille, Bingbing Ni, Wenjun Zhang, Wen Gao
For a given unsupervised task, we design multilevel tasks and define different learning stages for the deep network.
no code implementations • 26 May 2021 • Wen Gao, Shan Liu, Xiaozhong Xu, Manouchehr Rafie, Yuan Zhang, Igor Curcio
Specifically, we will first provide an overview of the MPEG VCM group including use cases, requirements, processing pipelines, plan for potential VCM standards, followed by the evaluation framework including machine-vision tasks, dataset, evaluation metrics, and anchor generation.
no code implementations • 11 Dec 2020 • Lingbo Yang, Zhanning Gao, Peiran Ren, Siwei Ma, Wen Gao
Temporal consistency is crucial for extending image processing pipelines to the video domain, which is often enforced with flow-based warping error over adjacent frames.
6 code implementations • CVPR 2021 • Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, Wen Gao
To maximally excavate the capability of transformer, we present to utilize the well-known ImageNet benchmark for generating a large amount of corrupted image pairs.
Ranked #1 on Single Image Deraining on Rain100L (using extra training data)
1 code implementation • 12 Oct 2020 • Lingbo Yang, Pan Wang, Zhanning Gao, Shanshe Wang, Peiran Ren, Siwei Ma, Wen Gao
Face restoration is an inherently ill-posed problem, where additional prior constraints are typically considered crucial for mitigating such pathology.
no code implementations • 19 Jul 2020 • Yuqing Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
Based on the observation, in this paper, we build a sequential hierarchical learning super-resolution network (SHSR) for effective image SR.
Ranked #15 on Image Super-Resolution on Manga109 - 3x upscaling
1 code implementation • IEEE Transactions on Medical Imaging 2020 • Meng Li, William Hsu, Xiaodong Xie, Jason Cong, Wen Gao
We combine these two methods and demonstrate their effectiveness on both CNN-based neural networks and WGAN-based neural networks with comprehensive experiments.
no code implementations • 26 May 2020 • Lingbo Yang, Pan Wang, Chang Liu, Zhanning Gao, Peiran Ren, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Xian-Sheng Hua, Wen Gao
Human pose transfer (HPT) is an emerging research topic with huge potential in fashion design, media production, online advertising and virtual reality.
1 code implementation • 26 May 2020 • Lingbo Yang, Pan Wang, Xinfeng Zhang, Shanshe Wang, Zhanning Gao, Peiran Ren, Xuansong Xie, Siwei Ma, Wen Gao
The ability to produce convincing textural details is essential for the fidelity of synthesized person images.
Ranked #4 on Pose Transfer on Deep-Fashion
1 code implementation • 20 May 2020 • Yuqing Liu, Shiqi Wang, Jian Zhang, Shanshe Wang, Siwei Ma, Wen Gao
A novel iterative super-resolution network (ISRN) is proposed on top of the iterative optimization.
5 code implementations • 11 May 2020 • Lingbo Yang, Chang Liu, Pan Wang, Shanshe Wang, Peiran Ren, Siwei Ma, Wen Gao
Existing face restoration researches typically relies on either the degradation prior or explicit guidance labels for training, which often results in limited generalization ability over real-world images with heterogeneous degradations and rich background contents.
1 code implementation • 30 Apr 2020 • He Bai, Peng Shi, Jimmy Lin, Yuqing Xie, Luchen Tan, Kun Xiong, Wen Gao, Ming Li
To verify this, we propose a segment-aware Transformer (Segatron), by replacing the original token position encoding with a combined position encoding of paragraph, sentence, and token.
Ranked #20 on Language Modelling on WikiText-103
no code implementations • 23 Apr 2020 • Zengyuan Guo, Zilin Wang, Zhihui Wang, Wanli Ouyang, Haojie Li, Wen Gao
However, they are behind in accuracy comparing with recent segmentation-based text detectors.
no code implementations • 21 Apr 2020 • Shurun Wang, Shiqi Wang, Wenhan Yang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
In particular, we study the feature and texture compression in a scalable coding framework, where the base layer serves as the deep learning feature and enhancement layer targets to perfectly reconstruct the texture.
1 code implementation • 7 Apr 2020 • Jiguo Li, Xinfeng Zhang, Jizheng Xu, Li Zhang, Yue Wang, Siwei Ma, Wen Gao
Due to the widespread deployment of fingerprint/face/speaker recognition systems, attacking deep learning based biometric systems has drawn more and more attention.
Audio and Speech Processing Cryptography and Security Sound
1 code implementation • 7 Apr 2020 • Jiguo Li, Xinfeng Zhang, Chuanmin Jia, Jizheng Xu, Li Zhang, Yue Wang, Siwei Ma, Wen Gao
Attacking deep learning based biometric systems has drawn more and more attention with the wide deployment of fingerprint/face/speaker recognition systems, given the fact that the neural networks are vulnerable to the adversarial examples, which have been intentionally perturbed to remain almost imperceptible for human.
1 code implementation • 7 Apr 2020 • Jiguo Li, Xinfeng Zhang, Chuanmin Jia, Jizheng Xu, Li Zhang, Yue Wang, Siwei Ma, Wen Gao
In this paper, we attempt to translate the speech signals into the image signals without the transcription stage.
Multimedia Sound Audio and Speech Processing
1 code implementation • ACL 2021 • He Bai, Peng Shi, Jimmy Lin, Luchen Tan, Kun Xiong, Wen Gao, Jie Liu, Ming Li
Experimental results show that the Chinese GPT2 can generate better essay endings with \eop.
no code implementations • 17 Mar 2020 • Ruifeng Shi, Deming Zhai, Xian-Ming Liu, Junjun Jiang, Wen Gao
However, the performance of CNN-based classification approach depends on a large amount of high-quality manually labeled training data, which are inevitably introduced noise on labels in practice, leading to model overfitting and performance degradation.
no code implementations • 10 Jan 2020 • Ling-Yu Duan, Jiaying Liu, Wenhan Yang, Tiejun Huang, Wen Gao
Meanwhile, we systematically review state-of-the-art techniques in video compression and feature compression from the unique perspective of MPEG standardization, which provides the academic and industrial evidence to realize the collaborative compression of video and feature streams in a broad range of AI applications.
no code implementations • 25 Sep 2019 • Tianxiao Gao, Ruiqin Xiong, Zhenhua Liu, Siwei Ma, Feng Wu, Tiejun Huang, Wen Gao
One way to compress these heavy models is knowledge transfer (KT), in which a light student network is trained through absorbing the knowledge from a powerful teacher network.
no code implementations • ICCV 2019 • Jianing Li, Jingdong Wang, Qi Tian, Wen Gao, Shiliang Zhang
The long-term relations are captured by a temporal self-attention model to alleviate the occlusions and noises in video sequences.
1 code implementation • 31 Jul 2019 • Yihang Lou, Ling-Yu Duan, Yong Luo, Ziqian Chen, Tongliang Liu, Shiqi Wang, Wen Gao
The digital retina in smart cities is to select what the City Eye tells the City Brain, and convert the acquired visual data from front-end visual sensors to features in an intelligent sensing manner.
no code implementations • 22 Jul 2019 • Yiqiang Chen, Jindong Wang, Chaohui Yu, Wen Gao, Xin Qin
It is able to achieve accurate and personalized healthcare without compromising privacy and security.
no code implementations • 11 Jun 2019 • Yuanchao Bai, Huizhu Jia, Ming Jiang, Xian-Ming Liu, Xiaodong Xie, Wen Gao
Blind image deblurring is a challenging problem in computer vision, which aims to restore both the blur kernel and the latent sharp image from only a blurry observation.
no code implementations • 3 Jun 2019 • Junlong Gao, Xi Meng, Shiqi Wang, Xia Li, Shanshe Wang, Siwei Ma, Wen Gao
Existing captioning models often adopt the encoder-decoder architecture, where the decoder uses autoregressive decoding to generate captions, such that each token is generated sequentially given the preceding generated tokens.
no code implementations • CVPR 2019 • Junlong Gao, Shiqi Wang, Shanshe Wang, Siwei Ma, Wen Gao
Existing methods for image captioning are usually trained by cross entropy loss, which leads to exposure bias and the inconsistency between the optimizing function and evaluation metrics.
no code implementations • 14 Mar 2019 • Shurun Wang, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
In this paper, we propose a scalable image compression scheme, including the base layer for feature representation and enhancement layer for texture representation.
no code implementations • 13 Oct 2018 • Fan Yang, Ke Yan, Shijian Lu, Huizhu Jia, Xiaodong Xie, Wen Gao
Person re-identification (ReID) is a challenging task due to arbitrary human pose variations, background clutters, etc.
no code implementations • 18 Jul 2018 • Meng Li, Shiwen Shen, Wen Gao, William Hsu, Jason Cong
Computed tomography (CT) is increasingly being used for cancer screening, such as early detection of lung cancer.
no code implementations • 25 Jun 2018 • Xiaobin Liu, Shiliang Zhang, Qingming Huang, Wen Gao
Specifically, in addition to extracting global features, RAM also extracts features from a series of local regions.
no code implementations • CVPR 2018 • Bing Li, Chia-Wen Lin, Boxin Shi, Tiejun Huang, Wen Gao, C. -C. Jay Kuo
As compared with traditional video retargeting, stereo video retargeting poses new challenges because stereo video contains the depth information of salient objects and its time dynamics.
no code implementations • 22 Feb 2018 • Yuanchao Bai, Gene Cheung, Xian-Ming Liu, Wen Gao
We leverage the new graph spectral interpretation for RGTV to design an efficient algorithm that solves for the skeleton image and the blur kernel alternately.
no code implementations • 24 Dec 2017 • Yuanchao Bai, Gene Cheung, Xian-Ming Liu, Wen Gao
The problem can be solved in two parts: i) estimate a blur kernel from the blurry image, and ii) given estimated blur kernel, de-convolve blurry input to restore the target image.
no code implementations • 20 Dec 2017 • Jianing Li, Shiliang Zhang, Jingdong Wang, Wen Gao, Qi Tian
This paper mainly establishes a large-scale Long sequence Video database for person re-IDentification (LVreID).
no code implementations • 5 Dec 2017 • Ling-Yu Duan, Yihang Lou, Shiqi Wang, Wen Gao, Yong Rui
To practically facilitate deep neural network models in the large-scale video analysis, there are still unprecedented challenges for the large-scale video data management.
30 code implementations • CVPR 2018 • Longhui Wei, Shiliang Zhang, Wen Gao, Qi Tian
Although the performance of person Re-Identification (ReID) has been significantly boosted, many challenging issues in real scenarios have not been fully investigated, e. g., the complex scenes and lighting variations, viewpoint and pose changes, and the large number of identities in a camera network.
Ranked #11 on Unsupervised Person Re-Identification on DukeMTMC-reID (Rank-10 metric)
no code implementations • 2 Nov 2017 • Zhenqiang Ying, Ge Li, Wen Gao
Inspired by human visual system, we design a multi-exposure fusion framework for low-light image enhancement.
no code implementations • ICCV 2017 • Chi Su, Jianing Li, Shiliang Zhang, Junliang Xing, Wen Gao, Qi Tian
Our deep architecture explicitly leverages the human part cues to alleviate the pose variations and learn robust feature representations from both the global image and different local parts.
Ranked #105 on Person Re-Identification on Market-1501
no code implementations • 13 Sep 2017 • Longhui Wei, Shiliang Zhang, Hantao Yao, Wen Gao, Qi Tian
Targeting to solve these problems, this work proposes a Global-Local-Alignment Descriptor (GLAD) and an efficient indexing and retrieval framework, respectively.
Ranked #92 on Person Re-Identification on Market-1501
no code implementations • ICCV 2017 • Zefan Li, Bingbing Ni, Wenjun Zhang, Xiaokang Yang, Wen Gao
Input binarization has shown to be an effective way for network acceleration.
no code implementations • 13 Jun 2017 • Jinzhuo Wang, Wenmin Wang, Ronggang Wang, Wen Gao
We show such setting can preserve more contexts of local features and its evolutions which are beneficial for move prediction.
no code implementations • 26 Apr 2017 • Ling-Yu Duan, Vijay Chandrasekhar, Shiqi Wang, Yihang Lou, Jie Lin, Yan Bai, Tiejun Huang, Alex ChiChung Kot, Wen Gao
This paper provides an overview of the on-going compact descriptors for video analysis standard (CDVA) from the ISO/IEC moving pictures experts group (MPEG).
no code implementations • 4 Apr 2017 • Rui Chen, Huizhu Jia, Xiaodong Xie, Wen Gao
The multiscale dictionary is considered as the product of oscillating dictionary and tolerance dictionary.
no code implementations • 12 Mar 2017 • Yang Zhao, Ronggang Wang, Wei Jia, Jianchao Yang, Wenmin Wang, Wen Gao
The proposed method consists of a learning stage and a reconstructing stage.
no code implementations • 23 Dec 2016 • Rui Chen, Huizhu Jia, Xiaodong Xie, Wen Gao
In this letter, we propose a novel image denoising method based on correlation preserving sparse coding.
no code implementations • 23 Dec 2016 • Rui Chen, Huizhu Jia, Xiaodong Xie, Wen Gao
Aerial images are often degraded by space-varying motion blur and simultaneous uneven illumination.
no code implementations • 12 Dec 2016 • Diqi Chen, Yizhou Wang, Tianfu Wu, Wen Gao
The model learning is implemented by a reinforcement strategy, in which the rewards of both tasks guide the learning of the optimal sampling policy to acquire the "task-informative" image regions so that the predictions can be made accurately and efficiently (in terms of the sampling steps).
no code implementations • NeurIPS 2016 • Jinzhuo Wang, Wenmin Wang, Xiongtao Chen, Ronggang Wang, Wen Gao
This paper instead explores contexts as early as possible and leverages their evolutions for action recognition.
no code implementations • 17 Aug 2016 • Xiang Zhang, Jiarui Sun, Siwei Ma, Zhouchen Lin, Jian Zhang, Shiqi Wang, Wen Gao
Therefore, introducing an accurate rate-constraint in sparse coding and dictionary learning becomes meaningful, which has not been fully exploited in the context of sparse representation.
no code implementations • 11 May 2016 • Chi Su, Shiliang Zhang, Junliang Xing, Wen Gao, Qi Tian
And we propose a semi-supervised attribute learning framework which progressively boosts the accuracy of attributes only using a limited number of labeled data.
no code implementations • NeurIPS 2016 • Bo Xin, Yizhou Wang, Wen Gao, David Wipf
The iterations of many sparse estimation algorithms are comprised of a fixed linear filter cascaded with a thresholding nonlinearity, which collectively resemble a typical neural network layer.
no code implementations • ICCV 2015 • Chi Su, Fan Yang, Shiliang Zhang, Qi Tian, Larry S. Davis, Wen Gao
Since attributes are generally correlated, we introduce a low rank attribute embedding into the MTL formulation to embed original binary attributes to a continuous attribute space, where incorrect and incomplete attributes are rectified and recovered to better describe people.
no code implementations • 29 Jul 2015 • Annan Li, Shiguang Shan, Xilin Chen, Bingpeng Ma, Shuicheng Yan, Wen Gao
We argue that one of the diffculties in this problem is the severe misalignment in face images or feature vectors with different poses.
no code implementations • CVPR 2015 • Hangfan Liu, Ruiqin Xiong, Jian Zhang, Wen Gao
To estimate the expectation and variance parameters for the transform bands of a particular patch, we exploit the non-local correlation of image and collect a set of similar patches as data samples to form the distribution.
no code implementations • CVPR 2015 • Bo Xin, Yuan Tian, Yizhou Wang, Wen Gao
Background Subtraction (BS) is one of the key steps in video analysis.
no code implementations • 25 Mar 2015 • Bo Xin, Lingjing Hu, Yizhou Wang, Wen Gao
Neuroimage analysis usually involves learning thousands or even millions of variables using only a limited number of samples.
no code implementations • CVPR 2014 • Chunyu Wang, Yizhou Wang, Zhouchen Lin, Alan L. Yuille, Wen Gao
We address the challenges in three ways: (i) We represent a 3D pose as a linear combination of a sparse set of bases learned from 3D human skeletons.
Ranked #29 on 3D Human Pose Estimation on HumanEva-I
1 code implementation • 14 May 2014 • Jian Zhang, Debin Zhao, Wen Gao
In this paper, instead of using patch as the basic unit of sparse representation, we exploit the concept of group as the basic unit of sparse representation, which is composed of nonlocal patches with similar structures, and establish a novel sparse representation modeling of natural images, called group-based sparse representation (GSR).
no code implementations • 11 May 2014 • Jian Zhang, Debin Zhao, Ruiqin Xiong, Siwei Ma, Wen Gao
This paper presents a novel strategy for high-fidelity image restoration by characterizing both local smoothness and nonlocal self-similarity of natural images in a unified statistical manner.
no code implementations • 30 Apr 2014 • Jian Zhang, Chen Zhao, Debin Zhao, Wen Gao
From many fewer acquired measurements than suggested by the Nyquist sampling theory, compressive sensing (CS) theory demonstrates that, a signal can be reconstructed with high probability when it exhibits sparsity in some domain.
no code implementations • 29 Apr 2014 • Jian Zhang, Debin Zhao, Feng Jiang, Wen Gao
Compressive Sensing (CS) theory shows that a signal can be decoded from many fewer measurements than suggested by the Nyquist sampling theory, when the signal is sparse in some domain.