no code implementations • 24 Nov 2024 • Yanchen Zhao, Wenhong Duan, Chuanmin Jia, Shanshe Wang, Siwei Ma
LLIP enhances the filtering process by leveraging a lightweight neural network model, where parameters can be exported for efficient inference.
1 code implementation • ICCV 2023 • Wenkang Shan, Zhenhua Liu, Xinfeng Zhang, Zhao Wang, Kai Han, Shanshe Wang, Siwei Ma, Wen Gao
On the other hand, JPMA is proposed to assemble multiple hypotheses generated by D3DP into a single 3D pose for practical use.
1 code implementation • 13 Nov 2022 • Qi Zhang, Shanshe Wang, Xinfeng Zhang, Chuanmin Jia, Zhao Wang, Siwei Ma, Wen Gao
Each score is derived from machine perceptual differences between original and compressed images.
1 code implementation • 9 Jun 2022 • Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
To solve the information loss problem, the proposed model aims to preserve the spatiotemporal information for videos during the feature extraction and the state transitions, respectively.
no code implementations • 7 Jun 2022 • Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao
As a highly ill-posed issue, single image super-resolution (SISR) has been widely investigated in recent years.
1 code implementation • 27 May 2022 • Yuqing Liu, Qi Jia, Shanshe Wang, Siwei Ma, Wen Gao
Image super-resolution (SR) has been widely investigated in recent years.
1 code implementation • 26 Apr 2022 • Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao
Existing bit-depth enhancement (BDE) methods offer no unified solution for the various BDE situations. They directly learn a mapping for each pixel from the low-bit-depth (LBD) image to the desired value in the high-bit-depth (HBD) image, which may change the given high-order bits and lead to a large deviation from the ground truth.
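To make the high-order-bit concern concrete, here is a minimal sketch in Python. It is not the method of the paper listed above: the function names and the clamping strategy are assumptions chosen only to show how a prediction restricted to the missing low-order bits leaves the given bits untouched.

```python
def expand_zero_padding(lbd_pixel: int, low_bits: int, high_bits: int) -> int:
    """Left-shift an LBD value so it fills the high-order bits of an HBD value."""
    return lbd_pixel << (high_bits - low_bits)


def restore_with_residual(lbd_pixel: int, predicted_residual: float,
                          low_bits: int, high_bits: int) -> int:
    """Add a predicted residual, clamped to the range of the missing low-order bits.

    Hypothetical helper: `predicted_residual` stands in for a model output.
    Clamping guarantees that the given high-order bits never change.
    """
    missing_bits = high_bits - low_bits
    max_residual = (1 << missing_bits) - 1
    residual = min(max(int(round(predicted_residual)), 0), max_residual)
    return expand_zero_padding(lbd_pixel, low_bits, high_bits) + residual


def high_order_bits(hbd_pixel: int, low_bits: int, high_bits: int) -> int:
    """Recover the original LBD value from an HBD value."""
    return hbd_pixel >> (high_bits - low_bits)


if __name__ == "__main__":
    lbd = 0b1011  # a 4-bit pixel value
    # Even a wildly wrong prediction cannot disturb the given bits.
    hbd = restore_with_residual(lbd, predicted_residual=300.0, low_bits=4, high_bits=8)
    print(hbd, high_order_bits(hbd, 4, 8) == lbd)
```

A direct per-pixel mapping has no such guarantee: nothing stops its output from falling outside the interval that the given bits imply.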
no code implementations • 20 Apr 2022 • Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
In this paper, we propose a SpatioTemporal-Aware Unit (STAU) for video prediction and beyond by exploring the significant spatiotemporal correlations in videos.
1 code implementation • CVPR 2022 • Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
In this paper, we propose a Spatiotemporal Residual Predictive Model (STRPM) for high-resolution video prediction.
1 code implementation • 15 Mar 2022 • Wenkang Shan, Zhenhua Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
In Stage II, the pre-trained encoder is loaded into the STMO model and fine-tuned.
Ranked #10 on Monocular 3D Human Pose Estimation on Human3.6M
no code implementations • 5 Jan 2022 • Yuqing Liu, Qi Jia, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao
It is challenging to restore low-resolution (LR) images to super-resolution (SR) images with correct and clear details.
1 code implementation • NeurIPS 2021 • Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Yan Ye, Xiang Xinguang, Wen Gao
The attention module aims to learn an attention map based on the correlations between the current spatial state and the historical spatial states.
Ranked #19 on Video Prediction on Moving MNIST
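The attention idea in the entry above (weighting historical spatial states by their correlation with the current one) can be sketched as follows. This is a generic dot-product attention in plain Python, offered as an assumption about the general shape of such a module rather than the paper's actual design. States are flattened to lists of floats for simplicity.

```python
import math
from typing import List


def attention_weights(current: List[float], history: List[List[float]]) -> List[float]:
    """Softmax over dot-product correlations between the current and past states."""
    scores = [sum(c * h for c, h in zip(current, past)) for past in history]
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(current: List[float], history: List[List[float]]) -> List[float]:
    """Aggregate historical states, weighted by their attention scores."""
    weights = attention_weights(current, history)
    return [
        sum(w * past[i] for w, past in zip(weights, history))
        for i in range(len(current))
    ]


if __name__ == "__main__":
    history = [[1.0, 0.0], [0.0, 1.0]]
    current = [2.0, 0.0]
    print(attention_weights(current, history))  # first state gets the larger weight
    print(attend(current, history))
```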
2 code implementations • 13 Sep 2021 • Kai Li, Jie Yang, Siwei Ma, Bo Wang, Shanshe Wang, Yingjie Tian, Zhiquan Qi
For the second issue, we reconsider how to improve detection efficiency with excellent performance, and then propose our lightweight encoder-decoder architecture termed CarNet.
1 code implementation • 29 Jul 2021 • Wenkang Shan, Haopeng Lu, Shanshe Wang, Xinfeng Zhang, Wen Gao
To alleviate these two problems, we propose a relative information encoding method that yields positional and temporal enhanced representations.
Ranked #13 on Monocular 3D Human Pose Estimation on Human3.6M
no code implementations • 24 Jun 2021 • Chuanmin Jia, Ziqing Ge, Shanshe Wang, Siwei Ma, Wen Gao
End-to-end optimized neural image compression (NIC) has obtained superior lossy compression performance recently.
no code implementations • 21 Apr 2021 • Zhimeng Huang, Chuanmin Jia, Shanshe Wang, Siwei Ma
We first propose the region of interest for machine (ROIM) to evaluate the degree of importance for each coding tree unit (CTU) in visual analysis.
1 code implementation • 12 Oct 2020 • Lingbo Yang, Pan Wang, Zhanning Gao, Shanshe Wang, Peiran Ren, Siwei Ma, Wen Gao
Face restoration is an inherently ill-posed problem, where additional prior constraints are typically considered crucial for mitigating such pathology.
no code implementations • 19 Jul 2020 • Yuqing Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
Based on the observation, in this paper, we build a sequential hierarchical learning super-resolution network (SHSR) for effective image SR.
Ranked #12 on Image Super-Resolution on Manga109 - 3x upscaling
1 code implementation • 26 May 2020 • Lingbo Yang, Pan Wang, Xinfeng Zhang, Shanshe Wang, Zhanning Gao, Peiran Ren, Xuansong Xie, Siwei Ma, Wen Gao
The ability to produce convincing textural details is essential for the fidelity of synthesized person images.
Ranked #4 on Pose Transfer on Deep-Fashion
no code implementations • 26 May 2020 • Lingbo Yang, Pan Wang, Chang Liu, Zhanning Gao, Peiran Ren, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Xian-Sheng Hua, Wen Gao
Human pose transfer (HPT) is an emerging research topic with huge potential in fashion design, media production, online advertising and virtual reality.
1 code implementation • 20 May 2020 • Yuqing Liu, Shiqi Wang, Jian Zhang, Shanshe Wang, Siwei Ma, Wen Gao
A novel iterative super-resolution network (ISRN) is proposed on top of the iterative optimization.
5 code implementations • 11 May 2020 • Lingbo Yang, Chang Liu, Pan Wang, Shanshe Wang, Peiran Ren, Siwei Ma, Wen Gao
Existing face restoration research typically relies on either the degradation prior or explicit guidance labels for training, which often results in limited generalization ability over real-world images with heterogeneous degradations and rich background contents.
no code implementations • 21 Apr 2020 • Shurun Wang, Shiqi Wang, Wenhan Yang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
In particular, we study feature and texture compression in a scalable coding framework, where the base layer serves as the deep learning feature and the enhancement layer aims to perfectly reconstruct the texture.
no code implementations • 3 Jun 2019 • Junlong Gao, Xi Meng, Shiqi Wang, Xia Li, Shanshe Wang, Siwei Ma, Wen Gao
Existing captioning models often adopt the encoder-decoder architecture, where the decoder uses autoregressive decoding to generate captions, such that each token is generated sequentially given the preceding generated tokens.
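Autoregressive decoding as described above can be illustrated with a short greedy-decoding loop. This is a toy sketch: `TOY_BIGRAMS` and `toy_scores` are hypothetical stand-ins for a trained decoder and condition only on the last token, whereas a real captioning decoder would condition on the full prefix and on image features.

```python
from typing import Callable, Dict, List


def greedy_decode(next_token_scores: Callable[[List[str]], Dict[str, float]],
                  max_len: int = 10, bos: str = "<bos>", eos: str = "<eos>") -> List[str]:
    """Generate tokens one at a time, each chosen given the tokens before it."""
    tokens = [bos]
    for _ in range(max_len):
        scores = next_token_scores(tokens)
        best = max(scores, key=scores.get)  # greedy: take the top-scoring token
        if best == eos:
            break
        tokens.append(best)
    return tokens[1:]  # drop the start marker


# Hypothetical scoring table standing in for a trained decoder.
TOY_BIGRAMS: Dict[str, Dict[str, float]] = {
    "<bos>": {"a": 0.9, "dog": 0.1},
    "a": {"dog": 0.8, "<eos>": 0.2},
    "dog": {"runs": 0.6, "<eos>": 0.4},
    "runs": {"<eos>": 1.0},
}


def toy_scores(prefix: List[str]) -> Dict[str, float]:
    """Score candidate next tokens from the last token of the prefix."""
    return TOY_BIGRAMS[prefix[-1]]


if __name__ == "__main__":
    print(greedy_decode(toy_scores))
```

Because each step waits for the previous token, generation time grows with caption length, which is the sequential dependency the entry above refers to.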
no code implementations • CVPR 2019 • Junlong Gao, Shiqi Wang, Shanshe Wang, Siwei Ma, Wen Gao
Existing methods for image captioning are usually trained with cross-entropy loss, which leads to exposure bias and an inconsistency between the optimization objective and the evaluation metrics.
no code implementations • 7 Apr 2019 • Siwei Ma, Xinfeng Zhang, Chuanmin Jia, Zhenghui Zhao, Shiqi Wang, Shanshe Wang
The deep convolutional neural network (CNN), which has driven the resurgence of neural networks in recent years and achieved great success in both artificial intelligence and signal processing, also provides a novel and promising solution for image and video compression.
no code implementations • 14 Mar 2019 • Shurun Wang, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao
In this paper, we propose a scalable image compression scheme comprising a base layer for feature representation and an enhancement layer for texture representation.
no code implementations • 25 Sep 2017 • Chuanmin Jia, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Siwei Ma
Deep learning has demonstrated tremendous breakthroughs in the area of image/video processing.