Search Results for author: Wenxiu Sun

Found 40 papers, 27 papers with code

Learning Factorized Weight Matrix for Joint Image Filtering

no code implementations ICML 2020 Xiangyu Xu, Yongrui Ma, Wenxiu Sun

In this work, we propose to learn the weight matrix for joint image filtering.

Deep Surface Normal Estimation on the 2-Sphere with Confidence Guided Semantic Attention

no code implementations ECCV 2020 Quewei Li, Jie Guo, Yang Fei, Qinyu Tang, Wenxiu Sun, Jin Zeng, Yanwen Guo

We propose a deep convolutional neural network (CNN) to estimate surface normals from a single color image accompanied by a low-quality depth channel.

Surface Normal Estimation

Exploring the Effectiveness of Video Perceptual Representation in Blind Video Quality Assessment

no code implementations 8 Jul 2022 Liang Liao, Kangmin Xu, HaoNing Wu, Chaofeng Chen, Wenxiu Sun, Qiong Yan, Weisi Lin

Experiments show that the perceptual representation in the HVS is an effective way of predicting subjective temporal quality, and thus TPQI can, for the first time, achieve comparable performance to the spatial quality metric and be even more effective in assessing videos with large temporal variations.

Video Quality Assessment Visual Question Answering

FAST-VQA: Efficient End-to-end Video Quality Assessment with Fragment Sampling

1 code implementation 6 Jul 2022 HaoNing Wu, Chaofeng Chen, Jingwen Hou, Liang Liao, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin

Consisting of fragments and FANet, the proposed FrAgment Sample Transformer for VQA (FAST-VQA) enables efficient end-to-end deep VQA and learns effective video-quality-related representations.

Ranked #1 on Video Quality Assessment on YouTube-UGC (using extra training data)

Video Quality Assessment

DisCoVQA: Temporal Distortion-Content Transformers for Video Quality Assessment

no code implementations 20 Jun 2022 HaoNing Wu, Chaofeng Chen, Liang Liao, Jingwen Hou, Wenxiu Sun, Qiong Yan, Weisi Lin

Based on the prominent time-series modeling ability of transformers, we propose a novel and effective transformer-based VQA method to tackle these two issues.

Time Series Video Quality Assessment +1

FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting

1 code implementation ICCV 2021 Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li

On the contrary, the soft composition operates by stitching different patches into a whole feature map where pixels in overlapping regions are summed up.

Video Inpainting

Dual-Camera Super-Resolution with Aligned Attention Modules

2 code implementations ICCV 2021 Tengfei Wang, Jiaxin Xie, Wenxiu Sun, Qiong Yan, Qifeng Chen

We present a novel approach to reference-based super-resolution (RefSR) with the focus on dual-camera super-resolution (DCSR), which utilizes reference images for high-quality and high-fidelity results.

Domain Adaptation Reference-based Super-Resolution

A Categorized Reflection Removal Dataset with Diverse Real-world Scenes

no code implementations 7 Aug 2021 Chenyang Lei, Xuhua Huang, Chenyang Qi, Yankun Zhao, Wenxiu Sun, Qiong Yan, Qifeng Chen

Due to the lack of a large-scale reflection removal dataset with diverse real-world scenes, many existing reflection removal methods are trained on synthetic data plus a small amount of real-world data, which makes it difficult to evaluate the strengths or weaknesses of different reflection removal methods thoroughly.

Reflection Removal

Decoupled Spatial-Temporal Transformer for Video Inpainting

1 code implementation 14 Apr 2021 Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li

The seamless combination of these two novel designs forms a better spatial-temporal attention scheme, and our proposed model achieves better performance than state-of-the-art video inpainting approaches with significantly boosted efficiency.

Video Inpainting

Deep Animation Video Interpolation in the Wild

1 code implementation CVPR 2021 Li SiYao, Shiyu Zhao, Weijiang Yu, Wenxiu Sun, Dimitris N. Metaxas, Chen Change Loy, Ziwei Liu

In the animation industry, cartoon videos are usually produced at low frame rate since hand drawing of such frames is costly and time-consuming.

Optical Flow Estimation Video Frame Interpolation

Learning N:M Fine-grained Structured Sparse Neural Networks From Scratch

3 code implementations ICLR 2021 Aojun Zhou, Yukun Ma, Junnan Zhu, Jianbo Liu, Zhijie Zhang, Kun Yuan, Wenxiu Sun, Hongsheng Li

In this paper, we are the first to study training from scratch an N:M fine-grained structured sparse network, which can maintain the advantages of both unstructured fine-grained sparsity and structured coarse-grained sparsity simultaneously on specifically designed GPUs.

Exploiting Raw Images for Real-Scene Super-Resolution

1 code implementation 2 Feb 2021 Xiangyu Xu, Yongrui Ma, Wenxiu Sun, Ming-Hsuan Yang

In this paper, we study the problem of real-scene single image super-resolution to bridge the gap between synthetic data and real captured images.

Image Restoration Image Super-Resolution +1

Semi-synthesis: A fast way to produce effective datasets for stereo matching

no code implementations 26 Jan 2021 Ju He, Enyu Zhou, Liusheng Sun, Fei Lei, Chenyang Liu, Wenxiu Sun

Though synthetic datasets have been proposed to meet the demand for large amounts of data, fine-tuning on real datasets is still needed due to the domain variances between synthetic data and real data.

Stereo Matching

Learning Spatial and Spatio-Temporal Pixel Aggregations for Image and Video Denoising

3 code implementations 26 Jan 2021 Xiangyu Xu, Muchen Li, Wenxiu Sun, Ming-Hsuan Yang

We present a spatial pixel aggregation network and learn the pixel sampling and averaging strategies for image denoising.

Image Denoising Video Denoising

Enhanced Quadratic Video Interpolation

2 code implementations 10 Sep 2020 Yihao Liu, Liangbin Xie, Li Si-Yao, Wenxiu Sun, Yu Qiao, Chao Dong

In this work, we further improve the performance of QVI from three facets and propose an enhanced quadratic video interpolation (EQVI) model.

Super-Resolution Video Frame Interpolation

Towards Geometry Guided Neural Relighting with Flash Photography

no code implementations 12 Aug 2020 Di Qiu, Jin Zeng, Zhanghan Ke, Wenxiu Sun, Chengxi Yang

By incorporating the depth map, our approach is able to extrapolate realistic high-frequency effects under novel lighting via geometry guided image decomposition from the flashlight image, and predict the cast shadow map from the shadow-encoding transformed depth map.

Image Relighting Intrinsic Image Decomposition

Pix2Vox++: Multi-scale Context-aware 3D Object Reconstruction from Single and Multiple Images

3 code implementations 22 Jun 2020 Haozhe Xie, Hongxun Yao, Shengping Zhang, Shangchen Zhou, Wenxiu Sun

A multi-scale context-aware fusion module is then introduced to adaptively select high-quality reconstructions for different parts from all coarse 3D volumes to obtain a fused 3D volume.

3D Object Reconstruction

GRNet: Gridding Residual Network for Dense Point Cloud Completion

1 code implementation ECCV 2020 Haozhe Xie, Hongxun Yao, Shangchen Zhou, Jiageng Mao, Shengping Zhang, Wenxiu Sun

In particular, we devise two novel differentiable layers, named Gridding and Gridding Reverse, to convert between point clouds and 3D grids without losing structural information.

Point Cloud Completion

Quadratic video interpolation

1 code implementation NeurIPS 2019 Xiangyu Xu, Li Si-Yao, Wenxiu Sun, Qian Yin, Ming-Hsuan Yang

Video interpolation is an important problem in computer vision, which helps overcome the temporal limitation of camera sensors.

Toward 3D Object Reconstruction from Stereo Images

1 code implementation 18 Oct 2019 Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Xiaoshuai Sun, Wenxiu Sun

Inferring the 3D shape of an object from an RGB image has shown impressive results; however, existing methods rely primarily on recognizing the most similar 3D model from the training set to solve the problem.

3D Object Reconstruction

Deep End-to-End Alignment and Refinement for Time-of-Flight RGB-D Module

1 code implementation ICCV 2019 Di Qiu, Jiahao Pang, Wenxiu Sun, Chengxi Yang

It has recently become increasingly popular to equip mobile RGB cameras with Time-of-Flight (ToF) sensors for active depth sensing.

Optical Flow Estimation

Towards Real Scene Super-Resolution with Raw Images

1 code implementation CVPR 2019 Xiangyu Xu, Yongrui Ma, Wenxiu Sun

Most existing super-resolution methods do not perform well in real scenarios due to lack of realistic training data and information loss of the model input.

Image Super-Resolution Single Image Super Resolution

Learning Deformable Kernels for Image and Video Denoising

2 code implementations 15 Apr 2019 Xiangyu Xu, Muchen Li, Wenxiu Sun

Most of the classical denoising methods restore clear results by selecting and averaging pixels in the noisy input.

Image Denoising Video Denoising

DSR: Direct Self-rectification for Uncalibrated Dual-lens Cameras

1 code implementation 26 Sep 2018 Ruichao Xiao, Wenxiu Sun, Jiahao Pang, Qiong Yan, Jimmy Ren

Our method is evaluated on both realistic and synthetic stereo image pairs, and produces superior results compared to the calibrated rectification or other self-rectification approaches.

Stereo Matching Stereo Matching Hand

Confidence Inference for Focused Learning in Stereo Matching

no code implementations 25 Sep 2018 Ruichao Xiao, Wenxiu Sun, Chengxi Yang

Intuitively, the variance in the Laplacian distribution is large for low-confidence pixels and small for high-confidence pixels.

Stereo Matching Stereo Matching Hand

Monocular Depth Estimation with Affinity, Vertical Pooling, and Label Enhancement

no code implementations ECCV 2018 Yukang Gan, Xiangyu Xu, Wenxiu Sun, Liang Lin

While significant progress has been made in monocular depth estimation with Convolutional Neural Networks (CNNs) extracting absolute features, such as edges and textures, the depth constraint of neighboring pixels, namely relative features, has been mostly ignored by recent methods.

Monocular Depth Estimation Stereo Matching +1

Deep Graph Laplacian Regularization for Robust Denoising of Real Images

1 code implementation 31 Jul 2018 Jin Zeng, Jiahao Pang, Wenxiu Sun, Gene Cheung

In this work, we combine the robustness merit of model-based approaches and the learning power of data-driven approaches for real image denoising.

Domain Generalization Image Denoising +1

Zoom and Learn: Generalizing Deep Stereo Matching to Novel Domains

1 code implementation CVPR 2018 Jiahao Pang, Wenxiu Sun, Chengxi Yang, Jimmy Ren, Ruichao Xiao, Jin Zeng, Liang Lin

By feeding real stereo pairs of different domains to stereo models pre-trained with synthetic data, we see that: i) a pre-trained model does not generalize well to the new domain, producing artifacts at boundaries and ill-posed regions; however, ii) feeding an up-sampled stereo pair leads to a disparity map with extra details.

Stereo Matching Stereo Matching Hand

Single View Stereo Matching

1 code implementation CVPR 2018 Yue Luo, Jimmy Ren, Mude Lin, Jiahao Pang, Wenxiu Sun, Hongsheng Li, Liang Lin

The resulting model outperforms all the previous monocular depth estimation methods as well as the stereo block matching method on the challenging KITTI dataset while using only a small amount of real training data.

Ranked #18 on Monocular Depth Estimation on KITTI Eigen split (using extra training data)

Monocular Depth Estimation Stereo Matching +1

LSTM Pose Machines

1 code implementation CVPR 2018 Yue Luo, Jimmy Ren, Zhouxia Wang, Wenxiu Sun, Jinshan Pan, Jianbo Liu, Jiahao Pang, Liang Lin

Such suboptimal results are mainly attributed to the inability to impose sequential geometric consistency and handle severe image quality degradation (e.g., motion blur and occlusion), as well as the inability to capture the temporal correlation among video frames.

Pose Estimation

Image Dehazing using Bilinear Composition Loss Function

no code implementations 1 Oct 2017 Hui Yang, Jinshan Pan, Qiong Yan, Wenxiu Sun, Jimmy Ren, Yu-Wing Tai

In this paper, we introduce a bilinear composition loss function to address the problem of image dehazing.

Image Dehazing

Cascade Residual Learning: A Two-stage Convolutional Neural Network for Stereo Matching

1 code implementation 30 Aug 2017 Jiahao Pang, Wenxiu Sun, Jimmy SJ. Ren, Chengxi Yang, Qiong Yan

As opposed to directly learning the disparity at the second stage, we show that residual learning provides more effective refinement.

Stereo Matching Stereo Matching Hand

Robust Tracking Using Region Proposal Networks

no code implementations 30 May 2017 Jimmy Ren, ZHIYANG YU, Jianbo Liu, Rui Zhang, Wenxiu Sun, Jiahao Pang, Xiaohao Chen, Qiong Yan

Recent advances in visual tracking showed that deep Convolutional Neural Networks (CNN) trained for image classification can be strong feature extractors for discriminative trackers.

Classification Feature Engineering +4

Look, Listen and Learn - A Multimodal LSTM for Speaker Identification

no code implementations 13 Feb 2016 Jimmy Ren, Yongtao Hu, Yu-Wing Tai, Chuan Wang, Li Xu, Wenxiu Sun, Qiong Yan

This task not only requires collective perception over both visual and auditory signals; robustness to severe quality degradations and unconstrained content variations is also indispensable.

Speaker Identification

Shepard Convolutional Neural Networks

1 code implementation NeurIPS 2015 Jimmy SJ. Ren, Li Xu, Qiong Yan, Wenxiu Sun

In this paper, we draw on Shepard interpolation and design Shepard Convolutional Neural Networks (ShCNN), which efficiently realize end-to-end trainable TVI operators in the network.

Image Inpainting Super-Resolution +1
