Search Results for author: Xiangyu Xu

Found 37 papers, 20 papers with code

BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment

3 code implementations • CVPR 2022 • Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy

We show that by empowering the recurrent framework with the enhanced propagation and alignment, one can exploit spatiotemporal information across misaligned video frames more effectively.

Ranked #1 on Video Enhancement on MFQE v2

Analog Video Restoration Video Enhancement +1

6,564

Paper
Code

GLEAN: Generative Latent Bank for Image Super-Resolution and Beyond

1 code implementation • 29 Jul 2022 • Kelvin C. K. Chan, Xiangyu Xu, Xintao Wang, Jinwei Gu, Chen Change Loy

While most existing perceptual-oriented approaches attempt to generate realistic outputs through learning with adversarial loss, our method, Generative LatEnt bANk (GLEAN), goes beyond existing practices by directly leveraging rich and diverse priors encapsulated in a pre-trained GAN.

Colorization Image Colorization +2

6,564

Paper
Code

Investigating Tradeoffs in Real-World Video Super-Resolution

1 code implementation • CVPR 2022 • Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy

The diversity and complexity of degradations in real-world video super-resolution (VSR) pose non-trivial challenges in inference and training.

Ranked #9 on Video Super-Resolution on MSU Video Upscalers: Quality Enhancement

Benchmarking Video Super-Resolution

836

Paper
Code

On the Generalization of BasicVSR++ to Video Deblurring and Denoising

1 code implementation • 11 Apr 2022 • Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy

The exploitation of long-term information has been a long-standing problem in video restoration.

Deblurring Denoising +2

530

Paper
Code

3D Human Shape and Pose from a Single Low-Resolution Image with Self-Supervised Learning

2 code implementations • ECCV 2020 • Xiangyu Xu, Hao Chen, Francesc Moreno-Noguer, Laszlo A. Jeni, Fernando de la Torre

3D human shape and pose estimation from monocular images has been an active area of research in computer vision, having a substantial impact on the development of new applications, from activity recognition to creating virtual avatars.

Ranked #71 on 3D Human Pose Estimation on MPI-INF-3DHP

3D Human Pose Estimation 3D Shape Reconstruction +4

216

Paper
Code

Learning Spatial and Spatio-Temporal Pixel Aggregations for Image and Video Denoising

3 code implementations • 26 Jan 2021 • Xiangyu Xu, Muchen Li, Wenxiu Sun, Ming-Hsuan Yang

We present a spatial pixel aggregation network and learn the pixel sampling and averaging strategies for image denoising.

Image Denoising Video Denoising

216

Paper
Code

3D Human Pose, Shape and Texture from Low-Resolution Images and Videos

2 code implementations • 11 Mar 2021 • Xiangyu Xu, Hao Chen, Francesc Moreno-Noguer, Laszlo A. Jeni, Fernando de la Torre

Two common approaches to deal with low-resolution images are applying super-resolution techniques to the input, which may result in unpleasant artifacts, or simply training one model for each resolution, which is impractical in many realistic applications.

3D human pose and shape estimation Contrastive Learning +1

216

Paper
Code

3D Human Texture Estimation from a Single Image with Transformers

1 code implementation • ICCV 2021 • Xiangyu Xu, Chen Change Loy

We propose a Transformer-based framework for 3D human texture estimation from a single image.

Garment Reconstruction

216

Paper
Code

Towards Garment Sewing Pattern Reconstruction from a Single Image

1 code implementation • 7 Nov 2023 • Lijuan Liu, Xiangyu Xu, Zhijie Lin, Jiabin Liang, Shuicheng Yan

In this work, we explore the challenging problem of recovering garment sewing patterns from daily photos for augmenting these applications.

Garment Reconstruction Texture Synthesis +1

110

Paper
Code

Video Frame Interpolation Transformer

1 code implementation • CVPR 2022 • Zhihao Shi, Xiangyu Xu, Xiaohong Liu, Jun Chen, Ming-Hsuan Yang

Existing methods for video interpolation heavily rely on deep convolution neural networks, and thus suffer from their intrinsic limitations, such as content-agnostic kernel weights and restricted receptive field.

Video Frame Interpolation

Paper
Code

Learning Deformable Kernels for Image and Video Denoising

2 code implementations • 15 Apr 2019 • Xiangyu Xu, Muchen Li, Wenxiu Sun

Most of the classical denoising methods restore clear results by selecting and averaging pixels in the noisy input.

Image Denoising Video Denoising

Paper
Code

NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

1 code implementation • 21 Apr 2021 • Ren Yang, Radu Timofte, Jing Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng Li, Thomas Tanay, Fenglong Song, Wentao Chao, Qiang Guo, Yan Liu, Jiang Li, Xiaochao Qu, Dewang Hou, Jiayu Yang, Lyn Jiang, Di You, Zhenyu Zhang, Chong Mou, Iaroslav Koshelev, Pavel Ostyakov, Andrey Somov, Jia Hao, Xueyi Zou, Shijie Zhao, Xiaopeng Sun, Yiting Liao, Yuanzhi Zhang, Qing Wang, Gen Zhan, Mengxi Guo, Junlin Li, Ming Lu, Zhan Ma, Pablo Navarrete Michelini, Hai Wang, Yiyun Chen, Jingyu Guo, Liliang Zhang, Wenming Yang, Sijung Kim, Syehoon Oh, Yucong Wang, Minjie Cai, Wei Hao, Kangdi Shi, Liangyan Li, Jun Chen, Wei Gao, Wang Liu, XiaoYu Zhang, Linjie Zhou, Sixin Lin, Ru Wang

This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results.

Paper
Code

Towards Real Scene Super-Resolution with Raw Images

1 code implementation • CVPR 2019 • Xiangyu Xu, Yongrui Ma, Wenxiu Sun

Most existing super-resolution methods do not perform well in real scenarios due to lack of realistic training data and information loss of the model input.

Image Super-Resolution

Paper
Code

Exploiting Raw Images for Real-Scene Super-Resolution

1 code implementation • 2 Feb 2021 • Xiangyu Xu, Yongrui Ma, Wenxiu Sun, Ming-Hsuan Yang

In this paper, we study the problem of real-scene single image super-resolution to bridge the gap between synthetic data and real captured images.

Image Restoration Image Super-Resolution

Paper
Code

Quadratic video interpolation

1 code implementation • NeurIPS 2019 • Xiangyu Xu, Li Si-Yao, Wenxiu Sun, Qian Yin, Ming-Hsuan Yang

Video interpolation is an important problem in computer vision, which helps overcome the temporal limitation of camera sensors.

Paper
Code

Progressive Text-to-3D Generation for Automatic 3D Prototyping

1 code implementation • 26 Sep 2023 • Han Yi, Zhedong Zheng, Xiangyu Xu, Tat-Seng Chua

We aspire for our work to pave the way for automatic 3D prototyping via natural language descriptions.

3D Generation Text to 3D

Paper
Code

Cylin-Painting: Seamless {360\textdegree} Panoramic Image Outpainting and Beyond

1 code implementation • 18 Apr 2022 • Kang Liao, Xiangyu Xu, Chunyu Lin, Wenqi Ren, Yunchao Wei, Yao Zhao

Motivated by this analysis, we present a Cylin-Painting framework that involves meaningful collaborations between inpainting and outpainting and efficiently fuses the different arrangements, with a view to leveraging their complementary benefits on a seamless cylinder.

Depth Estimation Image Outpainting +3

Paper
Code

NARUTO: Neural Active Reconstruction from Uncertain Target Observations

1 code implementation • 29 Feb 2024 • Ziyue Feng, Huangying Zhan, Zheng Chen, Qingan Yan, Xiangyu Xu, Changjiang Cai, Bing Li, Qilun Zhu, Yi Xu

We present NARUTO, a neural active reconstruction system that combines a hybrid neural representation with uncertainty learning, enabling high-fidelity surface reconstruction.

Surface Reconstruction

Paper
Code

NU-MCC: Multiview Compressive Coding with Neighborhood Decoder and Repulsive UDF

1 code implementation • NeurIPS 2023 • Stefan Lionar, Xiangyu Xu, Min Lin, Gim Hee Lee

Second, our Repulsive UDF is a novel alternative to the occupancy field used in MCC, significantly improving the quality of 3D object reconstruction.

Ranked #1 on Single-View 3D Reconstruction on Common Objects in 3D

3D Object Reconstruction 3D Reconstruction +2

Paper
Code

DDM-NET: End-to-end learning of keypoint feature Detection, Description and Matching for 3D localization

1 code implementation • 8 Dec 2022 • Xiangyu Xu, Li Guan, Enrique Dunn, Haoxiang Li, Gang Hua

In this paper, we propose an end-to-end framework that jointly learns keypoint detection, descriptor representation and cross-frame matching for the task of image-based 3D localization.

Keypoint Detection

Paper
Code

Rendering Portraitures from Monocular Camera and Beyond

no code implementations • ECCV 2018 • Xiangyu Xu, Deqing Sun, Sifei Liu, Wenqi Ren, Yu-Jin Zhang, Ming-Hsuan Yang, Jian Sun

Specifically, we first exploit Convolutional Neural Networks to estimate the relative depth and portrait segmentation maps from a single input image.

Image Matting Portrait Segmentation +1

Paper
Add Code

Monocular Depth Estimation with Affinity, Vertical Pooling, and Label Enhancement

no code implementations • ECCV 2018 • Yukang Gan, Xiangyu Xu, Wenxiu Sun, Liang Lin

While significant progress has been made in monocular depth estimation with Convolutional Neural Networks (CNNs) extracting absolute features, such as edges and textures, the depth constraint of neighboring pixels, namely relative features, has been mostly ignored by recent methods.

Monocular Depth Estimation Stereo Matching +1

Paper
Add Code

Learning to Super-Resolve Blurry Face and Text Images

no code implementations • ICCV 2017 • Xiangyu Xu, Deqing Sun, Jinshan Pan, Yu-Jin Zhang, Hanspeter Pfister, Ming-Hsuan Yang

We present an algorithm to directly restore a clear high-resolution image from a blurry low-resolution input.

Deblurring Generative Adversarial Network +2

Paper
Add Code

Discrete Laplace Operator Estimation for Dynamic 3D Reconstruction

no code implementations • ICCV 2019 • Xiangyu Xu, Enrique Dunn

We present a general paradigm for dynamic 3D reconstruction from multiple independent and uncontrolled image sources having arbitrary temporal sampling density and distribution.

3D Reconstruction Event Segmentation

Paper
Add Code

Learning Factorized Weight Matrix for Joint Image Filtering

no code implementations • ICML 2020 • Xiangyu Xu, Yongrui Ma, Wenxiu Sun

In this work, we propose to learn the weight matrix for joint image filtering.

Feature Correlation

Paper
Add Code

GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution

no code implementations • CVPR 2021 • Kelvin C. K. Chan, Xintao Wang, Xiangyu Xu, Jinwei Gu, Chen Change Loy

We show that pre-trained Generative Adversarial Networks (GANs), e. g., StyleGAN, can be used as a latent bank to improve the restoration quality of large-factor image super-resolution (SR).

Image Super-Resolution

Paper
Add Code

GTT-Net: Learned Generalized Trajectory Triangulation

no code implementations • ICCV 2021 • Xiangyu Xu, Enrique Dunn

We present GTT-Net, a supervised learning framework for the reconstruction of sparse dynamic 3D geometry.

Event Segmentation

Paper
Add Code

The Nuts and Bolts of Adopting Transformer in GANs

no code implementations • 25 Oct 2021 • Rui Xu, Xiangyu Xu, Kai Chen, Bolei Zhou, Chen Change Loy

Transformer becomes prevalent in computer vision, especially for high-level vision tasks.

Generative Adversarial Network Image Generation

Paper
Add Code

Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering

no code implementations • 8 Dec 2021 • Mingfei Chen, Jianfeng Zhang, Xiangyu Xu, Lijuan Liu, Yujun Cai, Jiashi Feng, Shuicheng Yan

Meanwhile, for achieving higher rendering efficiency, we introduce a progressive rendering pipeline through geometry guidance, which leverages the geometric feature volume and the predicted density values to progressively reduce the number of sampling points and speed up the rendering process.

Paper
Add Code

CLIP-FLow: Contrastive Learning by semi-supervised Iterative Pseudo labeling for Optical Flow Estimation

no code implementations • 25 Oct 2022 • Zhiqi Zhang, Nitin Bansal, Changjiang Cai, Pan Ji, Qingan Yan, Xiangyu Xu, Yi Xu

To this end, we propose CLIP-FLow, a semi-supervised iterative pseudo-labeling framework to transfer the pretraining knowledge to the target real domain.

Contrastive Learning Optical Flow Estimation +1

Paper
Add Code

STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition

no code implementations • ICCV 2023 • Ming Li, Xiangyu Xu, Hehe Fan, Pan Zhou, Jun Liu, Jia-Wei Liu, Jiahe Li, Jussi Keppo, Mike Zheng Shou, Shuicheng Yan

For the first time, we introduce vision Transformers into PPAR by treating a video as a tubelet sequence, and accordingly design two complementary mechanisms, i. e., sparsification and anonymization, to remove privacy from a spatio-temporal perspective.

Action Recognition Facial Expression Recognition (FER) +2

Paper
Add Code

Dynamic Voxel Grid Optimization for High-Fidelity RGB-D Supervised Surface Reconstruction

no code implementations • 12 Apr 2023 • Xiangyu Xu, Lichang Chen, Changjiang Cai, Huangying Zhan, Qingan Yan, Pan Ji, Junsong Yuan, Heng Huang, Yi Xu

Direct optimization of interpolated features on multi-resolution voxel grids has emerged as a more efficient alternative to MLP-like modules.

Computational Efficiency Surface Reconstruction

Paper
Add Code

Instant3D: Instant Text-to-3D Generation

no code implementations • 14 Nov 2023 • Ming Li, Pan Zhou, Jia-Wei Liu, Jussi Keppo, Min Lin, Shuicheng Yan, Xiangyu Xu

Once trained, Instant3D is able to create a 3D object for an unseen text prompt in less than one second with a single run of a feedforward network.

3D Generation Negation +1

Paper
Add Code

PlanarNeRF: Online Learning of Planar Primitives with Neural Radiance Fields

no code implementations • 30 Dec 2023 • Zheng Chen, Qingan Yan, Huangying Zhan, Changjiang Cai, Xiangyu Xu, Yuzhong Huang, Weihan Wang, Ziyue Feng, Lantao Liu, Yi Xu

Through extensive experiments, we demonstrate the effectiveness of PlanarNeRF in various scenarios and remarkable improvement over existing works.

3D Plane Detection

Paper
Add Code

InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity

no code implementations • 21 Mar 2024 • Jiabin Liang, Lanqing Zhang, Zhuoran Zhao, Xiangyu Xu

The conventional mesh-based Level of Detail (LoD) technique, exemplified by applications such as Google Earth and many game engines, exhibits the capability to holistically represent a large scene even the Earth, and achieves rendering with a space complexity of O(log n).

Paper
Add Code

GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models

no code implementations • 10 Apr 2024 • Zewei Zhang, Huan Liu, Jun Chen, Xiangyu Xu

In this paper, we introduce GoodDrag, a novel approach to improve the stability and image quality of drag editing.

Benchmarking Denoising

Paper
Add Code

Motion-adaptive Separable Collaborative Filters for Blind Motion Deblurring

no code implementations • 19 Apr 2024 • Chengxu Liu, Xuan Wang, Xiangyu Xu, Ruhao Tian, Shuai Li, Xueming Qian, Ming-Hsuan Yang

In particular, we use a motion estimation network to capture motion information from neighborhoods, thereby adaptively estimating spatially-variant motion flow, mask, kernels, weights, and offsets to obtain the MISC Filter.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.