Search Results for author: Xiaogang Xu

Found 43 papers, 22 papers with code

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

1 code implementation 16 Apr 2024 Bin Ren, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, Wei Zhai, Renjing Pei, Jiaming Guo, Songcen Xu, Yang Cao, ZhengJun Zha, Yan Wang, Yi Liu, Qing Wang, Gang Zhang, Liou Zhang, Shijie Zhao, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Xin Liu, Min Yan, Menghan Zhou, Yiqiang Yan, Yixuan Liu, Wensong Chan, Dehua Tang, Dong Zhou, Li Wang, Lu Tian, Barsoum Emad, Bohan Jia, Junbo Qiao, Yunshuai Zhou, Yun Zhang, Wei Li, Shaohui Lin, Shenglong Zhou, Binbin Chen, Jincheng Liao, Suiyi Zhao, Zhao Zhang, Bo wang, Yan Luo, Yanyan Wei, Feng Li, Mingshen Wang, Yawei Li, Jinhan Guan, Dehua Hu, Jiawei Yu, Qisheng Xu, Tao Sun, Long Lan, Kele Xu, Xin Lin, Jingtong Yue, Lehan Yang, Shiyi Du, Lu Qi, Chao Ren, Zeyu Han, YuHan Wang, Chaolin Chen, Haobo Li, Mingjun Zheng, Zhongbao Yang, Lianhong Song, Xingzhuo Yan, Minghan Fu, Jingyi Zhang, Baiang Li, Qi Zhu, Xiaogang Xu, Dan Guo, Chunle Guo, Jiadi Chen, Huanhuan Long, Chunjiang Duanmu, Xiaoyan Lei, Jie Liu, Weilin Jia, Weifeng Cao, Wenlong Zhang, Yanyu Mao, Ruilong Guo, Nihao Zhang, Qian Wang, Manoj Pandey, Maksym Chernozhukov, Giang Le, Shuli Cheng, Hongyuan Wang, Ziyan Wei, Qingting Tang, Liejun Wang, Yongming Li, Yanhui Guo, Hao Xu, Akram Khatami-Rizi, Ahmad Mahmoudi-Aznaveh, Chih-Chung Hsu, Chia-Ming Lee, Yi-Shiuan Chou, Amogh Joshi, Nikhil Akalwadi, Sampada Malagi, Palani Yashaswini, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi

In sub-track 1, the practical runtime performance of the submissions was evaluated, and the corresponding score was used to determine the ranking.

Image Super-Resolution

Boosting Image Restoration via Priors from Pre-trained Models

no code implementations 11 Mar 2024 Xiaogang Xu, Shu Kong, Tao Hu, Zhe Liu, Hujun Bao

Pre-trained models with large-scale training data, such as CLIP and Stable Diffusion, have demonstrated remarkable performance in various high-level computer vision tasks such as image understanding and generation from language descriptions.

Deblurring Denoising +2

Learning to Remove Wrinkled Transparent Film with Polarized Prior

1 code implementation 7 Mar 2024 Jiaqi Tang, Ruizheng Wu, Xiaogang Xu, Sixing Hu, Ying-Cong Chen

We aim to remove interference from the film (specular highlights and other degradations) with an end-to-end framework.

Film Removal

UniMODE: Unified Monocular 3D Object Detection

no code implementations 28 Feb 2024 Zhuoling Li, Xiaogang Xu, SerNam Lim, Hengshuang Zhao

To address these challenges, we build a detector based on the bird's-eye-view (BEV) detection paradigm, where the explicit feature projection is beneficial to addressing the geometry learning ambiguity when employing multiple scenarios of data to train detectors.

Monocular 3D Object Detection Object +2

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

3 code implementations 19 Jan 2024 Lihe Yang, Bingyi Kang, Zilong Huang, Xiaogang Xu, Jiashi Feng, Hengshuang Zhao

To this end, we scale up the dataset by designing a data engine to collect and automatically annotate large-scale unlabeled data (~62M), which significantly enlarges the data coverage and thus is able to reduce the generalization error.

Ranked #3 on Monocular Depth Estimation on NYU-Depth V2 (using extra training data)

Data Augmentation Monocular Depth Estimation +1

Video Frame Interpolation with Region-Distinguishable Priors from SAM

no code implementations 26 Dec 2023 Yan Han, Xiaogang Xu, Yingqi Lin, Jiafei Wu, Zhe Liu

In existing Video Frame Interpolation (VFI) approaches, the motion estimation between neighboring frames plays a crucial role.

Motion Estimation Video Frame Interpolation

Geometric-Aware Low-Light Image and Video Enhancement via Depth Guidance

no code implementations 26 Dec 2023 Yingqi Lin, Xiaogang Xu, Yan Han, Jiafei Wu, Zhe Liu

First, a depth-aware feature extraction module is designed to inject depth priors into the image representation.

Video Enhancement

Self-supervised Learning for Enhancing Geometrical Modeling in 3D-Aware Generative Adversarial Network

no code implementations 19 Dec 2023 Jiarong Guo, Xiaogang Xu, Hengshuang Zhao

To address this, we present a Self-Supervised Learning (SSL) technique tailored as an auxiliary loss for any 3D-GAN, designed to improve its 3D geometrical modeling capabilities.

Generative Adversarial Network Self-Supervised Learning +1

CorresNeRF: Image Correspondence Priors for Neural Radiance Fields

1 code implementation NeurIPS 2023 Yixing Lao, Xiaogang Xu, Zhipeng Cai, Xihui Liu, Hengshuang Zhao

We present CorresNeRF, a novel method that leverages image correspondence priors computed by off-the-shelf methods to supervise NeRF training.

Novel View Synthesis Surface Reconstruction

Diffusion Noise Feature: Accurate and Fast Generated Image Detection

1 code implementation 5 Dec 2023 Yichi Zhang, Xiaogang Xu

DNF is extracted from the estimated noise generated during the inverse diffusion process.

Refine, Discriminate and Align: Stealing Encoders via Sample-Wise Prototypes and Multi-Relational Extraction

no code implementations 1 Dec 2023 Shuchi Wu, Chuan Ma, Kang Wei, Xiaogang Xu, Ming Ding, Yuwen Qian, Tao Xiang

This paper introduces RDA, a pioneering approach designed to address two primary deficiencies prevalent in previous endeavors aiming at stealing pre-trained encoders: (1) suboptimal performances attributed to biased optimization objectives, and (2) elevated query costs stemming from the end-to-end paradigm that necessitates querying the target encoder every epoch.

Clarity ChatGPT: An Interactive and Adaptive Processing System for Image Restoration and Enhancement

no code implementations 20 Nov 2023 Yanyan Wei, Zhao Zhang, Jiahuan Ren, Xiaogang Xu, Richang Hong, Yi Yang, Shuicheng Yan, Meng Wang

The generalization capability of existing image restoration and enhancement (IRE) methods is constrained by the limited pre-trained datasets, making it difficult to handle agnostic inputs such as different degradation levels and scenarios beyond their design scopes.

Image Restoration Language Modelling

LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching

1 code implementation 19 Nov 2023 Yixun Liang, Xin Yang, Jiantao Lin, Haodong Li, Xiaogang Xu, Yingcong Chen

The recent advancements in text-to-3D generation mark a significant milestone in generative models, unlocking new possibilities for creating imaginative 3D assets across various real-world scenarios.

3D Generation Text to 3D

Application of a Dense Fusion Attention Network in Fault Diagnosis of Centrifugal Fan

no code implementations 12 Nov 2023 Ruijun Wang, YuAn Liu, Zhixia Fan, Xiaogang Xu, Huijie Wang

However, it is still a challenge to understand the correspondence between the structure and function of the model and the diagnosis process.

SSL-Auth: An Authentication Framework by Fragile Watermarking for Pre-trained Encoders in Self-supervised Learning

no code implementations 9 Aug 2023 Xiaobei Li, Changchun Yin, Liyue Zhu, Xiaogang Xu, Liming Fang, Run Wang, Chenhao Lin

Self-supervised learning (SSL), a paradigm harnessing unlabeled datasets to train robust encoders, has recently witnessed substantial success.

Self-Supervised Learning

High Dynamic Range Image Reconstruction via Deep Explicit Polynomial Curve Estimation

1 code implementation 31 Jul 2023 Jiaqi Tang, Xiaogang Xu, Sixing Hu, Ying-Cong Chen

Besides, since all current datasets do not provide the corresponding relationship between the tone mapping function and the LDR image, we construct a new dataset with both synthetic and real images.

HDR Reconstruction Image Reconstruction +2

Lighting up NeRF via Unsupervised Decomposition and Enhancement

1 code implementation ICCV 2023 Haoyuan Wang, Xiaogang Xu, Ke Xu, Rynson WH. Lau

Neural Radiance Field (NeRF) is a promising approach for synthesizing novel views, given a set of images and the corresponding camera poses of a scene.

Low-Light Image Enhancement

Low-Light Image Enhancement via Structure Modeling and Guidance

1 code implementation CVPR 2023 Xiaogang Xu, RuiXing Wang, Jiangbo Lu

Moreover, to improve the appearance modeling, which is implemented with a simple U-Net, a novel structure-guided enhancement module is proposed with structure-guided feature synthesis layers.

Edge Detection Low-Light Image Enhancement

Leaf Cultivar Identification via Prototype-enhanced Learning

no code implementations 5 May 2023 Yiyi Zhang, Zhiwen Ying, Ying Zheng, Cuiling Wu, Nannan Li, Jun Wang, Xianzhong Feng, Xiaogang Xu

Plant leaf identification is crucial for biodiversity protection and conservation and has gradually attracted the attention of academia in recent years.

Fine-Grained Image Classification

TriVol: Point Cloud Rendering via Triple Volumes

1 code implementation CVPR 2023 Tao Hu, Xiaogang Xu, Ruihang Chu, Jiaya Jia

However, artifacts still appear in rendered images, due to the challenges in extracting continuous and discriminative 3D features from point clouds.

Point2Pix: Photo-Realistic Point Cloud Rendering via Neural Radiance Fields

no code implementations CVPR 2023 Tao Hu, Xiaogang Xu, Shu Liu, Jiaya Jia

Also, we present Point Encoding to build Multi-scale Radiance Fields that provide discriminative 3D point features.

valid

Out-of-domain GAN inversion via Invertibility Decomposition for Photo-Realistic Human Face Manipulation

1 code implementation ICCV 2023 Xin Yang, Xiaogang Xu, Yingcong Chen

In this paper, we propose a novel framework that enhances the fidelity of human face inversion by designing a new module to decompose the input images to ID and OOD partitions with invertibility masks.

Attribute

S2WAT: Image Style Transfer via Hierarchical Vision Transformer using Strips Window Attention

1 code implementation 22 Oct 2022 Chiyu Zhang, Xiaogang Xu, Lei Wang, Zaiyan Dai, Jun Yang

Transformer's recent integration into style transfer leverages its proficiency in establishing long-range dependencies, albeit at the expense of attenuated local modeling.

Style Transfer

Local Low-light Image Enhancement via Region-Aware Normalization

no code implementations 16 Aug 2022 Shihurong Yao, Yizhan Huang, Xiaogang Xu

RANLEN uses a dynamically designed mask-based normalization operation, which enhances an image in a spatially varying manner, ensuring that the enhancement results are consistent with the requirements specified by the input mask.

Low-Light Image Enhancement

DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation

1 code implementation 20 Jul 2022 Xin Lai, Zhuotao Tian, Xiaogang Xu, Yingcong Chen, Shu Liu, Hengshuang Zhao, LiWei Wang, Jiaya Jia

Unsupervised domain adaptation in semantic segmentation has been raised to alleviate the reliance on expensive pixel-wise annotations.

Segmentation Semantic Segmentation +2

Universal Adaptive Data Augmentation

no code implementations 14 Jul 2022 Xiaogang Xu, Hengshuang Zhao

Different from existing methods, UADA would adaptively update DA's parameters according to the target model's gradient information during training: given a pre-defined set of DA operations, we randomly decide types and magnitudes of DA operations for every data batch during training, and adaptively update DA's parameters along the gradient direction of the loss concerning DA's parameters.

Data Augmentation Image Classification +3

Deep Parametric 3D Filters for Joint Video Denoising and Illumination Enhancement in Video Super Resolution

1 code implementation 5 Jul 2022 Xiaogang Xu, RuiXing Wang, Chi-Wing Fu, Jiaya Jia

Despite the quality improvement brought by the recent methods, video super-resolution (SR) is still very challenging, especially for videos that are low-light and noisy.

Denoising Video Denoising +1

Towards Real-World Video Denosing: A Practical Video Denosing Dataset and Network

no code implementations 4 Jul 2022 Xiaogang Xu, Yitong Yu, Nianjuan Jiang, Jiangbo Lu, Bei Yu, Jiaya Jia

Moreover, we also propose a new video denoising framework, called Recurrent Video Denoising Transformer (RVDT), which can achieve SOTA performance on PVDD and other current video denoising benchmarks.

Denoising Video Denoising

How Well Do Self-Supervised Methods Perform in Cross-Domain Few-Shot Learning?

no code implementations 18 Feb 2022 Yiyi Zhang, Ying Zheng, Xiaogang Xu, Jun Wang

In this paper, we investigate the role of self-supervised representation learning in the context of CDFSL via a thorough evaluation of existing methods.

cross-domain few-shot learning Representation Learning +1

SNR-Aware Low-Light Image Enhancement

1 code implementation CVPR 2022 Xiaogang Xu, RuiXing Wang, Chi-Wing Fu, Jiaya Jia

They are long-range operations for image regions of extremely low Signal-to-Noise-Ratio (SNR) and short-range operations for other regions.

Low-Light Image Enhancement

Conditional Temporal Variational AutoEncoder for Action Video Prediction

no code implementations 12 Aug 2021 Xiaogang Xu, Yi Wang, LiWei Wang, Bei Yu, Jiaya Jia

To synthesize a realistic action sequence based on a single human image, it is crucial to model both motion patterns and diversity in the action video.

motion prediction Video Prediction

Self-Supervised 3D Mesh Reconstruction From Single Images

no code implementations CVPR 2021 Tao Hu, LiWei Wang, Xiaogang Xu, Shu Liu, Jiaya Jia

Recent single-view 3D reconstruction methods reconstruct an object's shape and texture from a single image with only 2D image-level annotation.

3D Reconstruction Attribute +2

General Adversarial Defense via Pixel Level and Feature Level Distribution Alignment

no code implementations 1 Jan 2021 Xiaogang Xu, Hengshuang Zhao, Philip Torr, Jiaya Jia

Specifically, compared with previous methods, we propose a more efficient pixel-level training constraint that reduces the difficulty of aligning adversarial samples with clean samples, which in turn noticeably enhances robustness to adversarial samples.

Adversarial Defense Image Classification +3

Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation

1 code implementation ICCV 2021 Xiaogang Xu, Hengshuang Zhao, Jiaya Jia

Adversarial training is promising for improving robustness of deep neural networks towards adversarial perturbations, especially on the classification task.

Segmentation Semantic Segmentation
