Search Results for author: Nan Xue

Found 39 papers, 24 papers with code

InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior

no code implementations • 17 Apr 2024 • Zhiheng Liu, Hao Ouyang, Qiuyu Wang, Ka Leong Cheng, Jie Xiao, Kai Zhu, Nan Xue, Yu Liu, Yujun Shen, Yang Cao

3D Gaussians have recently emerged as an efficient representation for novel view synthesis.

Depth Completion Novel View Synthesis

Paper
Add Code

SpatialTracker: Tracking Any 2D Pixels in 3D Space

no code implementations • 5 Apr 2024 • Yuxi Xiao, Qianqian Wang, Shangzhan Zhang, Nan Xue, Sida Peng, Yujun Shen, Xiaowei Zhou

Recovering dense and long-range pixel motion in videos is a challenging problem.

Paper
Add Code

Bridging 3D Gaussian and Mesh for Freeview Video Rendering

no code implementations • 18 Mar 2024 • Yuting Xiao, Xuan Wang, Jiafei Li, Hongrui Cai, Yanbo Fan, Nan Xue, Minghui Yang, Yujun Shen, Shenghua Gao

To this end, we propose a novel approach, GauMesh, to bridge the 3D Gaussian and Mesh for modeling and rendering the dynamic scenes.

Novel View Synthesis

Paper
Add Code

ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding

1 code implementation • NeurIPS 2023 • Lunhao Duan, Shanshan Zhao, Nan Xue, Mingming Gong, Gui-Song Xia, DaCheng Tao

Transformers have been recently explored for 3D point cloud understanding with impressive progress achieved.

Ranked #5 on Semantic Segmentation on S3DIS Area5

Semantic Segmentation

Paper
Code

Cross-level Attention with Overlapped Windows for Camouflaged Object Detection

no code implementations • 28 Nov 2023 • Jiepan Li, Fangxiao Lu, Nan Xue, Zhuohong Li, Hongyan zhang, wei he

In this paper, we propose an overlapped window cross-level attention (OWinCA) to achieve the low-level feature enhancement guided by the highest-level features.

object-detection Object Detection

Paper
Add Code

Patched Line Segment Learning for Vector Road Mapping

no code implementations • 6 Sep 2023 • Jiakun Xu, Bowen Xu, Gui-Song Xia, Liang Dong, Nan Xue

In our experiments, we demonstrate how an effective representation of a road graph significantly enhances the performance of vector road mapping on established benchmarks, without requiring extensive modifications to the neural network architecture.

Paper
Add Code

NEAT: Distilling 3D Wireframes from Neural Attraction Fields

1 code implementation • 14 Jul 2023 • Nan Xue, Bin Tan, Yuxi Xiao, Liang Dong, Gui-Song Xia, Tianfu Wu, Yujun Shen

Instead of leveraging matching-based solutions from 2D wireframes (or line segments) for 3D wireframe reconstruction as done in prior arts, we present NEAT, a rendering-distilling formulation using neural fields to represent 3D line segments with 2D observations, and bipartite matching for perceiving and distilling of a sparse set of 3D global junctions.

3D Wireframe Reconstruction Novel View Synthesis

Paper
Code

Depth and DOF Cues Make A Better Defocus Blur Detector

1 code implementation • 20 Jun 2023 • Yuxin Jin, Ming Qian, Jincheng Xiong, Nan Xue, Gui-Song Xia

Our method proposes a depth feature distillation strategy to obtain depth knowledge from a pre-trained monocular depth estimation model and uses a DOF-edge loss to understand the relationship between DOF and depth.

Ranked #1 on Defocus Blur Detection on EBD

Defocus Blur Detection Monocular Depth Estimation

Paper
Code

HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation

1 code implementation • CVPR 2023 • Jian Ding, Nan Xue, Gui-Song Xia, Bernt Schiele, Dengxin Dai

This work studies semantic segmentation under the domain generalization setting, where a model is trained only on the source domain and tested on the unseen target domain.

Domain Generalization Segmentation +1

Paper
Code

Monocular 3D Object Detection with Bounding Box Denoising in 3D by Perceiver

no code implementations • ICCV 2023 • Xianpeng Liu, Ce Zheng, Kelvin Cheng, Nan Xue, Guo-Jun Qi, Tianfu Wu

Motivated by a new and strong observation that this challenge can be remedied by a 3D-space local-grid search scheme in an ideal case, we propose a stage-wise approach, which combines the information flow from 2D-to-3D (3D bounding box proposal generation with a single 2D image) and 3D-to-2D (proposal verification by denoising with 3D-to-2D contexts) in a top-down manner.

Denoising Monocular 3D Object Detection +1

Paper
Add Code

Sat2Density: Faithful Density Learning from Satellite-Ground Image Pairs

1 code implementation • ICCV 2023 • Ming Qian, Jincheng Xiong, Gui-Song Xia, Nan Xue

This paper aims to develop an accurate 3D geometry representation of satellite images using satellite-ground image pairs.

Ranked #1 on Cross-View Image-to-Image Translation on CVACT

Cross-View Image-to-Image Translation Generalizable Novel View Synthesis +2

Paper
Code

NOPE-SAC: Neural One-Plane RANSAC for Sparse-View Planar 3D Reconstruction

1 code implementation • 30 Nov 2022 • Bin Tan, Nan Xue, Tianfu Wu, Gui-Song Xia

This paper studies the challenging two-view 3D reconstruction in a rigorous sparse-view configuration, which is suffering from insufficient correspondences in the input image pairs for camera pose estimation.

3D Reconstruction Pose Estimation

Paper
Code

Level-S$^2$fM: Structure from Motion on Neural Level Set of Implicit Surfaces

1 code implementation • CVPR 2023 • Yuxi Xiao, Nan Xue, Tianfu Wu, Gui-Song Xia

This paper presents a neural incremental Structure-from-Motion (SfM) approach, Level-S$^2$fM, which estimates the camera poses and scene geometry from a set of uncalibrated images by learning coordinate MLPs for the implicit surfaces and the radiance fields from the established keypoint correspondences.

3D Reconstruction Neural Rendering +1

124

Paper
Code

Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning

1 code implementation • 24 Oct 2022 • Nan Xue, Tianfu Wu, Song Bai, Fu-Dong Wang, Gui-Song Xia, Liangpei Zhang, Philip H. S. Torr

This article presents Holistically-Attracted Wireframe Parsing (HAWP), a method for geometric analysis of 2D images containing wireframes formed by line segments and junctions.

Self-Supervised Learning Wireframe Parsing

278

Paper
Code

HoW-3D: Holistic 3D Wireframe Perception from a Single Image

1 code implementation • 15 Aug 2022 • Wenchao Ma, Bin Tan, Nan Xue, Tianfu Wu, Xianwei Zheng, Gui-Song Xia

This paper studies the problem of holistic 3D wireframe perception (HoW-3D), a new task of perceiving both the visible 3D wireframes and the invisible ones from single-view 2D images.

Paper
Code

Accurate Polygonal Mapping of Buildings in Satellite Imagery

1 code implementation • 1 Aug 2022 • Bowen Xu, Jiakun Xu, Nan Xue, Gui-Song Xia

We addressed such an issue by exploiting the hierarchical supervision (of bottom-level vertices, mid-level line segments and the high-level regional masks) and proposed a novel interaction mechanism of feature embedding sourced from different levels of supervision signals to obtain reversible building masks for polygonal mapping of buildings.

114

Paper
Code

Revisiting Document Image Dewarping by Grid Regularization

no code implementations • CVPR 2022 • Xiangwei Jiang, Rujiao Long, Nan Xue, Zhibo Yang, Cong Yao, Gui-Song Xia

This paper addresses the problem of document image dewarping, which aims at eliminating the geometric distortion in document images for document digitization.

Ranked #3 on Local Distortion on DocUNet

Local Distortion Optical Flow Estimation

Paper
Add Code

Partial Wasserstein Adversarial Network for Non-rigid Point Set Registration

no code implementations • ICLR 2022 • Zi-Ming Wang, Nan Xue, Ling Lei, Gui-Song Xia

To handle large point sets, we propose a scalable PDM algorithm by utilizing the efficient partial Wasserstein-1 (PW) discrepancy.

Paper
Add Code

Decoupling Zero-Shot Semantic Segmentation

1 code implementation • CVPR 2022 • Jian Ding, Nan Xue, Gui-Song Xia, Dengxin Dai

2) a zero-shot classification task on segments.

Ranked #3 on Open Vocabulary Semantic Segmentation on COCO-Stuff-171

Open Vocabulary Semantic Segmentation Segmentation +3

159

Paper
Code

Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection

2 code implementations • 9 Dec 2021 • Xianpeng Liu, Nan Xue, Tianfu Wu

It presents the MonoCon method which learns Monocular Contexts, as auxiliary tasks in training, to help monocular 3D object detection.

Ranked #5 on Monocular 3D Object Detection on KITTI Cars Moderate

Monocular 3D Object Detection Object +2

141

Paper
Code

Learning Local-Global Contextual Adaptation for Multi-Person Pose Estimation

1 code implementation • CVPR 2022 • Nan Xue, Tianfu Wu, Gui-Song Xia, Liangpei Zhang

This paper studies the problem of multi-person pose estimation in a bottom-up fashion.

Multi-Person Pose Estimation

Paper
Code

Parsing Table Structures in the Wild

2 code implementations • ICCV 2021 • Rujiao Long, Wen Wang, Nan Xue, Feiyu Gao, Zhibo Yang, Yongpan Wang, Gui-Song Xia

In contrast to existing studies that mainly focus on parsing well-aligned tabular images with simple layouts from scanned PDF documents, we aim to establish a practical table structure parsing system for real-world scenarios where tabular input images are taken or scanned with severe deformation, bending or occlusions.

Object Detection

146

Paper
Code

LUAI Challenge 2021 on Learning to Understand Aerial Images

1 code implementation • 30 Aug 2021 • Gui-Song Xia, Jian Ding, Ming Qian, Nan Xue, Jiaming Han, Xiang Bai, Michael Ying Yang, Shengyang Li, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang, Qiang Zhou, Chao-hui Yu, Kaixuan Hu, Yingjia Bu, Wenming Tan, Zhe Yang, Wei Li, Shang Liu, Jiaxuan Zhao, Tianzhi Ma, Zi-han Gao, Lingqi Wang, Yi Zuo, Licheng Jiao, Chang Meng, Hao Wang, Jiahao Wang, Yiming Hui, Zhuojun Dong, Jie Zhang, Qianyue Bao, Zixiao Zhang, Fang Liu

This report summarizes the results of Learning to Understand Aerial Images (LUAI) 2021 challenge held on ICCV 2021, which focuses on object detection and semantic segmentation in aerial images.

Object object-detection +4

258

Paper
Code

PlaneTR: Structure-Guided Transformers for 3D Plane Recovery

no code implementations • ICCV 2021 • Bin Tan, Nan Xue, Song Bai, Tianfu Wu, Gui-Song Xia

This paper presents a neural network built upon Transformers, namely PlaneTR, to simultaneously detect and reconstruct planes from a single image.

Paper
Add Code

ReDet: A Rotation-equivariant Detector for Aerial Object Detection

4 code implementations • CVPR 2021 • Jiaming Han, Jian Ding, Nan Xue, Gui-Song Xia

More precisely, we incorporate rotation-equivariant networks into the detector to extract rotation-equivariant features, which can accurately predict the orientation and lead to a huge reduction of model size.

Ranked #18 on Object Detection In Aerial Images on DOTA (using extra training data)

Object object-detection +1

1,724

Paper
Code

Deep Graph Matching under Quadratic Constraint

1 code implementation • CVPR 2021 • Quankai Gao, Fudong Wang, Nan Xue, Jin-Gang Yu, Gui-Song Xia

Recently, deep learning based methods have demonstrated promising results on the graph matching problem, by relying on the descriptive capability of deep features extracted on graph nodes.

Ranked #7 on Graph Matching on Willow Object Class

Descriptive Graph Matching

Paper
Code

Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges

2 code implementations • 24 Feb 2021 • Jian Ding, Nan Xue, Gui-Song Xia, Xiang Bai, Wen Yang, Micheal Ying Yang, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang

In this paper, we present a large-scale Dataset of Object deTection in Aerial images (DOTA) and comprehensive baselines for ODAI.

Object object-detection +1

769

Paper
Code

Unmixing Convolutional Features for Crisp Edge Detection

1 code implementation • 19 Nov 2020 • Linxi Huan, Nan Xue, Xianwei Zheng, wei he, Jianya Gong, Gui-Song Xia

This paper presents a context-aware tracing strategy (CATS) for crisp edge detection with deep edge detectors, based on an observation that the localization ambiguity of deep edge detectors is mainly caused by the mixing phenomenon of convolutional neural networks: feature mixing in edge classification and side mixing during fusing side predictions.

Ranked #2 on Edge Detection on MDBD

Edge Classification Edge Detection

Paper
Code

Zero-Assignment Constraint for Graph Matching with Outliers

1 code implementation • CVPR 2020 • Fu-Dong Wang, Nan Xue, Jin-Gang Yu, Gui-Song Xia

Graph matching (GM), as a longstanding problem in computer vision and pattern recognition, still suffers from numerous cluttered outliers in practical applications.

Graph Matching valid

Paper
Code

Fisheye Distortion Rectification from Deep Straight Lines

no code implementations • 25 Mar 2020 • Zhu-Cun Xue, Nan Xue, Gui-Song Xia

This paper presents a novel line-aware rectification network (LaRecNet) to address the problem of fisheye distortion rectification based on the classical observation that straight lines in 3D space should be still straight in image planes.

SSIM

Paper
Add Code

Holistically-Attracted Wireframe Parsing

1 code implementation • CVPR 2020 • Nan Xue, Tianfu Wu, Song Bai, Fu-Dong Wang, Gui-Song Xia, Liangpei Zhang, Philip H. S. Torr

For computing line segment proposals, a novel exact dual representation is proposed which exploits a parsimonious geometric reparameterization for line segments and forms a holistic 4-dimensional attraction field map for an input image.

Ranked #4 on Line Segment Detection on York Urban Dataset

Line Segment Detection Wireframe Parsing

278

Paper
Code

Learning Regional Attraction for Line Segment Detection

no code implementations • 18 Dec 2019 • Nan Xue, Song Bai, Fu-Dong Wang, Gui-Song Xia, Tianfu Wu, Liangpei Zhang, Philip H. S. Torr

Given a line segment map, the proposed regional attraction first establishes the relationship between line segments and regions in the image lattice.

Line Segment Detection

Paper
Add Code

Learning to Calibrate Straight Lines for Fisheye Image Rectification

no code implementations • CVPR 2019 • Zhu-Cun Xue, Nan Xue, Gui-Song Xia, Weiming Shen

This paper presents a new deep-learning based method to simultaneously calibrate the intrinsic parameters of fisheye lens and rectify the distorted images.

Paper
Add Code

A Functional Representation for Graph Matching

1 code implementation • 16 Jan 2019 • Fu-Dong Wang, Gui-Song Xia, Nan Xue, Yi-Peng Zhang, Marcello Pelillo

In this paper, we present a functional representation for graph matching (FRGM) that aims to provide more geometric insights on the problem and reduce the space and time complexities of corresponding algorithms.

Graph Matching

Paper
Code

Learning Attraction Field Representation for Robust Line Segment Detection

1 code implementation • CVPR 2019 • Nan Xue, Song Bai, Fu-Dong Wang, Gui-Song Xia, Tianfu Wu, Liangpei Zhang

In experiments, our method is tested on the WireFrame dataset and the YorkUrban dataset with state-of-the-art performance obtained.

Line Segment Detection Semantic Segmentation

289

Paper
Code

Learning RoI Transformer for Detecting Oriented Objects in Aerial Images

1 code implementation • 1 Dec 2018 • Jian Ding, Nan Xue, Yang Long, Gui-Song Xia, Qikai Lu

Especially when detecting densely packed objects in aerial images, methods relying on horizontal proposals for common object detection often introduce mismatches between the Region of Interests (RoIs) and objects.

Ranked #48 on Object Detection In Aerial Images on DOTA (using extra training data)

General Classification Object +4

191

Paper
Code

GeoSay: A Geometric Saliency for Extracting Buildings in Remote Sensing Images

no code implementations • 7 Nov 2018 • Gui-Song Xia, Jin Huang, Nan Xue, Qikai Lu, Xiaoxiang Zhu

More precisely, given an image, the geometric saliency is derived from a mid-level geometric representations based on meaningful junctions that can locally describe geometrical structures of images.

Extracting Buildings In Remote Sensing Images

Paper
Add Code

Adaptively Transforming Graph Matching

no code implementations • ECCV 2018 • Fu-Dong Wang, Nan Xue, Yi-Peng Zhang, Xiang Bai, Gui-Song Xia

Due to an efficient Frank-Wolfe method-based optimization strategy, we can handle graphs with hundreds and thousands of nodes within an acceptable amount of time.

Domain Adaptation Graph Matching

Paper
Add Code

Anisotropic-Scale Junction Detection and Matching for Indoor Images

no code implementations • 16 Mar 2017 • Nan Xue, Gui-Song Xia, Xiang Bai, Liangpei Zhang, Weiming Shen

This paper presents a novel approach to junction detection and characterization that exploits the locally anisotropic geometries of a junction and estimates the scales of these geometries using an \emph{a contrario} model.

Junction Detection

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.