Search Results for author: Gui-Song Xia

Found 71 papers, 29 papers with code

All Grains, One Scheme (AGOS): Learning Multi-grain Instance Representation for Aerial Scene Classification

no code implementations6 May 2022 Qi Bi, Beichen Zhou, Kun Qin, Qinghao Ye, Gui-Song Xia

Finally, our SSF allows our framework to learn the same scene scheme from multi-grain instance representations and fuses them, so that the entire framework is optimized as a whole.

Aerial Scene Classification Multiple Instance Learning +1

Learning to Extract Building Footprints from Off-Nadir Aerial Images

1 code implementation28 Apr 2022 Jinwang Wang, Lingxuan Meng, Weijia Li, Wen Yang, Lei Yu, Gui-Song Xia

In this paper, we propose an offset vector learning scheme, which turns the building footprint extraction problem in off-nadir images into an instance-level joint prediction problem of the building roof and its corresponding "roof to footprint" offset vector.

An Empirical Study of Remote Sensing Pretraining

1 code implementation6 Apr 2022 Di Wang, Jing Zhang, Bo Du, Gui-Song Xia, DaCheng Tao

To this end, we train different networks from scratch with the help of the largest RS scene recognition dataset up to now -- MillionAID, to obtain a series of RS pretrained backbones, including both convolutional neural networks (CNN) and vision transformers such as Swin and ViTAE, which have shown promising performance on computer vision tasks.

Change Detection Object Detection +2

Revisiting Document Image Dewarping by Grid Regularization

no code implementations31 Mar 2022 Xiangwei Jiang, Rujiao Long, Nan Xue, Zhibo Yang, Cong Yao, Gui-Song Xia

This paper addresses the problem of document image dewarping, which aims at eliminating the geometric distortion in document images for document digitization.

Optical Flow Estimation

Expanding Low-Density Latent Regions for Open-Set Object Detection

1 code implementation28 Mar 2022 Jiaming Han, Yuqiang Ren, Jian Ding, Xingjia Pan, Ke Yan, Gui-Song Xia

Thus, unknown objects in low-density regions can be easily identified with the learned unknown probability.

Contrastive Learning Object Detection

Partial Wasserstein Adversarial Network for Non-rigid Point Set Registration

no code implementations ICLR 2022 Zi-Ming Wang, Nan Xue, Ling Lei, Gui-Song Xia

To handle large point sets, we propose a scalable PDM algorithm by utilizing the efficient partial Wasserstein-1 (PW) discrepancy.

Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling

no code implementations6 Jan 2022 Yang Long, Gui-Song Xia, Liangpei Zhang, Gong Cheng, Deren Li

Finally, we perform ASP by unifying the tile-level scene classification and object-based image analysis to achieve pixel-wise semantic labeling.

Aerial Scene Classification Classification +3

Hidden Path Selection Network for Semantic Segmentation of Remote Sensing Images

no code implementations9 Dec 2021 Kunping Yang, Xin-Yi Tong, Gui-Song Xia, Weiming Shen, Liangpei Zhang

Targeting at depicting land covers with pixel-wise semantic categories, semantic segmentation in remote sensing images needs to portray diverse distributions over vast geographical locations, which is difficult to be achieved by the homogeneous pixel-wise forward paths in the architectures of existing deep models.

Semantic Segmentation

Motion Deblurring with Real Events

no code implementations ICCV 2021 Fang Xu, Lei Yu, Bishan Wang, Wen Yang, Gui-Song Xia, Xu Jia, Zhendong Qiao, Jianzhuang Liu

In this paper, we propose an end-to-end learning framework for event-based motion deblurring in a self-supervised manner, where real-world events are exploited to alleviate the performance degradation caused by data inconsistency.

Deblurring

Parsing Table Structures in the Wild

1 code implementation ICCV 2021 Rujiao Long, Wen Wang, Nan Xue, Feiyu Gao, Zhibo Yang, Yongpan Wang, Gui-Song Xia

In contrast to existing studies that mainly focus on parsing well-aligned tabular images with simple layouts from scanned PDF documents, we aim to establish a practical table structure parsing system for real-world scenarios where tabular input images are taken or scanned with severe deformation, bending or occlusions.

PlaneTR: Structure-Guided Transformers for 3D Plane Recovery

no code implementations ICCV 2021 Bin Tan, Nan Xue, Song Bai, Tianfu Wu, Gui-Song Xia

This paper presents a neural network built upon Transformers, namely PlaneTR, to simultaneously detect and reconstruct planes from a single image.

ReDet: A Rotation-equivariant Detector for Aerial Object Detection

2 code implementations CVPR 2021 Jiaming Han, Jian Ding, Nan Xue, Gui-Song Xia

More precisely, we incorporate rotation-equivariant networks into the detector to extract rotation-equivariant features, which can accurately predict the orientation and lead to a huge reduction of model size.

Object Detection In Aerial Images

Deep Graph Matching under Quadratic Constraint

1 code implementation CVPR 2021 Quankai Gao, Fudong Wang, Nan Xue, Jin-Gang Yu, Gui-Song Xia

Recently, deep learning based methods have demonstrated promising results on the graph matching problem, by relying on the descriptive capability of deep features extracted on graph nodes.

Graph Matching

Event-based Synthetic Aperture Imaging with a Hybrid Network

1 code implementation CVPR 2021 Xiang Zhang, Wei Liao, Lei Yu, Wen Yang, Gui-Song Xia

Synthetic aperture imaging (SAI) is able to achieve the see through effect by blurring out the off-focus foreground occlusions and reconstructing the in-focus occluded targets from multi-view images.

Frame Style Transfer

Bidirectional Multi-scale Attention Networks for Semantic Segmentation of Oblique UAV Imagery

1 code implementation5 Feb 2021 Ye Lyu, George Vosselman, Gui-Song Xia, Michael Ying Yang

Semantic segmentation for aerial platforms has been one of the fundamental scene understanding task for the earth observation.

Scene Understanding Semantic Segmentation

3D Building Reconstruction From Monocular Remote Sensing Images

no code implementations ICCV 2021 Weijia Li, Lingxuan Meng, Jinwang Wang, Conghui He, Gui-Song Xia, Dahua Lin

3D building reconstruction from monocular remote sensing imagery is an important research problem and an economic solution to large-scale city modeling, compared with reconstruction from LiDAR data and multi-view imagery.

3D Reconstruction

Unmixing Convolutional Features for Crisp Edge Detection

1 code implementation19 Nov 2020 Linxi Huan, Nan Xue, Xianwei Zheng, wei he, Jianya Gong, Gui-Song Xia

This paper presents a context-aware tracing strategy (CATS) for crisp edge detection with deep edge detectors, based on an observation that the localization ambiguity of deep edge detectors is mainly caused by the mixing phenomenon of convolutional neural networks: feature mixing in edge classification and side mixing during fusing side predictions.

BSDS500 Edge Classification +1

Semantic Change Detection with Asymmetric Siamese Networks

no code implementations12 Oct 2020 Kunping Yang, Gui-Song Xia, Zicheng Liu, Bo Du, Wen Yang, Marcello Pelillo, Liangpei Zhang

Given two multi-temporal aerial images, semantic change detection aims to locate the land-cover variations and identify their change types with pixel-wise boundaries.

Change Detection

Mixed Noise Removal with Pareto Prior

no code implementations27 Aug 2020 Zhou Liu, Lei Yu, Gui-Song Xia, Hong Sun

To address this problem, we exploit the Pareto distribution as the priori of the weighting matrix, based on which an accurate and robust weight estimator is proposed for mixed noise removal.

Denoising

Align Deep Features for Oriented Object Detection

3 code implementations21 Aug 2020 Jiaming Han, Jian Ding, Jie Li, Gui-Song Xia

However most of existing methods rely on heuristically defined anchors with different scales, angles and aspect ratios and usually suffer from severe misalignment between anchor boxes and axis-aligned convolutional features, which leads to the common inconsistency between the classification score and localization accuracy.

Object Detection In Aerial Images

Event Enhanced High-Quality Image Recovery

1 code implementation ECCV 2020 Bishan Wang, Jingwei He, Lei Yu, Gui-Song Xia, Wen Yang

To recover high-quality intensity images, one should address both denoising and super-resolution problems for event cameras.

Denoising Frame +2

Implicit Euler ODE Networks for Single-Image Dehazing

no code implementations13 Jul 2020 Jiawei Shen, Zhuoyan Li, Lei Yu, Gui-Song Xia, Wen Yang

Deep convolutional neural networks (CNN) have been applied for image dehazing tasks, where the residual network (ResNet) is often adopted as the basic component to avoid the vanishing gradient problem.

Image Dehazing Single Image Dehazing

On Creating Benchmark Dataset for Aerial Image Interpretation: Reviews, Guidances and Million-AID

no code implementations22 Jun 2020 Yang Long, Gui-Song Xia, Shengyang Li, Wen Yang, Michael Ying Yang, Xiao Xiang Zhu, Liangpei Zhang, Deren Li

After reviewing existing benchmark datasets in the research community of RS image interpretation, this article discusses the problem of how to efficiently prepare a suitable benchmark dataset for RS image interpretation.

General Classification Image Classification +1

Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities

no code implementations3 May 2020 Gong Cheng, Xingxing Xie, Junwei Han, Lei Guo, Gui-Song Xia

Considering the rapid evolution of this field, this paper provides a systematic survey of deep learning methods for remote sensing image scene classification by covering more than 160 papers.

Classification General Classification +1

FGN: Fully Guided Network for Few-Shot Instance Segmentation

no code implementations CVPR 2020 Zhibo Fan, Jin-Gang Yu, Zhihao Liang, Jiarong Ou, Changxin Gao, Gui-Song Xia, Yuanqing Li

Few-shot instance segmentation (FSIS) conjoins the few-shot learning paradigm with general instance segmentation, which provides a possible way of tackling instance segmentation in the lack of abundant labeled data for training.

Few-Shot Learning Instance Segmentation +1

Zero-Assignment Constraint for Graph Matching with Outliers

1 code implementation CVPR 2020 Fu-Dong Wang, Nan Xue, Jin-Gang Yu, Gui-Song Xia

Graph matching (GM), as a longstanding problem in computer vision and pattern recognition, still suffers from numerous cluttered outliers in practical applications.

Graph Matching

Fisheye Distortion Rectification from Deep Straight Lines

no code implementations25 Mar 2020 Zhu-Cun Xue, Nan Xue, Gui-Song Xia

This paper presents a novel line-aware rectification network (LaRecNet) to address the problem of fisheye distortion rectification based on the classical observation that straight lines in 3D space should be still straight in image planes.

SSIM

Semantic Change Pattern Analysis

no code implementations7 Mar 2020 Wensheng Cheng, Yan Zhang, Xu Lei, Wen Yang, Gui-Song Xia

Change detection is an important problem in vision field, especially for aerial images.

Change Detection

Holistically-Attracted Wireframe Parsing

1 code implementation CVPR 2020 Nan Xue, Tianfu Wu, Song Bai, Fu-Dong Wang, Gui-Song Xia, Liangpei Zhang, Philip H. S. Torr

For computing line segment proposals, a novel exact dual representation is proposed which exploits a parsimonious geometric reparameterization for line segments and forms a holistic 4-dimensional attraction field map for an input image.

Line Segment Detection

Matching Neuromorphic Events and Color Images via Adversarial Learning

no code implementations2 Mar 2020 Fang Xu, ShiJie Lin, Wen Yang, Lei Yu, Dengxin Dai, Gui-Song Xia

The event camera has appealing properties: high dynamic range, low latency, low power consumption and low memory usage, and thus provides complementariness to conventional frame-based cameras.

Frame Image Retrieval

Plug & Play Convolutional Regression Tracker for Video Object Detection

2 code implementations2 Mar 2020 Ye Lyu, Michael Ying Yang, George Vosselman, Gui-Song Xia

As the tracker reuses the features from the detector, it is a very light-weighted increment to the detection network.

Video Object Detection

Learning Regional Attraction for Line Segment Detection

no code implementations18 Dec 2019 Nan Xue, Song Bai, Fu-Dong Wang, Gui-Song Xia, Tianfu Wu, Liangpei Zhang, Philip H. S. Torr

Given a line segment map, the proposed regional attraction first establishes the relationship between line segments and regions in the image lattice.

Line Segment Detection

Conditional Generative ConvNets for Exemplar-based Texture Synthesis

1 code implementation17 Dec 2019 Zi-Ming Wang, Meng-Han Li, Gui-Song Xia

Given a texture exemplar, the cgCNN model defines a conditional distribution using deep statistics of a ConvNet, and synthesize new textures by sampling from the conditional distribution.

Texture Synthesis

Gliding vertex on the horizontal bounding box for multi-oriented object detection

1 code implementation21 Nov 2019 Yongchao Xu, Mingtao Fu, Qimeng Wang, Yukang Wang, Kai Chen, Gui-Song Xia, Xiang Bai

Yet, the widely adopted horizontal bounding box representation is not appropriate for ubiquitous oriented objects such as objects in aerial images and scene texts.

Object Detection In Aerial Images Pedestrian Detection +1

LIP: Learning Instance Propagation for Video Object Segmentation

no code implementations30 Sep 2019 Ye Lyu, George Vosselman, Gui-Song Xia, Michael Ying Yang

In recent years, the task of segmenting foreground objects from background in a video, i. e. video object segmentation (VOS), has received considerable attention.

Data Augmentation Instance Segmentation +3

iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images

3 code implementations30 May 2019 Syed Waqas Zamir, Aditya Arora, Akshita Gupta, Salman Khan, Guolei Sun, Fahad Shahbaz Khan, Fan Zhu, Ling Shao, Gui-Song Xia, Xiang Bai

Compared to existing small-scale aerial image based instance segmentation datasets, iSAID contains 15$\times$ the number of object categories and 5$\times$ the number of instances.

Instance Segmentation Object Detection +1

Learning to Calibrate Straight Lines for Fisheye Image Rectification

no code implementations CVPR 2019 Zhu-Cun Xue, Nan Xue, Gui-Song Xia, Weiming Shen

This paper presents a new deep-learning based method to simultaneously calibrate the intrinsic parameters of fisheye lens and rectify the distorted images.

A Functional Representation for Graph Matching

1 code implementation16 Jan 2019 Fu-Dong Wang, Gui-Song Xia, Nan Xue, Yi-Peng Zhang, Marcello Pelillo

In this paper, we present a functional representation for graph matching (FRGM) that aims to provide more geometric insights on the problem and reduce the space and time complexities of corresponding algorithms.

Graph Matching

Mini-Unmanned Aerial Vehicle-Based Remote Sensing: Techniques, Applications, and Prospects

no code implementations19 Dec 2018 Tian-Zhu Xiang, Gui-Song Xia, Liangpei Zhang

We hope this paper will provide remote-sensing researchers an overall picture of recent UAV-based remote sensing developments and help guide the further research on this topic.

Learning Attraction Field Representation for Robust Line Segment Detection

1 code implementation CVPR 2019 Nan Xue, Song Bai, Fu-Dong Wang, Gui-Song Xia, Tianfu Wu, Liangpei Zhang

In experiments, our method is tested on the WireFrame dataset and the YorkUrban dataset with state-of-the-art performance obtained.

Ranked #4 on Line Segment Detection on York Urban Dataset (using extra training data)

Line Segment Detection Semantic Segmentation

Learning RoI Transformer for Detecting Oriented Objects in Aerial Images

1 code implementation1 Dec 2018 Jian Ding, Nan Xue, Yang Long, Gui-Song Xia, Qikai Lu

Especially when detecting densely packed objects in aerial images, methods relying on horizontal proposals for common object detection often introduce mismatches between the Region of Interests (RoIs) and objects.

General Classification Object Detection In Aerial Images

GeoSay: A Geometric Saliency for Extracting Buildings in Remote Sensing Images

no code implementations7 Nov 2018 Gui-Song Xia, Jin Huang, Nan Xue, Qikai Lu, Xiaoxiang Zhu

More precisely, given an image, the geometric saliency is derived from a mid-level geometric representations based on meaningful junctions that can locally describe geometrical structures of images.

Extracting Buildings In Remote Sensing Images

UAVid: A Semantic Segmentation Dataset for UAV Imagery

2 code implementations24 Oct 2018 Ye Lyu, George Vosselman, Gui-Song Xia, Alper Yilmaz, Michael Ying Yang

There already exist several semantic segmentation datasets for comparison among semantic segmentation methods in complex urban scenes, such as the Cityscapes and CamVid datasets, where the side views of the objects are captured with a camera mounted on the driving car.

Autonomous Driving Object Recognition +3

Texture Mixing by Interpolating Deep Statistics via Gaussian Models

no code implementations29 Jul 2018 Zi-Ming Wang, Gui-Song Xia, Yi-Peng Zhang

More precisely, we first reveal that the statistics used in existing deep models can be unified using a stationary Gaussian scheme.

Style Transfer Texture Synthesis

Adaptively Transforming Graph Matching

no code implementations ECCV 2018 Fu-Dong Wang, Nan Xue, Yi-Peng Zhang, Xiang Bai, Gui-Song Xia

Due to an efficient Frank-Wolfe method-based optimization strategy, we can handle graphs with hundreds and thousands of nodes within an acceptable amount of time.

Domain Adaptation Graph Matching

Land-Cover Classification with High-Resolution Remote Sensing Images Using Transferable Deep Models

no code implementations16 Jul 2018 Xin-Yi Tong, Gui-Song Xia, Qikai Lu, Huanfeng Shen, Shengyang Li, Shucheng You, Liangpei Zhang

The main idea is to rely on deep neural networks for presenting the contextual information contained in different types of land-covers and propose a pseudo-labeling and sample selection scheme for improving the transferability of deep models.

Classification General Classification

Large-scale Land Cover Classification in GaoFen-2 Satellite Imagery

no code implementations4 Jun 2018 Xin-Yi Tong, Qikai Lu, Gui-Song Xia, Liangpei Zhang

Many significant applications need land cover information of remote sensing images that are acquired from different areas and times, such as change detection and disaster monitoring.

Change Detection Classification +1

Recent advances and opportunities in scene classification of aerial images with deep models

no code implementations4 Jun 2018 Fan Hu, Gui-Song Xia, Wen Yang, Liangpei Zhang

Scene classification is a fundamental task in interpretation of remote sensing images, and has become an active research topic in remote sensing community due to its important role in a wide range of applications.

Classification General Classification +1

Accurate Building Detection in VHR Remote Sensing Images using Geometric Saliency

no code implementations4 Jun 2018 Jin Huang, Gui-Song Xia, Fan Hu, Liangpei Zhang

This paper aims to address the problem of detecting buildings from remote sensing images with very high resolution (VHR).

AID++: An Updated Version of AID on Scene Classification

no code implementations3 Jun 2018 Pu Jin, Gui-Song Xia, Fan Hu, Qikai Lu, Liangpei Zhang

Aerial image scene classification is a fundamental problem for understanding high-resolution remote sensing images and has become an active research task in the field of remote sensing due to its important role in a wide range of applications.

Aerial Scene Classification Classification +2

Learning the Synthesizability of Dynamic Texture Samples

no code implementations3 Feb 2018 Feng Yang, Gui-Song Xia, Dengxin Dai, Liangpei Zhang

In this paper, we investigate the synthesizability of dynamic texture samples: {\em given a dynamic texture sample, how synthesizable it is by using EDTS, and which EDTS method is the most suitable to synthesize it?}

Texture Synthesis

Deep learning in remote sensing: a review

1 code implementation11 Oct 2017 Xiao Xiang Zhu, Devis Tuia, Lichao Mou, Gui-Song Xia, Liangpei Zhang, Feng Xu, Friedrich Fraundorfer

In this article, we analyze the challenges of using deep learning for remote sensing data analysis, review the recent advances, and provide resources to make deep learning in remote sensing ridiculously simple to start with.

Exploiting Deep Features for Remote Sensing Image Retrieval: A Systematic Investigation

no code implementations23 Jul 2017 Xin-Yi Tong, Gui-Song Xia, Fan Hu, Yanfei Zhong, Mihai Datcu, Liangpei Zhang

Over the past two decades, a large amount of research on this task has been carried out, which mainly focuses on the following three core issues: feature extraction, similarity metric and relevance feedback.

Image Retrieval

Anisotropic-Scale Junction Detection and Matching for Indoor Images

no code implementations16 Mar 2017 Nan Xue, Gui-Song Xia, Xiang Bai, Liangpei Zhang, Weiming Shen

This paper presents a novel approach to junction detection and characterization that exploits the locally anisotropic geometries of a junction and estimates the scales of these geometries using an \emph{a contrario} model.

Junction Detection

Image Stitching by Line-guided Local Warping with Global Similarity Constraint

no code implementations25 Feb 2017 Tian-Zhu Xiang, Gui-Song Xia, Xiang Bai, Liangpei Zhang

On one hand, the line features are integrated into a local warping model through a designed weight function.

Image Stitching

Texture Characterization by Using Shape Co-occurrence Patterns

no code implementations10 Feb 2017 Gui-Song Xia, Gang Liu, Xiang Bai, Liangpei Zhang

In contrast with existing works, the proposed method not only inherits the strong ability to depict geometrical aspects of textures and the high robustness to variations of imaging conditions from the shape-based method, but also provides a flexible way to consider shape relationships and to compute high-order statistics on the tree.

Texture Classification

Multi-feature combined cloud and cloud shadow detection in GaoFen-1 wide field of view imagery

no code implementations17 Jun 2016 Zhiwei Li, Huanfeng Shen, Huifang Li, Gui-Song Xia, Paolo Gamba, Liangpei Zhang

In this paper, an automatic multi-feature combined (MFC) method is proposed for cloud and cloud shadow detection in GF-1 WFV imagery.

Cloud Detection Shadow Detection

Image stitching with perspective-preserving warping

no code implementations17 May 2016 Tian-Zhu Xiang, Gui-Song Xia, Liangpei Zhang

Image stitching algorithms often adopt the global transformation, such as homography, and work well for planar scenes or parallax free camera motions.

Image Stitching

Texture Synthesis Through Convolutional Neural Networks and Spectrum Constraints

2 code implementations4 May 2016 Gang Liu, Yann Gousseau, Gui-Song Xia

This paper presents a significant improvement for the synthesis of texture images using convolutional neural networks (CNNs), making use of constraints on the Fourier spectrum of the results.

Texture Synthesis

Dense v.s. Sparse: A Comparative Study of Sampling Analysis in Scene Classification of High-Resolution Remote Sensing Imagery

no code implementations4 Feb 2015 Jingwen Hu, Gui-Song Xia, Fan Hu, Liangpei Zhang

The experimental results on two commonly used datasets show that dense sampling has the best performance among all the strategies but with high spatial and computational complexity, random sampling gives better or comparable results than other sparse sampling methods, like the sophisticated multi-scale key-point operators and the saliency-based methods which are intensively studied and commonly used recently.

Classification General Classification +2

Meaningful Objects Segmentation from SAR Images via A Multi-Scale Non-Local Active Contour Model

no code implementations17 Jan 2015 Gui-Song Xia, Gang Liu, Wen Yang

The segmentation of synthetic aperture radar (SAR) images is a longstanding yet challenging task, not only because of the presence of speckle, but also due to the variations of surface backscattering properties in the images.

Semantic Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.