no code implementations • 16 Sep 2024 • Hanbo Bi, Yingchao Feng, Wenhui Diao, Peijin Wang, Yongqiang Mao, Kun fu, Hongqi Wang, Xian Sun
For more efficient generalization to unseen domains (classes), most Few-shot Segmentation (FSS) would directly exploit pre-trained encoders and only fine-tune the decoder, especially in the current era of large models.
no code implementations • 28 Aug 2024 • Fanglong Yao, Yuanchang Yue, Youzhi Liu, Xian Sun, Kun fu
Aerospace embodied intelligence aims to empower unmanned aerial vehicles (UAVs) and other aerospace platforms to achieve autonomous perception, cognition, and action, as well as egocentric active interaction with humans and the environment.
no code implementations • 30 May 2024 • Yong-Qiang Mao, Hanbo Bi, Xuexue Li, Kaiqiang Chen, Zhirui Wang, Xian Sun, Kun fu
Thanks to the application of deep learning technology in point cloud processing of the remote sensing field, point cloud segmentation has become a research hotspot in recent years, which can be applied to real-world 3D, smart cities, and other fields.
no code implementations • 27 May 2024 • Yong-Qiang Mao, Hanbo Bi, Liangyu Xu, Kaiqiang Chen, Zhirui Wang, Xian Sun, Kun fu
To solve the above problem, we re-examine the deformable learning method in the Multi-View Stereo task and propose a novel paradigm based on view Space and Depth deformable Learning (SDL-MVS), aiming to learn deformable interactions of features in different view spaces and deformably model the depth ranges and intervals to enable high accurate depth estimation.
no code implementations • 27 Mar 2024 • Liangyu Xu, Wanxuan Lu, Hongfeng Yu, Yongqiang Mao, Hanbo Bi, Chenglong Liu, Xian Sun, Kun fu
To address this issue, we introduce a novel task called Target-Aware Aerial Video Prediction, aiming to simultaneously predict future scenes and motion states of the target.
no code implementations • 28 Feb 2024 • Liangyu Xu, Wanxuan Lu, Hongfeng Yu, Fanglong Yao, Xian Sun, Kun fu
The model leverages stacked multiple SFT-Blocks to not only mine the correlation of the spatiotemporal dynamics of echo cells but also avoid the mutual interference between the temporal modeling and the spatial morphology refinement by decoupling them.
no code implementations • 26 Jan 2024 • Kun fu, Ying Dai
The method localizes the candidate regions of the ingredients using the locating and sliding window techniques.
no code implementations • CVPR 2024 • Jiaming Zhuo, Feiyang Qin, Can Cui, Kun fu, bingxin niu, Mengzhu Wang, Yuanfang Guo, Chuan Wang, Zhen Wang, Xiaochun Cao, Liang Yang
Graph Contrastive Learning (GCL) a Self-Supervised Learning (SSL) architecture tailored for graphs has shown notable potential for mitigating label scarcity.
no code implementations • 10 Apr 2023 • Guoru Zhou, Zhongqiu Xu, Yizhe Fan, Zhe Zhang, Xiaolan Qiu, Bingchen Zhang, Kun fu, Yirong Wu
High-resolution is a key trend in the development of synthetic aperture radar (SAR), which enables the capture of fine details and accurate representation of backscattering properties.
no code implementations • 11 Jan 2023 • Yongqiang Mao, Kaiqiang Chen, Liangjin Zhao, Wei Chen, Deke Tang, Wenjie Liu, Zhirui Wang, Wenhui Diao, Xian Sun, Kun fu
Our Building3D is rooted in the SFFDE network for building elevation prediction, synchronized with a building extraction network for building masks, and then sequentially performs point cloud reconstruction, surface reconstruction (or CityGML model reconstruction).
no code implementations • 27 Nov 2022 • Xiaonan Lu, Wenhui Diao, Yongqiang Mao, Junxi Li, Peijin Wang, Xian Sun, Kun fu
Few-shot object detection, expecting detectors to detect novel classes with a few instances, has made conspicuous progress.
1 code implementation • 21 Jul 2022 • Yongqiang Mao, Kaiqiang Chen, Wenhui Diao, Xian Sun, Xiaonan Lu, Kun fu, Martin Weinmann
With receptive field fusion-and-stratification, RFFS-Net is more adaptable to the classification of regions with complex structures and extreme scale variations in large-scale ALS point clouds.
1 code implementation • 21 Apr 2022 • Zhiqiang Yuan, Wenkai Zhang, Kun fu, Xuan Li, Chubo Deng, Hongqi Wang, Xian Sun
Our model adapts to multi-scale feature inputs, favors multi-source retrieval methods, and can dynamically filter redundant features.
Ranked #10 on Cross-Modal Retrieval on RSITMD
1 code implementation • 21 Apr 2022 • Zhiqiang Yuan, Wenkai Zhang, Changyuan Tian, Xuee Rong, Zhengyuan Zhang, Hongqi Wang, Kun fu, Xian Sun
In this article, we first propose a novel RSCTIR framework based on global and local information (GaLR), and design a multi-level information dynamic fusion (MIDF) module to efficaciously integrate features of different levels.
Ranked #8 on Cross-Modal Retrieval on RSITMD
no code implementations • 11 Apr 2022 • Yongqiang Mao, Xian Sun, Kaiqiang Chen, Wenhui Diao, Zonghao Guo, Xiaonan Lu, Kun fu
Due to the unicity of receptive field, semantic segmentation of point clouds remains challenging for the expression of multi-receptive field features, which brings about the misclassification of instances with similar spatial structures.
no code implementations • IEEE Transactions on Geoscience and Remote Sensing 2021 • Bing Wang, Zhirui Wang, Xian Sun, Hongqi Wang, Kun fu
After metatraining, DMML-Net can be applied for the few-shot segmentation tasks of novel geographic objects with only a few gradient steps on the small training set.
no code implementations • 9 Mar 2021 • Xian Sun, Peijin Wang, Zhiyuan Yan, Feng Xu, Ruiping Wang, Wenhui Diao, Jin Chen, Jihao Li, Yingchao Feng, Tao Xu, Martin Weinmann, Stefan Hinz, Cheng Wang, Kun fu
In this paper, we propose a novel benchmark dataset with more than 1 million instances and more than 15, 000 images for Fine-grAined object recognItion in high-Resolution remote sensing imagery which is named as FAIR1M.
no code implementations • 24 Jun 2020 • Yiwen Sun, Kun fu, Zheng Wang, Chang-Shui Zhang, Jieping Ye
To address the data sparsity problem, we propose the Road Network Metric Learning framework for ETA (RNML-ETA).
no code implementations • 7 Jun 2020 • Yiwen Sun, Yulu Wang, Kun fu, Zheng Wang, Chang-Shui Zhang, Jieping Ye
Furthermore, in order to evaluate Fusion RNN's sequence feature extraction capability, we choose a representative data mining task for sequence data, estimated time of arrival (ETA) and present a novel model based on Fusion RNN.
no code implementations • 7 Jun 2020 • Yiwen Sun, Yulu Wang, Kun fu, Zheng Wang, Ziang Yan, Chang-Shui Zhang, Jieping Ye
Estimated time of arrival (ETA) is one of the most important services in intelligent transportation systems and becomes a challenging spatial-temporal (ST) data mining task in recent years.
no code implementations • 23 Apr 2020 • Yiwen Sun, Yulu Wang, Kun fu, Zheng Wang, Chang-Shui Zhang, Jieping Ye
Recently, deep learning based methods have achieved promising results by adopting graph convolutional network (GCN) to extract the spatial correlations and recurrent neural network (RNN) to capture the temporal dependencies.
no code implementations • 9 Jan 2020 • Ruigang Niu, Xian Sun, Yu Tian, Wenhui Diao, Kaiqiang Chen, Kun fu
Semantic segmentation in very high resolution (VHR) aerial images is one of the most challenging tasks in remote sensing image understanding.
no code implementations • 4 Apr 2019 • Tengfei Zhang, Yue Zhang, Xian Sun, Hao Sun, Menglong Yan, Xue Yang, Kun fu
A two-stage detector for OSCD is introduced to compare the extracted query and target features with the learnable metric to approach the optimized non-linear conditional probability.
no code implementations • 4 Apr 2019 • Tengfei Zhang, Yue Zhang, Xian Sun, Menglong Yan, Yaoling Wang, Kun fu
Deep learning based object detection has achieved great success.
2 code implementations • 3 Jan 2019 • Daoyu Lin, Guangluan Xu, Xiaoke Wang, Yang Wang, Xian Sun, Kun fu
Removing clouds is an indispensable pre-processing step in remote sensing image analysis.
3 code implementations • ICCV 2019 • Xue Yang, Jirui Yang, Junchi Yan, Yue Zhang, Tengfei Zhang, Zhi Guo, Sun Xian, Kun fu
Specifically, a sampling fusion network is devised which fuses multi-layer feature with effective anchor sampling, to improve the sensitivity to small objects.
Ranked #48 on Object Detection In Aerial Images on DOTA (using extra training data)
3 code implementations • 13 Jun 2018 • Xue Yang, Hao Sun, Xian Sun, Menglong Yan, Zhi Guo, Kun fu
The complexity of application scenarios, the redundancy of detection region, and the difficulty of dense ship detection are all the main obstacles that limit the successful operation of traditional methods in ship detection.
4 code implementations • 12 Jun 2018 • Xue Yang, Hao Sun, Kun fu, Jirui Yang, Xian Sun, Menglong Yan, Zhi Guo
Additionally, in the case of ship rotation and dense arrangement, we design a rotation anchor strategy to predict the minimum circumscribed rectangle of the object so as to reduce the redundant detection region and improve the recall.
no code implementations • 30 Jan 2018 • Hongzhi Zhang, Guandong Xu, Xiao Liang, Tinglei Huang, Kun fu
Then, instead of merging the sequence into a single vector with pooling operation, soft alignments between words from the question and the relation are learned.
no code implementations • 28 Dec 2016 • Daoyu Lin, Kun fu, Yang Wang, Guangluan Xu, Xian Sun
With the development of deep learning, supervised learning has frequently been adopted to classify remotely sensed images using convolutional networks (CNNs).
no code implementations • 1 Sep 2016 • Junqi Jin, Ziang Yan, Kun fu, Nan Jiang, Chang-Shui Zhang
Deep learning models' architectures, including depth and width, are key factors influencing models' performance, such as test accuracy and computation time.
no code implementations • 29 Aug 2016 • Junqi Jin, Ziang Yan, Kun fu, Nan Jiang, Chang-Shui Zhang
A greedy algorithm with bounds is suggested to solve the transformed problem.
1 code implementation • 20 Jun 2015 • Junqi Jin, Kun fu, Runpeng Cui, Fei Sha, Chang-Shui Zhang
In this paper, we propose an image caption system that exploits the parallel structures between images and sentences.
no code implementations • 15 Jan 2013 • Zhen Hu, Kun fu, Chang-Shui Zhang
We think our method is promising even though we test it in a different data set, since our data set is comparable to that in MIREX by size.