Search Results for author: Kun fu

Found 34 papers, 8 papers with code

Prompt-and-Transfer: Dynamic Class-aware Enhancement for Few-shot Segmentation

no code implementations16 Sep 2024 Hanbo Bi, Yingchao Feng, Wenhui Diao, Peijin Wang, Yongqiang Mao, Kun fu, Hongqi Wang, Xian Sun

For more efficient generalization to unseen domains (classes), most Few-shot Segmentation (FSS) would directly exploit pre-trained encoders and only fine-tune the decoder, especially in the current era of large models.

Decoder Task 2 +1

AeroVerse: UAV-Agent Benchmark Suite for Simulating, Pre-training, Finetuning, and Evaluating Aerospace Embodied World Models

no code implementations28 Aug 2024 Fanglong Yao, Yuanchang Yue, Youzhi Liu, Xian Sun, Kun fu

Aerospace embodied intelligence aims to empower unmanned aerial vehicles (UAVs) and other aerospace platforms to achieve autonomous perception, cognition, and action, as well as egocentric active interaction with humans and the environment.

Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes

no code implementations30 May 2024 Yong-Qiang Mao, Hanbo Bi, Xuexue Li, Kaiqiang Chen, Zhirui Wang, Xian Sun, Kun fu

Thanks to the application of deep learning technology in point cloud processing of the remote sensing field, point cloud segmentation has become a research hotspot in recent years, which can be applied to real-world 3D, smart cities, and other fields.

Point Cloud Segmentation Segmentation +1

SDL-MVS: View Space and Depth Deformable Learning Paradigm for Multi-View Stereo Reconstruction in Remote Sensing

no code implementations27 May 2024 Yong-Qiang Mao, Hanbo Bi, Liangyu Xu, Kaiqiang Chen, Zhirui Wang, Xian Sun, Kun fu

To solve the above problem, we re-examine the deformable learning method in the Multi-View Stereo task and propose a novel paradigm based on view Space and Depth deformable Learning (SDL-MVS), aiming to learn deformable interactions of features in different view spaces and deformably model the depth ranges and intervals to enable high accurate depth estimation.

3D Reconstruction Depth Estimation

TAFormer: A Unified Target-Aware Transformer for Video and Motion Joint Prediction in Aerial Scenes

no code implementations27 Mar 2024 Liangyu Xu, Wanxuan Lu, Hongfeng Yu, Yongqiang Mao, Hanbo Bi, Chenglong Liu, Xian Sun, Kun fu

To address this issue, we introduce a novel task called Target-Aware Aerial Video Prediction, aiming to simultaneously predict future scenes and motion states of the target.

Disaster Response Object Tracking +1

SFTformer: A Spatial-Frequency-Temporal Correlation-Decoupling Transformer for Radar Echo Extrapolation

no code implementations28 Feb 2024 Liangyu Xu, Wanxuan Lu, Hongfeng Yu, Fanglong Yao, Xian Sun, Kun fu

The model leverages stacked multiple SFT-Blocks to not only mine the correlation of the spatiotemporal dynamics of echo cells but also avoid the mutual interference between the temporal modeling and the spatial morphology refinement by decoupling them.

Recognizing Multiple Ingredients in Food Images Using a Single-Ingredient Classification Model

no code implementations26 Jan 2024 Kun fu, Ying Dai

The method localizes the candidate regions of the ingredients using the locating and sliding window techniques.

Decision Making

Improving Graph Contrastive Learning via Adaptive Positive Sampling

no code implementations CVPR 2024 Jiaming Zhuo, Feiyang Qin, Can Cui, Kun fu, bingxin niu, Mengzhu Wang, Yuanfang Guo, Chuan Wang, Zhen Wang, Xiaochun Cao, Liang Yang

Graph Contrastive Learning (GCL) a Self-Supervised Learning (SSL) architecture tailored for graphs has shown notable potential for mitigating label scarcity.

Contrastive Learning Self-Supervised Learning

SPHR-SAR-Net: Superpixel High-resolution SAR Imaging Network Based on Nonlocal Total Variation

no code implementations10 Apr 2023 Guoru Zhou, Zhongqiu Xu, Yizhe Fan, Zhe Zhang, Xiaolan Qiu, Bingchen Zhang, Kun fu, Yirong Wu

High-resolution is a key trend in the development of synthetic aperture radar (SAR), which enables the capture of fine details and accurate representation of backscattering properties.

Elevation Estimation-Driven Building 3D Reconstruction from Single-View Remote Sensing Imagery

no code implementations11 Jan 2023 Yongqiang Mao, Kaiqiang Chen, Liangjin Zhao, Wei Chen, Deke Tang, Wenjie Liu, Zhirui Wang, Wenhui Diao, Xian Sun, Kun fu

Our Building3D is rooted in the SFFDE network for building elevation prediction, synchronized with a building extraction network for building masks, and then sequentially performs point cloud reconstruction, surface reconstruction (or CityGML model reconstruction).

Point cloud reconstruction Surface Reconstruction

Beyond single receptive field: A receptive field fusion-and-stratification network for airborne laser scanning point cloud classification

1 code implementation21 Jul 2022 Yongqiang Mao, Kaiqiang Chen, Wenhui Diao, Xian Sun, Xiaonan Lu, Kun fu, Martin Weinmann

With receptive field fusion-and-stratification, RFFS-Net is more adaptable to the classification of regions with complex structures and extreme scale variations in large-scale ALS point clouds.

Classification Point Cloud Classification

Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval

1 code implementation21 Apr 2022 Zhiqiang Yuan, Wenkai Zhang, Kun fu, Xuan Li, Chubo Deng, Hongqi Wang, Xian Sun

Our model adapts to multi-scale feature inputs, favors multi-source retrieval methods, and can dynamically filter redundant features.

Cross-Modal Retrieval Image Retrieval +3

Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information

1 code implementation21 Apr 2022 Zhiqiang Yuan, Wenkai Zhang, Changyuan Tian, Xuee Rong, Zhengyuan Zhang, Hongqi Wang, Kun fu, Xian Sun

In this article, we first propose a novel RSCTIR framework based on global and local information (GaLR), and design a multi-level information dynamic fusion (MIDF) module to efficaciously integrate features of different levels.

Cross-Modal Retrieval Image Retrieval +1

Semantic Segmentation for Point Cloud Scenes via Dilated Graph Feature Aggregation and Pyramid Decoders

no code implementations11 Apr 2022 Yongqiang Mao, Xian Sun, Kaiqiang Chen, Wenhui Diao, Zonghao Guo, Xiaonan Lu, Kun fu

Due to the unicity of receptive field, semantic segmentation of point clouds remains challenging for the expression of multi-receptive field features, which brings about the misclassification of instances with similar spatial structures.

Diversity Segmentation +1

FAIR1M: A Benchmark Dataset for Fine-grained Object Recognition in High-Resolution Remote Sensing Imagery

no code implementations9 Mar 2021 Xian Sun, Peijin Wang, Zhiyuan Yan, Feng Xu, Ruiping Wang, Wenhui Diao, Jin Chen, Jihao Li, Yingchao Feng, Tao Xu, Martin Weinmann, Stefan Hinz, Cheng Wang, Kun fu

In this paper, we propose a novel benchmark dataset with more than 1 million instances and more than 15, 000 images for Fine-grAined object recognItion in high-Resolution remote sensing imagery which is named as FAIR1M.

Object object-detection +2

Road Network Metric Learning for Estimated Time of Arrival

no code implementations24 Jun 2020 Yiwen Sun, Kun fu, Zheng Wang, Chang-Shui Zhang, Jieping Ye

To address the data sparsity problem, we propose the Road Network Metric Learning framework for ETA (RNML-ETA).

Metric Learning

Fusion Recurrent Neural Network

no code implementations7 Jun 2020 Yiwen Sun, Yulu Wang, Kun fu, Zheng Wang, Chang-Shui Zhang, Jieping Ye

Furthermore, in order to evaluate Fusion RNN's sequence feature extraction capability, we choose a representative data mining task for sequence data, estimated time of arrival (ETA) and present a novel model based on Fusion RNN.

FMA-ETA: Estimating Travel Time Entirely Based on FFN With Attention

no code implementations7 Jun 2020 Yiwen Sun, Yulu Wang, Kun fu, Zheng Wang, Ziang Yan, Chang-Shui Zhang, Jieping Ye

Estimated time of arrival (ETA) is one of the most important services in intelligent transportation systems and becomes a challenging spatial-temporal (ST) data mining task in recent years.

Constructing Geographic and Long-term Temporal Graph for Traffic Forecasting

no code implementations23 Apr 2020 Yiwen Sun, Yulu Wang, Kun fu, Zheng Wang, Chang-Shui Zhang, Jieping Ye

Recently, deep learning based methods have achieved promising results by adopting graph convolutional network (GCN) to extract the spatial correlations and recurrent neural network (RNN) to capture the temporal dependencies.

Hybrid Multiple Attention Network for Semantic Segmentation in Aerial Images

no code implementations9 Jan 2020 Ruigang Niu, Xian Sun, Yu Tian, Wenhui Diao, Kaiqiang Chen, Kun fu

Semantic segmentation in very high resolution (VHR) aerial images is one of the most challenging tasks in remote sensing image understanding.

Semantic Segmentation

Comparison Network for One-Shot Conditional Object Detection

no code implementations4 Apr 2019 Tengfei Zhang, Yue Zhang, Xian Sun, Hao Sun, Menglong Yan, Xue Yang, Kun fu

A two-stage detector for OSCD is introduced to compare the extracted query and target features with the learnable metric to approach the optimized non-linear conditional probability.

Object object-detection +1

A Remote Sensing Image Dataset for Cloud Removal

2 code implementations3 Jan 2019 Daoyu Lin, Guangluan Xu, Xiaoke Wang, Yang Wang, Xian Sun, Kun fu

Removing clouds is an indispensable pre-processing step in remote sensing image analysis.

Change Detection Cloud Removal +1

SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects

3 code implementations ICCV 2019 Xue Yang, Jirui Yang, Junchi Yan, Yue Zhang, Tengfei Zhang, Zhi Guo, Sun Xian, Kun fu

Specifically, a sampling fusion network is devised which fuses multi-layer feature with effective anchor sampling, to improve the sensitivity to small objects.

Ranked #48 on Object Detection In Aerial Images on DOTA (using extra training data)

object-detection Object Detection In Aerial Images

Position Detection and Direction Prediction for Arbitrary-Oriented Ships via Multitask Rotation Region Convolutional Neural Network

3 code implementations13 Jun 2018 Xue Yang, Hao Sun, Xian Sun, Menglong Yan, Zhi Guo, Kun fu

The complexity of application scenarios, the redundancy of detection region, and the difficulty of dense ship detection are all the main obstacles that limit the successful operation of traditional methods in ship detection.

Position

Automatic Ship Detection of Remote Sensing Images from Google Earth in Complex Scenes Based on Multi-Scale Rotation Dense Feature Pyramid Networks

4 code implementations12 Jun 2018 Xue Yang, Hao Sun, Kun fu, Jirui Yang, Xian Sun, Menglong Yan, Zhi Guo

Additionally, in the case of ship rotation and dense arrangement, we design a rotation anchor strategy to predict the minimum circumscribed rectangle of the object so as to reduce the redundant detection region and improve the recall.

object-detection Object Detection

An Attention-Based Word-Level Interaction Model: Relation Detection for Knowledge Base Question Answering

no code implementations30 Jan 2018 Hongzhi Zhang, Guandong Xu, Xiao Liang, Tinglei Huang, Kun fu

Then, instead of merging the sequence into a single vector with pooling operation, soft alignments between words from the question and the relation are learned.

Knowledge Base Question Answering Relation +2

MARTA GANs: Unsupervised Representation Learning for Remote Sensing Image Classification

no code implementations28 Dec 2016 Daoyu Lin, Kun fu, Yang Wang, Guangluan Xu, Xian Sun

With the development of deep learning, supervised learning has frequently been adopted to classify remotely sensed images using convolutional networks (CNNs).

Classification General Classification +3

Neural Network Architecture Optimization through Submodularity and Supermodularity

no code implementations1 Sep 2016 Junqi Jin, Ziang Yan, Kun fu, Nan Jiang, Chang-Shui Zhang

Deep learning models' architectures, including depth and width, are key factors influencing models' performance, such as test accuracy and computation time.

Optimizing Recurrent Neural Networks Architectures under Time Constraints

no code implementations29 Aug 2016 Junqi Jin, Ziang Yan, Kun fu, Nan Jiang, Chang-Shui Zhang

A greedy algorithm with bounds is suggested to solve the transformed problem.

Aligning where to see and what to tell: image caption with region-based attention and scene factorization

1 code implementation20 Jun 2015 Junqi Jin, Kun fu, Runpeng Cui, Fei Sha, Chang-Shui Zhang

In this paper, we propose an image caption system that exploits the parallel structures between images and sentences.

Image Captioning

Audio Classical Composer Identification by Deep Neural Network

no code implementations15 Jan 2013 Zhen Hu, Kun fu, Chang-Shui Zhang

We think our method is promising even though we test it in a different data set, since our data set is comparable to that in MIREX by size.

Denoising Information Retrieval +2

Cannot find the paper you are looking for? You can Submit a new open access paper.