no code implementations • 3 Jun 2018 • Pu Jin, Gui-Song Xia, Fan Hu, Qikai Lu, Liangpei Zhang
Aerial image scene classification is a fundamental problem for understanding high-resolution remote sensing images and has become an active research task in the field of remote sensing due to its important role in a wide range of applications.
no code implementations • 30 Jan 2020 • Lichao Mou, Yuansheng Hua, Pu Jin, Xiao Xiang Zhu
In this paper, we introduce a novel problem of event recognition in unconstrained aerial videos in the remote sensing community and present a large-scale, human-annotated dataset, named ERA (Event Recognition in Aerial videos), consisting of 2, 864 videos each with a label from 25 different classes corresponding to an event unfolding 5 seconds.
1 code implementation • ECCV 2020 • Di Hu, Xuhong LI, Lichao Mou, Pu Jin, Dong Chen, Liping Jing, Xiaoxiang Zhu, Dejing Dou
With the help of this dataset, we evaluate three proposed approaches for transferring the sound event knowledge to the aerial scene recognition task in a multimodal learning framework, and show the benefit of exploiting the audio information for the aerial scene recognition.
no code implementations • 6 Jun 2020 • Qingyu Li, Lichao Mou, Yuansheng Hua, Yao Sun, Pu Jin, Yilei Shi, Xiao Xiang Zhu
The detected keypoints are subsequently reformulated as a closed polygon, which is the semantic boundary of the building.
1 code implementation • 7 Apr 2021 • Yuansheng Hua, Lichao Mou, Pu Jin, Xiao Xiang Zhu
We conduct experiments with extensive baseline models on both MultiScene-Clean and MultiScene to offer benchmarks for multi-scene recognition in single images and learning from noisy labels for this task, respectively.
1 code implementation • 2 Aug 2021 • Konrad Heidler, Lichao Mou, Di Hu, Pu Jin, Guangyao Li, Chuang Gan, Ji-Rong Wen, Xiao Xiang Zhu
By fine-tuning the models on a number of commonly used remote sensing datasets, we show that our approach outperforms existing pre-training strategies for remote sensing imagery.
Ranked #2 on Cross-Modal Retrieval on SoundingEarth
no code implementations • 22 Sep 2022 • Pu Jin, Lichao Mou, Yuansheng Hua, Gui-Song Xia, Xiao Xiang Zhu
Furthermore, the holistic features are refined by the multi-scale temporal relations in a novel fusion module for yielding more discriminative video representations.
1 code implementation • 25 Sep 2022 • Pu Jin, Lichao Mou, Gui-Song Xia, Xiao Xiang Zhu
In this paper, we create a new dataset, named DroneAnomaly, for anomaly detection in aerial videos.