no code implementations • 10 May 2021 • Bin Zhao, Haopeng Li, Xiaoqiang Lu, Xuelong Li
Then, the videos are summarized by exploiting both the local and global dependencies among shots.
no code implementations • 9 Mar 2021 • Yuan Yuan, Hailong Ning, Xiaoqiang Lu
In this paper, a novel VAP method is proposed to generate visual attention map via bio-inspired representation learning.
no code implementations • IEEE Transactions on Image Processing 2021 • Hao Sun, Xiangtao Zheng, Xiaoqiang Lu
To explore the spatial information for HSI classification, pixels with its adjacent pixels are usually directly cropped from hyperspectral data to form HSI cubes in CNN-based methods.
no code implementations • 29 May 2019 • Xuelong. Li, Aihong Yuan, Xiaoqiang Lu
To make full use of these information, this paper attempt to exploit the text guided attention and semantic-guided attention (SA) to find the more correlated spatial information and reduce the semantic gap between vision and language.
no code implementations • 28 Apr 2019 • Bin Zhao, Xuelong. Li, Xiaoqiang Lu
Compared to traditional RNNs, H-RNN is more suitable to video summarization, since it can exploit long temporal dependency among frames, meanwhile, the computation operations are significantly lessened.
no code implementations • 24 Apr 2019 • Bin Zhao, Xuelong. Li, Xiaoqiang Lu, Zhigang Wang
To address this problem, we make the first attempt to view weather recognition as a multi-label classification task, i. e., assigning an image more than one labels according to the displayed weather conditions.
no code implementations • 24 Apr 2019 • Xuelong. Li, Bin Zhao, Xiaoqiang Lu
Besides, the property-weights are learned for edited videos and raw videos, respectively.
no code implementations • 21 Apr 2019 • Aihong Yuan, Xuelong. Li, Xiaoqiang Lu
In this paper, we propose a model with 3-gated model which fuses the global and local image features together for the task of image caption generation.
no code implementations • 20 Apr 2019 • Xuelong. Li, Aihong Yuan, Xiaoqiang Lu
And in the testing step, when an image is imported to our multi-modal GRU model, a sentence which describes the image content is generated.
no code implementations • CVPR 2018 • Bin Zhao, Xuelong. Li, Xiaoqiang Lu
Although video summarization has achieved great success in recent years, few approaches have realized the influence of video structure on the summarization results.
2 code implementations • 21 Dec 2017 • Xiaoqiang Lu, Binqiang Wang, Xiangtao Zheng, Xuelong. Li
Finally, a comprehensive review is presented on the proposed data set to fully advance the task of remote sensing caption.
no code implementations • ICCV 2017 • Xuelong. Li, Di Hu, Xiaoqiang Lu
Image is usually taken for expressing some kinds of emotions or purposes, such as love, celebrating Christmas.
4 code implementations • 1 Mar 2017 • Gong Cheng, Junwei Han, Xiaoqiang Lu
During the past years, significant efforts have been made to develop various datasets or present a variety of approaches for scene classification from remote sensing images.
no code implementations • CVPR 2016 • Di Hu, Xuelong. Li, Xiaoqiang Lu
Recently, audiovisual speech recognition based the MRBM has attracted much attention, and the MRBM shows its effectiveness in learning the joint representation across audiovisual modalities.