no code implementations • 20 May 2025 • Xingxing Weng, Chao Pang, Gui-Song Xia
Vision-language modeling (VLM) aims to bridge the information gap between images and natural language.
1 code implementation • 21 Aug 2024 • Chuandong Liu, Xingxing Weng, Shuguo Jiang, Pengcheng Li, Lei Yu, Gui-Song Xia
Unlike most methods that include all points in pseudo-labeled scenes for forward propagation but only pseudo-labeled points for backpropagation, AIScene removes points without pseudo-labels, ensuring consistency in both forward and backward propagation within the scene.
2 code implementations • 29 Mar 2024 • Chao Pang, Xingxing Weng, Jiang Wu, Jiayu Li, Yi Liu, Jiaxing Sun, Weijia Li, Shuai Wang, Litong Feng, Gui-Song Xia, Conghui He
VHM is built on a large-scale remote sensing image-text dataset with rich-content captions (VersaD), and an honest instruction dataset comprising both factual and deceptive questions (HnstD).
1 code implementation • 19 Jan 2024 • Chao Pang, Xingxing Weng, Jiang Wu, Qiang Wang, Gui-Song Xia
This ensures effective knowledge transfer while maintaining the student model's training flexibility.
1 code implementation • 1 Jun 2023 • Tamer Saleh, Xingxing Weng, Shimaa Holail, Chen Hao, Gui-Song Xia
The detection of flooded areas using high-resolution synthetic aperture radar (SAR) imagery is a critical task with applications in crisis and disaster management, as well as environmental resource planning.