no code implementations • 25 Nov 2024 • Ruofan Jia, Weiying Xie, Jie Lei, Haonan Qin, Jitao Ma, Leyuan Fang
As demand grows for complex tasks and high-performance applications in edge computing, the deployment of large models in federated learning has become increasingly urgent, given their superior representational power and generalization capabilities.
no code implementations • 17 Nov 2024 • Wenjin Guo, Donglai Liu, Weiying Xie, Yunsong Li, Xuefei Ning, Zihan Meng, Shulin Zeng, Jie Lei, Zhenman Fang, Yu Wang
Our integer training framework includes two components: ShiftQuant to realize accurate gradient estimation, and L1 normalization to smoothen the loss landscape.
no code implementations • 15 Oct 2024 • Shuhan Dong, Yunsong Li, Weiying Xie, Jiaqing Zhang, Jiayuan Tian, Danian Yang, Jie Lei
Multimodal object detection leverages diverse modal information to enhance the accuracy and robustness of detectors.
no code implementations • 26 Aug 2024 • Daixun Li, Weiying Xie, Mingxiang Cao, Yunke Wang, Jiaqing Zhang, Yunsong Li, Leyuan Fang, Chang Xu
In this paper, we introduce SAM into multimodal image segmentation for the first time, proposing a novel framework that combines Latent Space Token Generation (LSTG) and Fusion Mask Prompting (FMP) modules to enhance SAM's multimodal fusion and segmentation capabilities.
no code implementations • 27 Jul 2024 • Shuran Ma, Weiying Xie, Daixun Li, Haowei Li, Yunsong Li
To comprehensively address this challenge, we introduce FedCD (Cross-Domain Invariant Federated Learning), an overall optimization framework at both the local and global levels.
1 code implementation • 23 Jul 2024 • Weiying Xie, Yusi Zhang, Tianlin Hui, Jiaqing Zhang, Jie Lei, Yunsong Li
In this paper, we propose a novel multimodal object detector, named Low-rank Modal Adaptors (LMA) with a shared backbone.
no code implementations • 20 Apr 2024 • Yidan Liu, Weiying Xie, Kai Jiang, Jiaqing Zhang, Yunsong Li, Leyuan Fang
The majority of existing hyperspectral anomaly detection (HAD) methods use the low-rank representation (LRR) model to separate the background and anomaly components, where the anomaly component is optimized by handcrafted sparse priors (e. g., $\ell_{2, 1}$-norm).
no code implementations • 13 Apr 2024 • Yidan Liu, Jun Yue, Shaobo Xia, Pedram Ghamisi, Weiying Xie, Leyuan Fang
As a newly emerging advance in deep generative models, diffusion models have achieved state-of-the-art results in many fields, including computer vision, natural language processing, and molecule design.
1 code implementation • 14 Mar 2024 • Jiaqing Zhang, Mingxiang Cao, Xue Yang, Weiying Xie, Jie Lei, Daixun Li, Wenbo Huang, Yunsong Li
Multimodal image fusion and object detection are crucial for autonomous driving.
no code implementations • 13 Jan 2024 • Kai Jiang, Jiaxing Huang, Weiying Xie, Jie Lei, Yunsong Li, Ling Shao, Shijian Lu
Large-vocabulary object detectors (LVDs) aim to detect objects of many categories, which learn super objectness features and can locate objects accurately while applied to various downstream data.
no code implementations • 13 Jan 2024 • Kai Jiang, Jiaxing Huang, Weiying Xie, Yunsong Li, Ling Shao, Shijian Lu
Camera-only Bird's Eye View (BEV) has demonstrated great potential in environment perception in a 3D space.
no code implementations • 10 Jan 2024 • Jiayuan Tian, Jie Lei, Jiaqing Zhang, Weiying Xie, Yunsong Li
Effectively leveraging this data through self-supervised learning (SSL) is vital in the field of remote sensing.
1 code implementation • 6 Jan 2024 • Jiaqing Zhang, Jie Lei, Weiying Xie, Kai Jiang, Mingxiang Cao, Yunsong Li
Accurate cloud recognition and warning are crucial for various applications, including in-flight support, weather forecasting, and climate research.
1 code implementation • 6 Jan 2024 • Jiaqing Zhang, Jie Lei, Weiying Xie, Geng Yang, Daixun Li, Yunsong Li
Additionally, the information distribution flow (IDF) in MIVit enhances performance-awareness by distributing global classification information across different modalities' feature maps.
no code implementations • 2 Jan 2024 • Jitao Ma, Weiying Xie, Yunsong Li
Our method provides a new solution space for HAD that is closer to human visual perception with high confidence.
1 code implementation • CVPR 2024 • Weiying Xie, Haowei Li, Jitao Ma, Yunsong Li, Jie Lei, Donglai Liu, Leyuan Fang
In this paper we propose Joint Sparsification-Quantization (JointSQ) inspired by the discovery that sparsification can be treated as 0-bit quantization regardless of architectures.
no code implementations • 29 Dec 2023 • Weiying Xie, Zixuan Wang, Jitao Ma, Daixun Li, Yunsong Li
The key component of RS-DGC is a Neighborhood Statistical Indicator (NSI), which can quantify the importance of gradients within a specified neighborhood on each node to sparsify the local gradients before gradient transmission in each iteration.
no code implementations • 28 Dec 2023 • Mingxiang Cao, Weiying Xie, Jie Lei, Jiaqing Zhang, Daixun Li, Yunsong Li
Deep learning has driven significant progress in object detection using Synthetic Aperture Radar (SAR) imagery.
1 code implementation • 1 Dec 2023 • Weiying Xie, Xiaoyi Fan, Xin Zhang, Yunsong Li, Jie Lei, Leyuan Fang
Pruning-quantization joint learning always facilitates the deployment of deep neural networks (DNNs) on resource-constrained edge devices.
1 code implementation • CVPR 2024 • Xin Zhang, Jiawei Du, Yunsong Li, Weiying Xie, Joey Tianyi Zhou
Dataset pruning aims to construct a coreset capable of achieving performance comparable to the original, full dataset.
no code implementations • 16 Nov 2023 • Daixun Li, Weiying Xie, Zixuan Wang, YiBing Lu, Yunsong Li, Leyuan Fang
With the rapid development of imaging sensor technology in the field of remote sensing, multi-modal remote sensing data fusion has emerged as a crucial research direction for land cover classification tasks.
no code implementations • 16 Nov 2023 • Daixun Li, Weiying Xie, Jiaqing Zhang, Yunsong Li
High-dimensional images, known for their rich semantic information, are widely applied in remote sensing and other fields.
1 code implementation • 16 Nov 2023 • Daixun Li, Weiying Xie, Yunsong Li, Leyuan Fang
Multi-satellite, multi-modality in-orbit fusion is a challenging task as it explores the fusion representation of complex high-dimensional data under limited computational resources.
1 code implementation • 19 Jul 2023 • Jitao Ma, Weiying Xie, Yunsong Li, Leyuan Fang
We present a novel solution BSDM (background suppression diffusion model) for HAD, which can simultaneously learn latent background distributions and generalize to different datasets for suppressing complex background.
1 code implementation • CVPR 2023 • Weiying Xie, Kai Jiang, Yunsong Li, Jie Lei, Leyuan Fang, Wen-jin Guo
Specifically, we create a positive cycle between fusion and degradation estimation under a new probabilistic framework.
1 code implementation • 31 Dec 2022 • Jiaqing Zhang, Jie Lei, Weiying Xie, Yunsong Li, Xiuping Jia
More concretely, we first design a structure called guided quantization self-distillation (GQSD), which is an innovative idea for realizing lightweight through the synergy of quantization and distillation.
Ranked #1 on Object Detection on VEDAI
1 code implementation • 27 Sep 2022 • Jiaqing Zhang, Jie Lei, Weiying Xie, Zhenman Fang, Yunsong Li, Qian Du
Furthermore, we design a simple and flexible SR branch to learn HR feature representations that can discriminate small objects from vast backgrounds with low-resolution (LR) input, thus further improving the detection accuracy.
Ranked #3 on Object Detection on VEDAI
no code implementations • 18 Apr 2022 • Jun Yue, Leyuan Fang, Pedram Ghamisi, Weiying Xie, Jun Li, Jocelyn Chanussot, Antonio J Plaza
Therefore, remote sensing image understanding often faces the problems of incomplete, inexact, and inaccurate supervised information, which will affect the breadth and depth of remote sensing applications.
1 code implementation • 15 Dec 2021 • Li Xu, Gang He, Jinjia Zhou, Jie Lei, Weiying Xie, Yunsong Li, Yu-Wing Tai
In most video platforms, such as Youtube, and TikTok, the played videos usually have undergone multiple video encodings such as hardware encoding by recording devices, software encoding by video editing apps, and single/multiple video transcoding by video application servers.
no code implementations • 1 Jan 2021 • Tao Jiang, Weiying Xie, Jie Lei, Yunsong Li, Zan Li
For solving these problems, this paper proposes a sparse coding-inspired generative adversarial network (GAN) for weakly supervised HAD, named sparseHAD.
no code implementations • 1 Jan 2021 • Jiaping Zhong, Weiying Xie, Jie Lei, Yunsong Li, Zan Li
Hyperspectral anomaly detection under high-dimensional data and interference of deteriorated bands without any prior information has been challenging and attracted close attention in the exploration of the unknown in real scenarios.