1 code implementation • 9 Dec 2024 • Lianyu Hu, Fanhua Shang, Liang Wan, Wei Feng
iLLaVA can nearly 2$\times$ the throughput, and reduce the memory costs by half with only a 0. 2\% - 0. 5\% performance drop across models of different scales including 7B, 13B and 34B.
1 code implementation • 9 Oct 2024 • Lianyu Hu, Tongkai Shi, Wei Feng, Fanhua Shang, Liang Wan
Plentiful ablations are further given to show the generalizability and reliability of our method upon different modality-missing ratios and types.
no code implementations • 6 Jul 2024 • Tianling Liu, Hongying Liu, Fanhua Shang, Lequan Yu, Tong Han, Liang Wan
Specifically, the CFD strategy not only identifies modality-shared and modality-specific features, but also decouples shared features among subsets of multimodal inputs, termed as modality-partial-shared features.
no code implementations • 7 May 2024 • Junting Zhao, Yang Zhou, Zhihao Chen, Huazhu Fu, Liang Wan
To ensure comprehensive learning of both common and rare topics, we categorize queries into common and rare types to learn differentiated topics, and then propose Topic Contrastive Loss to effectively align topics and queries in the latent space.
1 code implementation • 17 Apr 2024 • Lianyu Hu, Wei Feng, Liqing Gao, Zekang Liu, Liang Wan
In specific, CorrNet+ employs a correlation module and an identification module to build human body trajectories.
no code implementations • 2 Jan 2024 • Fan Lyu, Wei Feng, Yuepan Li, Qing Sun, Fanhua Shang, Liang Wan, Liang Wang
The goal of Continual Learning (CL) is to continuously learn from new data streams and accomplish the corresponding tasks.
1 code implementation • 31 Oct 2023 • Liang Liao, Liang Wan, Mingsheng Liu, Shusheng Li
To be precise, we use the Dual-Guided Attention (DGA) module we proposed to replace some multi-scale transformations with the calculation of attention which means we only use several attention layers of near linear complexity to achieve performance comparable to frequently-used multi-layer fusion.
no code implementations • 31 Oct 2023 • Weiqi Li, Fan Lyu, Fanhua Shang, Liang Wan, Wei Feng
Real-world data is extremely imbalanced and presents a long-tailed distribution, resulting in models that are biased towards classes with sufficient samples and perform poorly on rare classes.
2 code implementations • 21 Aug 2023 • Zhijin Ge, Fanhua Shang, Hongying Liu, Yuanyuan Liu, Liang Wan, Wei Feng, Xiaosen Wang
Deep neural networks are vulnerable to adversarial examples crafted by applying human-imperceptible perturbations on clean inputs.
no code implementations • 15 Jun 2023 • Liang Wan, Hongqing Liu, Yi Zhou, Jie Ji
By combining the DPRNN module with Convolution Recurrent Network (CRN), the DPCRN obtained a promising performance in speech separation with a limited model size.
1 code implementation • 18 Mar 2023 • Zhaohu Xing, Lei Zhu, Lequan Yu, Zhiheng Xing, Liang Wan
Masked image modeling (MIM) with transformer backbones has recently been exploited as a powerful self-supervised pre-training technique.
1 code implementation • 18 Mar 2023 • Zhaohu Xing, Liang Wan, Huazhu Fu, Guang Yang, Lei Zhu
Our experimental results also indicate the universality and effectiveness of the proposed model.
no code implementations • 16 Mar 2023 • Zhihao Chen, Liang Wan, Yefan Xiao, Lei Zhu, Huazhu Fu
Then, we develop a progressive aggregation module to enhance the spatio and temporal characteristics of features maps, and effectively integrate the three kinds of features.
no code implementations • 14 Mar 2023 • Zhihao Chen, Yang Zhou, Anh Tran, Junting Zhao, Liang Wan, Gideon Ooi, Lionel Cheng, Choon Hua Thng, Xinxing Xu, Yong liu, Huazhu Fu
To enable MedRPG to locate nuanced medical findings with better region-phrase correspondences, we further propose Tri-attention Context contrastive alignment (TaCo).
no code implementations • ICCV 2023 • Fan Lyu, Qing Sun, Fanhua Shang, Liang Wan, Wei Feng
In Parallel Continual Learning (PCL), the parallel multiple tasks start and end training unpredictably, thus suffering from training conflict and catastrophic forgetting issues.
1 code implementation • 25 Sep 2022 • Qing Sun, Fan Lyu, Fanhua Shang, Wei Feng, Liang Wan
Continual Learning (CL) sequentially learns new tasks like human beings, with the goal to achieve better Stability (S, remembering past tasks) and Plasticity (P, adapting to new tasks).
1 code implementation • 4 Sep 2022 • Tianling Liu, Wennan Liu, Lequan Yu, Liang Wan, Tong Han, Lei Zhu
Preoperative and noninvasive prediction of the meningioma grade is important in clinical practice, as it directly influences the clinical decision making.
1 code implementation • 31 Aug 2022 • Zhaohu Xing, Lequan Yu, Liang Wan, Tong Han, Lei Zhu
Multi-modal MR imaging is routinely used in clinical practice to diagnose and investigate brain tumors by providing rich complementary information.
no code implementations • 29 Sep 2021 • Qing Sun, Fan Lyu, Fanhua Shang, Wei Feng, Liang Wan
Traditionally, the primary goal of LL is to achieve the trade-off between the Stability (remembering past tasks) and Plasticity (adapting to new tasks).
1 code implementation • 6 Aug 2021 • Ye Liu, Lei Zhu, Shunda Pei, Huazhu Fu, Jing Qin, Qing Zhang, Liang Wan, Wei Feng
Our DID-Net predicts the three component maps by progressively integrating features across scales, and refines each map by passing an independent refinement network.
Ranked #8 on Image Dehazing on Haze4k
1 code implementation • CVPR 2021 • Zhihao Chen, Liang Wan, Lei Zhu, Jia Shen, Huazhu Fu, Wennan Liu, Jing Qin
The bottleneck is the lack of a well-established dataset with high-quality annotations for video shadow detection.
no code implementations • 22 Feb 2020 • Qian Zhang, Wei Feng, Liang Wan, Fei-Peng Tian, Xiaowei Wang, Ping Tan
Besides, we also theoretically prove the invariance of our ALR approach to the ambiguity of normal and lighting decomposition.
no code implementations • 21 Aug 2019 • Qing Guo, Wei Feng, Zhihao Chen, Ruijun Gao, Liang Wan, Song Wang
In this paper, we address these two problems by constructing a Blurred Video Tracking benchmark, which contains a variety of videos with different levels of motion blurs, as well as ground truth tracking results for evaluating trackers.
no code implementations • 26 Jul 2019 • Ruize Han, Yujun Zhang, Wei Feng, Chenxing Gong, Xiao-Yu Zhang, Jiewen Zhao, Liang Wan, Song Wang
However, for such collaborative analysis, the first step is to associate people, referred to as subjects in this paper, across these two views.
1 code implementation • 5 Dec 2018 • Mengya Gao, Yujun Shen, Quanquan Li, Junjie Yan, Liang Wan, Dahua Lin, Chen Change Loy, Xiaoou Tang
Knowledge Distillation (KD) aims at improving the performance of a low-capacity student model by inheriting knowledge from a high-capacity teacher model.
no code implementations • ICCV 2017 • Qing Guo, Wei Feng, Ce Zhou, Rui Huang, Liang Wan, Song Wang
How to effectively learn temporal variation of target appearance, to exclude the interference of cluttered background, while maintaining real-time response, is an essential problem of visual object tracking.
Ranked #5 on Visual Object Tracking on OTB-2013
no code implementations • ICCV 2015 • Wei Feng, Fei-Peng Tian, Qian Zhang, Nan Zhang, Liang Wan, Jizhou Sun
To guarantee detection sensitivity and accuracy of minute changes, in an observation, we capture a group of images under multiple illuminations, which need only to be roughly aligned to the last time lighting conditions.
no code implementations • CVPR 2013 • Liang Li, Wei Feng, Liang Wan, Jiawan Zhang
For this purpose, we aim at constructing maximum cohesive SP-grid, which is composed of real nodes, i. e. SPs, and dummy nodes that are meaningless in the image with only position-taking function in the grid.