no code implementations • 16 Jul 2024 • Pengxiang Li, Zhi Gao, Bofei Zhang, Tao Yuan, Yuwei Wu, Mehrtash Harandi, Yunde Jia, Song-Chun Zhu, Qing Li
Vision language models (VLMs) have achieved impressive progress in diverse applications, becoming a prevalent research direction.
no code implementations • 28 Mar 2024 • Junjie Wen, Jinqiang Cui, Benyun Zhao, Bingxin Han, Xuchen Liu, Zhi Gao, Ben M. Chen
Furthermore, to ensure balanced training for both tasks, we present a multi-stage training strategy aimed at consistently enhancing their performance.
no code implementations • 18 Mar 2024 • Yue Fan, Xiaojian Ma, Rujie Wu, Yuntao Du, Jiaqi Li, Zhi Gao, Qing Li
We explore how reconciling several foundation models (large language models and vision-language models) with a novel unified memory mechanism could tackle the challenging video understanding problem, especially capturing the long-term temporal relations in lengthy videos.
1 code implementation • 18 Dec 2023 • Hanqing Guo, Ye Zheng, Yin Zhang, Zhi Gao, Shiyu Zhao
In this paper, we propose a global-local MAV detector that can fuse both motion and appearance features for MAV detection under challenging conditions.
no code implementations • CVPR 2024 • Zhi Gao, Yuntao Du, Xintong Zhang, Xiaojian Ma, Wenjuan Han, Song-Chun Zhu, Qing Li
However, these methods often overlook the potential for continual learning, typically by freezing the utilized tools, thus limiting their adaptation to environments requiring new knowledge.
no code implementations • CVPR 2023 • Zhi Gao, Chen Xu, Feng Li, Yunde Jia, Mehrtash Harandi, Yuwei Wu
Our method dynamically expands the geometry of the underlying space to match growing geometric structures induced by new data, and prevents forgetting by taking geometric structures of old data into account.
no code implementations • CVPR 2023 • Jin Chen, Zhi Gao, Xinxiao Wu, Jiebo Luo
Under this paradigm, we propose a meta-causal learning method to learn meta-knowledge, that is, how to infer the causes of domain shift between the auxiliary and source domains during training.
Ranked #1 on Single-Source Domain Generalization on PACS
no code implementations • 28 Mar 2023 • Yameng Wang, Yi Wan, Yongjun Zhang, Bin Zhang, Zhi Gao
Existing multi-modal methods usually map high-dimensional features to low-dimensional spaces as a preprocessing step before feature extraction to address the non-negligible domain gap, which inevitably leads to information loss.
1 code implementation • 16 Feb 2023 • Junjie Wen, Jinqiang Cui, Zhenjun Zhao, Ruixin Yan, Zhi Gao, Lihua Dou, Ben M. Chen
Although learning-based UIE methods have made remarkable achievements in recent years, it is still challenging for them to deal consistently with various underwater conditions, for two possible reasons: 1) the simplified atmospheric image formation model used in UIE may result in severe errors; 2) a network trained solely on synthetic images might have difficulty generalizing to real underwater images.
1 code implementation • 15 Jan 2022 • Weiwei Song, Zhi Gao, Renwei Dian, Pedram Ghamisi, Yongjun Zhang, Jón Atli Benediktsson
In this paper, we propose a novel deep hashing method, named asymmetric hash code learning (AHCL), for RSIR.
1 code implementation • 21 Oct 2021 • Chenguang Wang, Ensieh Sharifnia, Zhi Gao, Simon H. Tindemans, Peter Palensky
In this paper, a multivariate load state generating model on the basis of a conditional variational autoencoder (CVAE) neural network is proposed.
no code implementations • 13 May 2021 • Weng Fei Low, Ankit Sonthalia, Zhi Gao, André van Schaik, Bharath Ramesh
Most successful computer vision models transform low-level features, such as Gabor filter responses, into richer representations of intermediate or mid-level complexity for downstream visual tasks.
no code implementations • CVPR 2021 • Jindou Dai, Yuwei Wu, Zhi Gao, Yunde Jia
Specifically, we developed a manifold-preserving graph convolution that consists of a hyperbolic feature transformation and a hyperbolic neighborhood aggregation.
no code implementations • ICCV 2021 • Zhi Gao, Yuwei Wu, Yunde Jia, Mehrtash Harandi
Few-shot learning describes the challenging problem of recognizing samples from unseen classes given very few labeled examples.
1 code implementation • 17 Dec 2020 • Kangcheng Liu, Zhi Gao, Feng Lin, Ben M. Chen
This work presents FG-Net, a general deep learning framework for large-scale point cloud understanding without voxelization, which achieves accurate and real-time performance with a single NVIDIA GTX 1080 GPU.
Ranked #1 on Semantic Segmentation on Semantic3D
no code implementations • CVPR 2020 • Zhi Gao, Yuwei Wu, Yunde Jia, Mehrtash Harandi
We parameterize the optimizer by the recurrent model and utilize Riemannian operations to ensure that our method is faithful to the geometry of SPD manifolds.
no code implementations • 13 Feb 2020 • Xiaodong Liu, Zhi Gao, Ben M. Chen
Color correction for underwater images has received increasing interest, due to its critical role in making mature vision algorithms usable in underwater scenarios.
no code implementations • 17 Nov 2017 • Zhi Gao, Yuwei Wu, Xingyuan Bu, Yunde Jia
To this end, several new layers are introduced in our network, including a nonlinear kernel aggregation layer, an SPD matrix transformation layer, and a vectorization layer.
no code implementations • 29 Mar 2017 • Mo Shan, Fei Wang, Feng Lin, Zhi Gao, Ya Z. Tang, Ben M. Chen
We propose a framework for Google Map-aided UAV navigation in GPS-denied environments.
no code implementations • 13 Jan 2017 • Anli Lim, Bharath Ramesh, Yue Yang, Cheng Xiang, Zhi Gao, Feng Lin
This paper describes the development of a novel algorithm to tackle the problem of real-time video stabilization for unmanned aerial vehicles (UAVs).