Search Results for author: Zhi Gao

Found 19 papers, 5 papers with code

A Real-Time Framework for Domain-Adaptive Underwater Object Detection with Image Enhancement

no code implementations • 28 Mar 2024 • Junjie Wen, Jinqiang Cui, Benyun Zhao, Bingxin Han, Xuchen Liu, Zhi Gao, Ben M. Chen

Furthermore, to ensure balanced training for both tasks, we present a multi-stage training strategy aimed at consistently enhancing their performance.

Domain Adaptation object-detection +2

Paper
Add Code

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding

no code implementations • 18 Mar 2024 • Yue Fan, Xiaojian Ma, Rujie Wu, Yuntao Du, Jiaqi Li, Zhi Gao, Qing Li

We explore how reconciling several foundation models (large language models and vision-language models) with a novel unified memory mechanism could tackle the challenging video understanding problem, especially capturing the long-term temporal relations in lengthy videos.

Video Understanding

Paper
Add Code

CLOVA: A Closed-Loop Visual Assistant with Tool Usage and Update

no code implementations • 18 Dec 2023 • Zhi Gao, Yuntao Du, Xintong Zhang, Xiaojian Ma, Wenjuan Han, Song-Chun Zhu, Qing Li

However, these methods often overlook the potential for continual learning, typically by freezing the utilized tools, thus limiting their adaptation to environments requiring new knowledge.

Continual Learning Question Answering +1

Paper
Add Code

Global-Local MAV Detection under Challenging Conditions based on Appearance and Motion

1 code implementation • 18 Dec 2023 • Hanqing Guo, Ye Zheng, Yin Zhang, Zhi Gao, Shiyu Zhao

In this paper, we propose a global-local MAV detector that can fuse both motion and appearance features for MAV detection under challenging conditions.

Computational Efficiency

Paper
Code

Exploring Data Geometry for Continual Learning

no code implementations • CVPR 2023 • Zhi Gao, Chen Xu, Feng Li, Yunde Jia, Mehrtash Harandi, Yuwei Wu

Our method dynamically expands the geometry of the underlying space to match growing geometric structures induced by new data, and prevents forgetting by keeping geometric structures of old data into account.

Continual Learning

Paper
Add Code

Meta-causal Learning for Single Domain Generalization

no code implementations • CVPR 2023 • Jin Chen, Zhi Gao, Xinxiao wu, Jiebo Luo

Under this paradigm, we propose a meta-causal learning method to learn meta-knowledge, that is, how to infer the causes of domain shift between the auxiliary and source domains during training.

counterfactual Counterfactual Inference +2

Paper
Add Code

Imbalance Knowledge-Driven Multi-modal Network for Land-Cover Semantic Segmentation Using Images and LiDAR Point Clouds

no code implementations • 28 Mar 2023 • Yameng Wang, Yi Wan, Yongjun Zhang, Bin Zhang, Zhi Gao

The present multi-modal methods usually map high-dimensional features to low-dimensional spaces as a preprocess before feature extraction to address the nonnegligible domain gap, which inevitably leads to information loss.

Semantic Segmentation

Paper
Add Code

SyreaNet: A Physically Guided Underwater Image Enhancement Framework Integrating Synthetic and Real Images

1 code implementation • 16 Feb 2023 • Junjie Wen, Jinqiang Cui, Zhenjun Zhao, Ruixin Yan, Zhi Gao, Lihua Dou, Ben M. Chen

Although learning-based UIE methods have made remarkable achievements in recent years, it's still challenging for them to consistently deal with various underwater conditions, which could be caused by: 1) the use of the simplified atmospheric image formation model in UIE may result in severe errors; 2) the network trained solely with synthetic images might have difficulty in generalizing well to real underwater images.

Domain Adaptation Image Generation +1

Paper
Code

Asymmetric Hash Code Learning for Remote Sensing Image Retrieval

1 code implementation • 15 Jan 2022 • Weiwei Song, Zhi Gao, Renwei Dian, Pedram Ghamisi, Yongjun Zhang, Jón Atli Benediktsson

In this paper, we propose a novel deep hashing method, named asymmetric hash code learning (AHCL), for RSIR.

Deep Hashing Image Retrieval

Paper
Code

Generating Multivariate Load States Using a Conditional Variational Autoencoder

1 code implementation • 21 Oct 2021 • Chenguang Wang, Ensieh Sharifnia, Zhi Gao, Simon H. Tindemans, Peter Palensky

In this paper, a multivariate load state generating model on the basis of a conditional variational autoencoder (CVAE) neural network is proposed.

Paper
Code

Superevents: Towards Native Semantic Segmentation for Event-based Cameras

no code implementations • 13 May 2021 • Weng Fei Low, Ankit Sonthalia, Zhi Gao, André van Schaik, Bharath Ramesh

Most successful computer vision models transform low-level features, such as Gabor filter responses, into richer representations of intermediate or mid-level complexity for downstream visual tasks.

Depth Estimation Semantic Segmentation +1

Paper
Add Code

A Hyperbolic-to-Hyperbolic Graph Convolutional Network

no code implementations • CVPR 2021 • Jindou Dai, Yuwei Wu, Zhi Gao, Yunde Jia

Specifically, we developed a manifold-preserving graph convolution that consists of a hyperbolic feature transformation and a hyperbolic neighborhood aggregation.

General Classification Graph Classification +2

Paper
Add Code

Curvature Generation in Curved Spaces for Few-Shot Learning

no code implementations • ICCV 2021 • Zhi Gao, Yuwei Wu, Yunde Jia, Mehrtash Harandi

Few-shot learning describes the challenging problem of recognizing samples from unseen classes given very few labeled examples.

Few-Shot Learning

Paper
Add Code

FG-Net: Fast Large-Scale LiDAR Point Clouds Understanding Network Leveraging Correlated Feature Mining and Geometric-Aware Modelling

1 code implementation • 17 Dec 2020 • Kangcheng Liu, Zhi Gao, Feng Lin, Ben M. Chen

This work presents FG-Net, a general deep learning framework for large-scale point clouds understanding without voxelizations, which achieves accurate and real-time performance with a single NVIDIA GTX 1080 GPU.

Ranked #1 on Semantic Segmentation on Semantic3D

3D Part Segmentation 3D Point Cloud Classification +4

107

Paper
Code

Learning to Optimize on SPD Manifolds

no code implementations • CVPR 2020 • Zhi Gao, Yuwei Wu, Yunde Jia, Mehrtash Harandi

We parameterize the optimizer by the recurrent model and utilize Riemannian operations to ensure that our method is faithful to the geometry of SPD manifolds.

Clustering Meta-Learning

Paper
Add Code

MLFcGAN: Multi-level Feature Fusion based Conditional GAN for Underwater Image Color Correction

no code implementations • 13 Feb 2020 • Xiaodong Liu, Zhi Gao, Ben M. Chen

Color correction for underwater images has received increasing interests, due to its critical role in facilitating available mature vision algorithms for underwater scenarios.

Generative Adversarial Network

Paper
Add Code

Learning a Robust Representation via a Deep Network on Symmetric Positive Definite Manifolds

no code implementations • 17 Nov 2017 • Zhi Gao, Yuwei Wu, Xingyuan Bu, Yunde Jia

To this end, several new layers are introduced in our network, including a nonlinear kernel aggregation layer, an SPD matrix transformation layer, and a vectorization layer.

Paper
Add Code

Google Map Aided Visual Navigation for UAVs in GPS-denied Environment

no code implementations • 29 Mar 2017 • Mo Shan, Fei Wang, Feng Lin, Zhi Gao, Ya Z. Tang, Ben M. Chen

We propose a framework for Google Map aided UAV navigation in GPS-denied environment.

Optical Flow Estimation Pose Tracking +3

Paper
Add Code

Real-Time Optical flow-based Video Stabilization for Unmanned Aerial Vehicles

no code implementations • 13 Jan 2017 • Anli Lim, Bharath Ramesh, Yue Yang, Cheng Xiang, Zhi Gao, Feng Lin

This paper describes the development of a novel algorithm to tackle the problem of real-time video stabilization for unmanned aerial vehicles (UAVs).

Video Stabilization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.