Search Results for author: Xianda Guo

Found 9 papers, 6 papers with code

GenAD: Generative End-to-End Autonomous Driving

1 code implementation18 Feb 2024 Wenzhao Zheng, Ruiqi Song, Xianda Guo, Chenming Zhang, Long Chen

We then employ a variational autoencoder to learn the future trajectory distribution in a structural latent space for trajectory prior modeling.

Autonomous Driving motion prediction

OpenStereo: A Comprehensive Benchmark for Stereo Matching and Strong Baseline

1 code implementation1 Dec 2023 Xianda Guo, Juntao Lu, Chenming Zhang, Yiqi Wang, Yiqun Duan, Tian Yang, Zheng Zhu, Long Chen

Based on OpenStereo, we conducted experiments and have achieved or surpassed the performance metrics reported in the original paper.

Autonomous Driving Autonomous Navigation +1

Multi-Prompt with Depth Partitioned Cross-Modal Learning

1 code implementation10 May 2023 Yingjie Tian, Yiqi Wang, Xianda Guo, Zheng Zhu, Long Chen

In recent years, soft prompt learning methods have been proposed to fine-tune large-scale vision-language pre-trained models for various downstream tasks.

Domain Generalization

A Simple Baseline for Supervised Surround-view Depth Estimation

no code implementations14 Mar 2023 Xianda Guo, Wenjie Yuan, Yunpeng Zhang, Tian Yang, Chenming Zhang, Zheng Zhu, Long Chen

The former is achieved by the self-attention module within each view, while the latter is realized by the adjacent attention module, which computes the attention across multi-cameras to exchange the multi-scale representations across surround-view feature maps.

Autonomous Driving Monocular Depth Estimation

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation

1 code implementation9 Mar 2023 Yiqun Duan, Xianda Guo, Zheng Zhu

We propose DiffusionDepth, a new approach that reformulates monocular depth estimation as a denoising diffusion process.

Denoising Monocular Depth Estimation

GaitStrip: Gait Recognition via Effective Strip-based Feature Representations and Multi-Level Framework

1 code implementation8 Mar 2022 Ming Wang, Beibei Lin, Xianda Guo, Lincheng Li, Zheng Zhu, Jiande Sun, Shunli Zhang, Xin Yu

ECM consists of the Spatial-Temporal feature extractor (ST), the Frame-Level feature extractor (FL) and SPB, and has two obvious advantages: First, each branch focuses on a specific representation, which can be used to improve the robustness of the network.

Gait Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.