1 code implementation • 18 Feb 2024 • Wenzhao Zheng, Ruiqi Song, Xianda Guo, Chenming Zhang, Long Chen
We then employ a variational autoencoder to learn the future trajectory distribution in a structural latent space for trajectory prior modeling.
1 code implementation • 1 Dec 2023 • Xianda Guo, Juntao Lu, Chenming Zhang, Yiqi Wang, Yiqun Duan, Tian Yang, Zheng Zhu, Long Chen
Based on OpenStereo, we conducted experiments and have achieved or surpassed the performance metrics reported in the original paper.
1 code implementation • 10 May 2023 • Yingjie Tian, Yiqi Wang, Xianda Guo, Zheng Zhu, Long Chen
In recent years, soft prompt learning methods have been proposed to fine-tune large-scale vision-language pre-trained models for various downstream tasks.
no code implementations • ICCV 2023 • Ming Wang, Xianda Guo, Beibei Lin, Tian Yang, Zheng Zhu, Lincheng Li, Shunli Zhang, Xin Yu
This is the first framework on gait recognition that is designed to focus on the extraction of dynamic features.
no code implementations • 14 Mar 2023 • Xianda Guo, Wenjie Yuan, Yunpeng Zhang, Tian Yang, Chenming Zhang, Zheng Zhu, Long Chen
The former is achieved by the self-attention module within each view, while the latter is realized by the adjacent attention module, which computes the attention across multi-cameras to exchange the multi-scale representations across surround-view feature maps.
1 code implementation • 9 Mar 2023 • Yiqun Duan, Xianda Guo, Zheng Zhu
We propose DiffusionDepth, a new approach that reformulates monocular depth estimation as a denoising diffusion process.
1 code implementation • 6 Aug 2022 • Chaoqiang Zhao, Youmin Zhang, Matteo Poggi, Fabio Tosi, Xianda Guo, Zheng Zhu, Guan Huang, Yang Tang, Stefano Mattoccia
Self-supervised monocular depth estimation is an attractive solution that does not require hard-to-source depth labels for training.
Ranked #1 on Monocular Depth Estimation on KITTI
no code implementations • ICCV 2021 • Xianda Guo, Zheng Zhu, Tian Yang, Beibei Lin, JunJie Huang, Jiankang Deng, Guan Huang, Jie zhou, Jiwen Lu
To the best of our knowledge, this is the first large-scale dataset for gait recognition in the wild.
1 code implementation • 8 Mar 2022 • Ming Wang, Beibei Lin, Xianda Guo, Lincheng Li, Zheng Zhu, Jiande Sun, Shunli Zhang, Xin Yu
ECM consists of the Spatial-Temporal feature extractor (ST), the Frame-Level feature extractor (FL) and SPB, and has two obvious advantages: First, each branch focuses on a specific representation, which can be used to improve the robustness of the network.