no code implementations • 7 Mar 2025 • Junpeng Jing, Weixun Luo, Ye Mao, Krystian Mikolajczyk
This paper introduces Stereo Any Video, a powerful framework for video stereo matching.
no code implementations • 2 Feb 2025 • Ye Mao, Weixun Luo, Junpeng Jing, Anlan Qiu, Krystian Mikolajczyk
The rise of vision-language foundation models marks an advancement in bridging the gap between human and machine capabilities in 3D scene reasoning.
no code implementations • 30 Sep 2024 • Junpeng Jing, Ye Mao, Anlan Qiu, Krystian Mikolajczyk
Regarding datasets, current synthetic object-based and indoor datasets are commonly used for training and benchmarking, with a lack of outdoor nature scenarios.
1 code implementation • 25 Apr 2024 • Ye Mao, Junpeng Jing, Krystian Mikolajczyk
In this paper, we present OpenDlign, a novel open-world 3D model using depth-aligned images generated from a diffusion model for robust multimodal alignment.
Ranked #1 on
Zero-shot 3D classification
on OmniObject3D (Pretrained on ShapeNet)
(using extra training data)
no code implementations • 16 Mar 2024 • Junpeng Jing, Ye Mao, Krystian Mikolajczyk
Towards this challenge, we develop a bidirectional alignment mechanism for adjacent frames as a fundamental operation.
no code implementations • 7 Aug 2023 • Wenqiang Lai, Qihan Yang, Ye Mao, Endong Sun, Jiangnan Ye
Voice disorders affect millions of people worldwide.
1 code implementation • 24 Mar 2023 • Lan Jiang, Ye Mao, Xi Chen, Xiangfeng Wang, Chao Li
Diffusion model has emerged as an effective technique for image synthesis by modelling complex and variable data distributions.
1 code implementation • 24 Mar 2023 • Ye Mao, Lan Jiang, Xi Chen, Chao Li
Moreover, DisC-Diff leverages a disentangled multi-stream network to fully exploit complementary information from multi-contrast MRI, improving model interpretation under multiple conditions of multi-contrast inputs.