no code implementations • 26 Dec 2023 • Zhijie Shen, Chunyu Lin, Junsong Zhang, Lang Nie, Kang Liao, Yao Zhao
Existing panoramic layout estimation solutions tend to recover room boundaries from a vertically compressed sequence, yielding imprecise results as the compression process often muddles the semantics between various planes.
1 code implementation • CVPR 2023 • Zhijie Shen, Zishuo Zheng, Chunyu Lin, Lang Nie, Kang Liao, Shuai Zheng, Yao Zhao
Based on the Manhattan World assumption, most existing indoor layout estimation schemes focus on recovering layouts from vertically compressed 1D sequences.
1 code implementation • 28 Feb 2023 • Zhijie Shen, Wu Guo, Bin Gu
In this paper, we propose a language-universal adapter learning framework based on a pre-trained model for end-to-end multilingual automatic speech recognition (ASR).
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
1 code implementation • 3 Aug 2022 • Zhijie Shen, Chunyu Lin, Lang Nie, Kang Liao, Yao Zhao
For a monocular 360 image, depth estimation is a challenging because the distortion increases along the latitude.
Ranked #8 on Depth Estimation on Stanford2D3D Panoramic
no code implementations • 6 Jul 2022 • Zishuo Zheng, Chunyu Lin, Lang Nie, Kang Liao, Zhijie Shen, Yao Zhao
In this paper, we combine the two different representations and propose a novel 360{\deg} semantic segmentation solution from a complementary perspective.
no code implementations • 18 Mar 2022 • Zhijie Shen, Chunyu Lin, Lang Nie, Kang Liao, Yao Zhao
It comprises two modules: Dual-Cubemap Depth Estimation (DCDE) module and Boundary Revision (BR) module.
1 code implementation • 17 Mar 2022 • Zhijie Shen, Chunyu Lin, Kang Liao, Lang Nie, Zishuo Zheng, Yao Zhao
In particular, we divide patches on the spherical tangent domain into tokens to reduce the negative effect of panoramic distortions.
Ranked #4 on Depth Estimation on Stanford2D3D Panoramic