1 code implementation • 20 Mar 2024 • Yuxuan Zhou, Xingxing Li, Shengyu Li, Xuanbin Wang, Shaoquan Feng, Yuxuan Tan
Visual simultaneous localization and mapping (VSLAM) has broad applications, with state-of-the-art methods leveraging deep neural networks for better robustness and applicability.
no code implementations • ACM 2023 • Yuxuan Tan, Yuanman Li∗
Additionally, in order to handle scale transformations, we introduce a multi-scale projection method, which can be readily integrated into our target-aware framework that enables the attention process to be conducted between tokens containing information of varying scales.
no code implementations • 18 Aug 2023 • Yuxuan Tan, Yuanman Li, Limin Zeng, Jiaxiong Ye, Wei Wang, Xia Li
Additionally, in order to handle scale transformations, we introduce a multi-scale projection method, which can be readily integrated into our target-aware framework that enables the attention process to be conducted between tokens containing information of varying scales.