no code implementations • CVPR 2024 • Xiaotian Li, Baojie Fan, Jiandong Tian, Huijie Fan
In the following, LiDAR-guided adaptive fusion transformer (LGAFT) is developed to adaptively enhance the interaction of different modal BEV features from a global perspective.
no code implementations • 22 Jul 2024 • Xinghao Wu, Xuefeng Liu, Jianwei Niu, Guogang Zhu, Shaojie Tang, Xiaotian Li, Jiannong Cao
We note that when dealing with clients with similar data distributions, methods such as personalized weight aggregation tend to enforce their models to be close in the parameter space.
1 code implementation • 24 Aug 2023 • Xiangyu Chen, Ruiwen Zhen, Shuai Li, Xiaotian Li, Guanghui Wang
Extensive experiments demonstrate that our approach decreases runtime by up to 13% and reduces the number of parameters by up to 23%, while increasing PSNR and SSIM on several image restoration datasets.
no code implementations • 5 May 2023 • Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Yi Zhao, Giorgos Tolias, Juho Kannala
In this work, we present a new hierarchical scene coordinate network to predict pixel scene coordinates in a coarse-to-fine manner from a single RGB image.
no code implementations • ICCV 2023 • Xiang Zhang, Taoyue Wang, Xiaotian Li, Huiyuan Yang, Lijun Yin
This is because such pairs inevitably encode the subject-ID information, and the randomly constructed pairs may push similar facial images away due to the limited number of subjects in facial behavior datasets.
no code implementations • ICCV 2023 • Xiaotian Li, Taoyue Wang, Geran Zhao, Xiang Zhang, Xi Kang, Lijun Yin
Diverse visual stimuli can evoke various human affective states, which are usually manifested in an individual's muscular actions and facial expressions.
no code implementations • ICCV 2023 • Xiaotian Li, Xiang Zhang, Taoyue Wang, Lijun Yin
By formulating SSL as a Progressive Knowledge Distillation (PKD) problem, we aim to infer cross-domain information, specifically from spatial to temporal domains, by consistifying knowledge granularity within Teacher-Students Network.
no code implementations • 25 Sep 2022 • Xiang Zhang, Huiyuan Yang, Taoyue Wang, Xiaotian Li, Lijun Yin
Recent studies have focused on utilizing multi-modal data to develop robust models for facial Action Unit (AU) detection.
no code implementations • 30 Mar 2022 • Xiaotian Li, Xiang Zhang, Taoyue Wang, Lijun Yin
Recent studies on the automatic detection of facial action unit (AU) have extensively relied on large-sized annotations.
no code implementations • 29 Mar 2022 • Xiaotian Li, Xiang Zhang, Huiyuan Yang, Wenna Duan, Weiying Dai, Lijun Yin
Emotion is an experience associated with a particular pattern of physiological activity along with different physiological, behavioral and cognitive changes.
no code implementations • 23 Mar 2022 • Xiaotian Li, Zhihua Li, Huiyuan Yang, Geran Zhao, Lijun Yin
In this paper, we propose a compact model to enhance the representational and focusing power of neural attention maps and learn the "inter-attention" correlation for refined attention maps, which we term the "Self-Diversified Multi-Channel Attention Network (SMA-Net)".
no code implementations • 10 Oct 2021 • Iaroslav Melekhov, Zakaria Laskar, Xiaotian Li, Shuzhe Wang, Juho Kannala
Fully-supervised CNN-based approaches for learning local image descriptors have shown remarkable results in a wide range of geometric tasks.
1 code implementation • ICCV 2021 • Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Juho Kannala
For several emerging technologies such as augmented reality, autonomous driving and robotics, visual localization is a critical component.
no code implementations • 1 Mar 2021 • Ran Yi, Yang Zhou, Xin Wang, Zhiyuan Liu, Xiaotian Li, Bin Ran
This paper presents an infrastructure assisted constrained connected automated vehicles (CAVs) trajectory optimization method on curved roads.
no code implementations • 1 Oct 2020 • Luca Ferranti, Xiaotian Li, Jani Boutellier, Juho Kannala
Camera pose estimation in large-scale environments is still an open question and, despite recent promising results, it may still fail in some situations.
no code implementations • CVPR 2020 • Xiaotian Li, Shuzhe Wang, Yi Zhao, Jakob Verbeek, Juho Kannala
In this work, we present a new hierarchical scene coordinate network to predict pixel scene coordinates in a coarse-to-fine manner from a single RGB image.
no code implementations • 15 Aug 2018 • Xiaotian Li, Juha Ylioinas, Jakob Verbeek, Juho Kannala
Image-based camera relocalization is an important problem in computer vision and robotics.
no code implementations • 9 Feb 2018 • Xiaotian Li, Juha Ylioinas, Juho Kannala
In this paper, instead of in a patch-based manner, we propose to perform the scene coordinate regression in a full-frame manner to make the computation efficient at test time and, more importantly, to add more global context to the regression process to improve the robustness.