Search Results for author: Xiaotian Li

Found 16 papers, 2 papers with code

Continual Learning for Image-Based Camera Localization

1 code implementation ICCV 2021 Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Juho Kannala

For several emerging technologies such as augmented reality, autonomous driving and robotics, visual localization is a critical component.

Autonomous Driving Camera Localization +2

MOFA: A Model Simplification Roadmap for Image Restoration on Mobile Devices

1 code implementation24 Aug 2023 Xiangyu Chen, Ruiwen Zhen, Shuai Li, Xiaotian Li, Guanghui Wang

Extensive experiments demonstrate that our approach decreases runtime by up to 13% and reduces the number of parameters by up to 23%, while increasing PSNR and SSIM on several image restoration datasets.

Image Restoration SSIM

Full-Frame Scene Coordinate Regression for Image-Based Localization

no code implementations9 Feb 2018 Xiaotian Li, Juha Ylioinas, Juho Kannala

In this paper, instead of in a patch-based manner, we propose to perform the scene coordinate regression in a full-frame manner to make the computation efficient at test time and, more importantly, to add more global context to the regression process to improve the robustness.

Camera Relocalization Data Augmentation +2

Hierarchical Scene Coordinate Classification and Regression for Visual Localization

no code implementations CVPR 2020 Xiaotian Li, Shuzhe Wang, Yi Zhao, Jakob Verbeek, Juho Kannala

In this work, we present a new hierarchical scene coordinate network to predict pixel scene coordinates in a coarse-to-fine manner from a single RGB image.

Classification Data Augmentation +4

Can You Trust Your Pose? Confidence Estimation in Visual Localization

no code implementations1 Oct 2020 Luca Ferranti, Xiaotian Li, Jani Boutellier, Juho Kannala

Camera pose estimation in large-scale environments is still an open question and, despite recent promising results, it may still fail in some situations.

Autonomous Navigation Open-Ended Question Answering +2

Digging Into Self-Supervised Learning of Feature Descriptors

no code implementations10 Oct 2021 Iaroslav Melekhov, Zakaria Laskar, Xiaotian Li, Shuzhe Wang, Juho Kannala

Fully-supervised CNN-based approaches for learning local image descriptors have shown remarkable results in a wide range of geometric tasks.

Image-Based Localization Image Retrieval +3

Your "Attention" Deserves Attention: A Self-Diversified Multi-Channel Attention for Facial Action Analysis

no code implementations23 Mar 2022 Xiaotian Li, Zhihua Li, Huiyuan Yang, Geran Zhao, Lijun Yin

In this paper, we propose a compact model to enhance the representational and focusing power of neural attention maps and learn the "inter-attention" correlation for refined attention maps, which we term the "Self-Diversified Multi-Channel Attention Network (SMA-Net)".

Action Analysis Facial Expression Recognition +1

An EEG-Based Multi-Modal Emotion Database with Both Posed and Authentic Facial Actions for Emotion Analysis

no code implementations29 Mar 2022 Xiaotian Li, Xiang Zhang, Huiyuan Yang, Wenna Duan, Weiying Dai, Lijun Yin

Emotion is an experience associated with a particular pattern of physiological activity along with different physiological, behavioral and cognitive changes.

Cultural Vocal Bursts Intensity Prediction EEG +1

Knowledge-Spreader: Learning Facial Action Unit Dynamics with Extremely Limited Labels

no code implementations30 Mar 2022 Xiaotian Li, Xiang Zhang, Taoyue Wang, Lijun Yin

Recent studies on the automatic detection of facial action unit (AU) have extensively relied on large-sized annotations.

Out-of-Distribution Generalization

Weakly-Supervised Text-driven Contrastive Learning for Facial Behavior Understanding

no code implementations ICCV 2023 Xiang Zhang, Taoyue Wang, Xiaotian Li, Huiyuan Yang, Lijun Yin

This is because such pairs inevitably encode the subject-ID information, and the randomly constructed pairs may push similar facial images away due to the limited number of subjects in facial behavior datasets.

Contrastive Learning Facial Expression Recognition

HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

no code implementations5 May 2023 Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Yi Zhao, Giorgos Tolias, Juho Kannala

In this work, we present a new hierarchical scene coordinate network to predict pixel scene coordinates in a coarse-to-fine manner from a single RGB image.

regression Visual Localization

Knowledge-Spreader: Learning Semi-Supervised Facial Action Dynamics by Consistifying Knowledge Granularity

no code implementations ICCV 2023 Xiaotian Li, Xiang Zhang, Taoyue Wang, Lijun Yin

By formulating SSL as a Progressive Knowledge Distillation (PKD) problem, we aim to infer cross-domain information, specifically from spatial to temporal domains, by consistifying knowledge granularity within Teacher-Students Network.

Knowledge Distillation

Cannot find the paper you are looking for? You can Submit a new open access paper.