Search Results for author: Xiaotian Li

Found 16 papers, 2 papers with code

Continual Learning for Image-Based Camera Localization

1 code implementation • ICCV 2021 • Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Juho Kannala

For several emerging technologies such as augmented reality, autonomous driving and robotics, visual localization is a critical component.

Autonomous Driving Camera Localization +2

Paper
Code

MOFA: A Model Simplification Roadmap for Image Restoration on Mobile Devices

1 code implementation • 24 Aug 2023 • Xiangyu Chen, Ruiwen Zhen, Shuai Li, Xiaotian Li, Guanghui Wang

Extensive experiments demonstrate that our approach decreases runtime by up to 13% and reduces the number of parameters by up to 23%, while increasing PSNR and SSIM on several image restoration datasets.

Image Restoration SSIM

Paper
Code

Full-Frame Scene Coordinate Regression for Image-Based Localization

no code implementations • 9 Feb 2018 • Xiaotian Li, Juha Ylioinas, Juho Kannala

In this paper, instead of in a patch-based manner, we propose to perform the scene coordinate regression in a full-frame manner to make the computation efficient at test time and, more importantly, to add more global context to the regression process to improve the robustness.

Camera Relocalization Data Augmentation +2

Paper
Add Code

Scene Coordinate Regression with Angle-Based Reprojection Loss for Camera Relocalization

no code implementations • 15 Aug 2018 • Xiaotian Li, Juha Ylioinas, Jakob Verbeek, Juho Kannala

Image-based camera relocalization is an important problem in computer vision and robotics.

Camera Relocalization regression

Paper
Add Code

Hierarchical Scene Coordinate Classification and Regression for Visual Localization

no code implementations • CVPR 2020 • Xiaotian Li, Shuzhe Wang, Yi Zhao, Jakob Verbeek, Juho Kannala

In this work, we present a new hierarchical scene coordinate network to predict pixel scene coordinates in a coarse-to-fine manner from a single RGB image.

Classification Data Augmentation +4

Paper
Add Code

Can You Trust Your Pose? Confidence Estimation in Visual Localization

no code implementations • 1 Oct 2020 • Luca Ferranti, Xiaotian Li, Jani Boutellier, Juho Kannala

Camera pose estimation in large-scale environments is still an open question and, despite recent promising results, it may still fail in some situations.

Autonomous Navigation Open-Ended Question Answering +2

Paper
Add Code

Infrastructure Assisted Constrained Connected Automated Vehicle Trajectory Optimization on Curved Roads: A Spatial Formulation on a Curvilinear Coordinate

no code implementations • 1 Mar 2021 • Ran Yi, Yang Zhou, Xin Wang, Zhiyuan Liu, Xiaotian Li, Bin Ran

This paper presents an infrastructure assisted constrained connected automated vehicles (CAVs) trajectory optimization method on curved roads.

Model Predictive Control

Paper
Add Code

Digging Into Self-Supervised Learning of Feature Descriptors

no code implementations • 10 Oct 2021 • Iaroslav Melekhov, Zakaria Laskar, Xiaotian Li, Shuzhe Wang, Juho Kannala

Fully-supervised CNN-based approaches for learning local image descriptors have shown remarkable results in a wide range of geometric tasks.

Image-Based Localization Image Retrieval +3

Paper
Add Code

Your "Attention" Deserves Attention: A Self-Diversified Multi-Channel Attention for Facial Action Analysis

no code implementations • 23 Mar 2022 • Xiaotian Li, Zhihua Li, Huiyuan Yang, Geran Zhao, Lijun Yin

In this paper, we propose a compact model to enhance the representational and focusing power of neural attention maps and learn the "inter-attention" correlation for refined attention maps, which we term the "Self-Diversified Multi-Channel Attention Network (SMA-Net)".

Action Analysis Facial Expression Recognition +1

Paper
Add Code

An EEG-Based Multi-Modal Emotion Database with Both Posed and Authentic Facial Actions for Emotion Analysis

no code implementations • 29 Mar 2022 • Xiaotian Li, Xiang Zhang, Huiyuan Yang, Wenna Duan, Weiying Dai, Lijun Yin

Emotion is an experience associated with a particular pattern of physiological activity along with different physiological, behavioral and cognitive changes.

Cultural Vocal Bursts Intensity Prediction EEG +1

Paper
Add Code

Knowledge-Spreader: Learning Facial Action Unit Dynamics with Extremely Limited Labels

no code implementations • 30 Mar 2022 • Xiaotian Li, Xiang Zhang, Taoyue Wang, Lijun Yin

Recent studies on the automatic detection of facial action unit (AU) have extensively relied on large-sized annotations.

Out-of-Distribution Generalization

Paper
Add Code

Multimodal Channel-Mixing: Channel and Spatial Masked AutoEncoder on Facial Action Unit Detection

no code implementations • 25 Sep 2022 • Xiang Zhang, Huiyuan Yang, Taoyue Wang, Xiaotian Li, Lijun Yin

Recent studies have focused on utilizing multi-modal data to develop robust models for facial Action Unit (AU) detection.

Action Unit Detection Facial Action Unit Detection +1

Paper
Add Code

Weakly-Supervised Text-driven Contrastive Learning for Facial Behavior Understanding

no code implementations • ICCV 2023 • Xiang Zhang, Taoyue Wang, Xiaotian Li, Huiyuan Yang, Lijun Yin

This is because such pairs inevitably encode the subject-ID information, and the randomly constructed pairs may push similar facial images away due to the limited number of subjects in facial behavior datasets.

Contrastive Learning Facial Expression Recognition

Paper
Add Code

HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

no code implementations • 5 May 2023 • Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Yi Zhao, Giorgos Tolias, Juho Kannala

In this work, we present a new hierarchical scene coordinate network to predict pixel scene coordinates in a coarse-to-fine manner from a single RGB image.

regression Visual Localization

Paper
Add Code

ReactioNet: Learning High-Order Facial Behavior from Universal Stimulus-Reaction by Dyadic Relation Reasoning

no code implementations • ICCV 2023 • Xiaotian Li, Taoyue Wang, Geran Zhao, Xiang Zhang, Xi Kang, Lijun Yin

Diverse visual stimuli can evoke various human affective states, which are usually manifested in an individual's muscular actions and facial expressions.

Action Unit Detection Contrastive Learning +4

Paper
Add Code

Knowledge-Spreader: Learning Semi-Supervised Facial Action Dynamics by Consistifying Knowledge Granularity

no code implementations • ICCV 2023 • Xiaotian Li, Xiang Zhang, Taoyue Wang, Lijun Yin

By formulating SSL as a Progressive Knowledge Distillation (PKD) problem, we aim to infer cross-domain information, specifically from spatial to temporal domains, by consistifying knowledge granularity within Teacher-Students Network.

Knowledge Distillation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.