Search Results for author: Fuyan Ma

Found 5 papers, 0 papers with code

GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering

no code implementations • 4 Feb 2024 • Ziyu Ma, Shutao Li, Bin Sun, Jianfei Cai, Zuxiang Long, Fuyan Ma

Therefore, we propose GeReA, a generate-reason framework that prompts a MLLM like InstructBLIP with question relevant vision and language information to generate knowledge-relevant descriptions and reasons those descriptions for knowledge-based VQA.

Language Modelling Large Language Model +3

Paper
Add Code

LOGO-Former: Local-Global Spatio-Temporal Transformer for Dynamic Facial Expression Recognition

no code implementations • 5 May 2023 • Fuyan Ma, Bin Sun, Shutao Li

Previous methods for dynamic facial expression recognition (DFER) in the wild are mainly based on Convolutional Neural Networks (CNNs), whose local operations ignore the long-range dependencies in videos.

Dynamic Facial Expression Recognition Facial Expression Recognition

Paper
Add Code

Spatio-Temporal Transformer for Dynamic Facial Expression Recognition in the Wild

no code implementations • 10 May 2022 • Fuyan Ma, Bin Sun, Shutao Li

Previous methods for dynamic facial expression in the wild are mainly based on Convolutional Neural Networks (CNNs), whose local operations ignore the long-range dependencies in videos.

Ranked #5 on Dynamic Facial Expression Recognition on FERV39k

Dynamic Facial Expression Recognition Facial Expression Recognition +1

Paper
Add Code

Hybrid Mutimodal Fusion for Dimensional Emotion Recognition

no code implementations • 16 Oct 2021 • Ziyu Ma, Fuyan Ma, Bin Sun, Shutao Li

For the MuSe-Stress sub-challenge, we highlight our solutions in three aspects: 1) the audio-visual features and the bio-signal features are used for emotional state recognition.

Emotion Recognition

Paper
Add Code

Facial Expression Recognition with Visual Transformers and Attentional Selective Fusion

no code implementations • 31 Mar 2021 • Fuyan Ma, Bin Sun, Shutao Li

Facial Expression Recognition (FER) in the wild is extremely challenging due to occlusions, variant head poses, face deformation and motion blur under unconstrained conditions.

Facial Expression Recognition Facial Expression Recognition (FER)

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.