no code implementations • 4 Feb 2024 • Ziyu Ma, Shutao Li, Bin Sun, Jianfei Cai, Zuxiang Long, Fuyan Ma
Therefore, we propose GeReA, a generate-reason framework that prompts a MLLM like InstructBLIP with question relevant vision and language information to generate knowledge-relevant descriptions and reasons those descriptions for knowledge-based VQA.
no code implementations • 5 May 2023 • Fuyan Ma, Bin Sun, Shutao Li
Previous methods for dynamic facial expression recognition (DFER) in the wild are mainly based on Convolutional Neural Networks (CNNs), whose local operations ignore the long-range dependencies in videos.
Dynamic Facial Expression Recognition Facial Expression Recognition
no code implementations • 10 May 2022 • Fuyan Ma, Bin Sun, Shutao Li
Previous methods for dynamic facial expression in the wild are mainly based on Convolutional Neural Networks (CNNs), whose local operations ignore the long-range dependencies in videos.
Ranked #5 on Dynamic Facial Expression Recognition on FERV39k
Dynamic Facial Expression Recognition Facial Expression Recognition +1
no code implementations • 16 Oct 2021 • Ziyu Ma, Fuyan Ma, Bin Sun, Shutao Li
For the MuSe-Stress sub-challenge, we highlight our solutions in three aspects: 1) the audio-visual features and the bio-signal features are used for emotional state recognition.
no code implementations • 31 Mar 2021 • Fuyan Ma, Bin Sun, Shutao Li
Facial Expression Recognition (FER) in the wild is extremely challenging due to occlusions, variant head poses, face deformation and motion blur under unconstrained conditions.
Facial Expression Recognition Facial Expression Recognition (FER)