Search Results for author: Ziyu Ma

Found 3 papers, 0 papers with code

GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering

no code implementations4 Feb 2024 Ziyu Ma, Shutao Li, Bin Sun, Jianfei Cai, Zuxiang Long, Fuyan Ma

Therefore, we propose GeReA, a generate-reason framework that prompts a MLLM like InstructBLIP with question relevant vision and language information to generate knowledge-relevant descriptions and reasons those descriptions for knowledge-based VQA.

Language Modelling Large Language Model +3

Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation

no code implementations5 Jul 2022 Bin Li, Yixuan Weng, Ziyu Ma, Bin Sun, Shutao Li

To fully leverage the visual information for both scene understanding and dialogue generation, we propose the scene-aware prompt for the MDUG task.

Dialogue Generation Dialogue Understanding +2

Hybrid Mutimodal Fusion for Dimensional Emotion Recognition

no code implementations16 Oct 2021 Ziyu Ma, Fuyan Ma, Bin Sun, Shutao Li

For the MuSe-Stress sub-challenge, we highlight our solutions in three aspects: 1) the audio-visual features and the bio-signal features are used for emotional state recognition.

Emotion Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.