no code implementations • 4 Feb 2024 • Ziyu Ma, Shutao Li, Bin Sun, Jianfei Cai, Zuxiang Long, Fuyan Ma
Therefore, we propose GeReA, a generate-reason framework that prompts an MLLM such as InstructBLIP with question-relevant vision and language information to generate knowledge-relevant descriptions, and then reasons over those descriptions for knowledge-based VQA.
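The generate-then-reason idea can be sketched as a two-stage pipeline. This is a minimal illustrative skeleton, not GeReA's actual implementation: the function names are hypothetical, and the model calls are stubs standing in for an MLLM (e.g. InstructBLIP) and a reasoning model.

```python
# Hypothetical sketch of a generate-then-reason VQA pipeline.
# Both stages are stubbed; in the real framework they would be
# calls to an MLLM and a reasoning model respectively.

def generate_descriptions(image, question, n=3):
    # Stage 1 (stub): prompt the MLLM with the image and question to
    # produce several question-relevant, knowledge-rich descriptions.
    return [f"description {i} relevant to: {question}" for i in range(n)]

def reason_over(descriptions, question):
    # Stage 2 (stub): use the generated descriptions as context and
    # reason over them to derive a final answer.
    context = "; ".join(descriptions)
    return f"answer to '{question}' given [{context}]"

def answer_vqa(image, question):
    # Full pipeline: generate knowledge-relevant descriptions, then reason.
    descriptions = generate_descriptions(image, question)
    return reason_over(descriptions, question)

result = answer_vqa(image=None, question="What sport is being played?")
```

The key design point is the separation of stages: the MLLM supplies external-knowledge-flavored descriptions, and a second reasoning step aggregates them into an answer.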
no code implementations • 5 Jul 2022 • Bin Li, Yixuan Weng, Ziyu Ma, Bin Sun, Shutao Li
To fully leverage the visual information for both scene understanding and dialogue generation, we propose the scene-aware prompt for the MDUG task.
no code implementations • 16 Oct 2021 • Ziyu Ma, Fuyan Ma, Bin Sun, Shutao Li
For the MuSe-Stress sub-challenge, we highlight our solutions in three aspects: 1) audio-visual features and bio-signal features are used for emotional state recognition.