Audio Question Answering

5 papers with code • 2 benchmarks • 1 datasets

This task has no description! Would you like to contribute one?

Datasets


Latest papers with no code

Audio Dialogues: Dialogues dataset for audio and music understanding

no code yet • 11 Apr 2024

Existing datasets for audio understanding primarily focus on single-turn interactions (i. e. audio captioning, audio question answering) for describing audio in natural language, thus limiting understanding audio via interactive dialogue.

AQUALLM: Audio Question Answering Data Generation Using Large Language Models

no code yet • 28 Dec 2023

The significance of possessing high-quality, diverse, and extensive AQA datasets cannot be overstated when aiming for the precision of an AQA system.

Attention-Based Methods For Audio Question Answering

no code yet • 31 May 2023

On the yes/no binary classification task, our proposed model achieves an accuracy of 68. 3% compared to 62. 7% in the reference model.

Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering

no code yet • 20 Apr 2022

Audio question answering (AQA) is a multimodal translation task where a system analyzes an audio signal and a natural language question, to generate a desirable natural language answer.