Audio Question Answering

5 papers with code • 2 benchmarks • 1 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Audio Question Answering

Trend	Dataset	Best Model	Paper	Code	Compare
	DAQA	MALiMo (6 Blocks)			See all
	RoadTracer	XLNet			See all

Datasets

RoadTracer

Latest papers with no code

Most implemented Social Latest No code

Audio Dialogues: Dialogues dataset for audio and music understanding

no code yet • 11 Apr 2024

Existing datasets for audio understanding primarily focus on single-turn interactions (i. e. audio captioning, audio question answering) for describing audio in natural language, thus limiting understanding audio via interactive dialogue.

Paper
Add Code

AQUALLM: Audio Question Answering Data Generation Using Large Language Models

no code yet • 28 Dec 2023

The significance of possessing high-quality, diverse, and extensive AQA datasets cannot be overstated when aiming for the precision of an AQA system.

Paper
Add Code

Attention-Based Methods For Audio Question Answering

no code yet • 31 May 2023

On the yes/no binary classification task, our proposed model achieves an accuracy of 68. 3% compared to 62. 7% in the reference model.

Paper
Add Code

Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering

no code yet • 20 Apr 2022

Audio question answering (AQA) is a multimodal translation task where a system analyzes an audio signal and a natural language question, to generate a desirable natural language answer.

Paper
Add Code

Audio Question Answering

Benchmarks Add a Result

Datasets

Latest papers with no code

Audio Dialogues: Dialogues dataset for audio and music understanding

AQUALLM: Audio Question Answering Data Generation Using Large Language Models

Attention-Based Methods For Audio Question Answering

Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering

Content

Benchmarks

Add a Result