Search Results for author: Samuel Lipping

Found 5 papers, 3 papers with code

Language-based Audio Retrieval Task in DCASE 2022 Challenge

no code implementations20 Sep 2022 Huang Xie, Samuel Lipping, Tuomas Virtanen

Language-based audio retrieval is a task, where natural language textual captions are used as queries to retrieve audio signals from a dataset.

Audio captioning Retrieval

Language-based Audio Retrieval Task in DCASE 2022 Challenge

1 code implementation13 Jun 2022 Huang Xie, Samuel Lipping, Tuomas Virtanen

Language-based audio retrieval is a task, where natural language textual captions are used as queries to retrieve audio signals from a dataset.

Audio captioning Retrieval

Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering

no code implementations20 Apr 2022 Samuel Lipping, Parthasaarathy Sudarsanam, Konstantinos Drossos, Tuomas Virtanen

Audio question answering (AQA) is a multimodal translation task where a system analyzes an audio signal and a natural language question, to generate a desirable natural language answer.

Audio Question Answering Question Answering

Clotho: An Audio Captioning Dataset

7 code implementations21 Oct 2019 Konstantinos Drossos, Samuel Lipping, Tuomas Virtanen

Audio captioning is the novel task of general audio content description using free text.

Audio captioning Translation

Crowdsourcing a Dataset of Audio Captions

1 code implementation22 Jul 2019 Samuel Lipping, Konstantinos Drossos, Tuomas Virtanen

In this paper we present a three steps based framework for crowdsourcing an audio captioning dataset, based on concepts and practises followed for the creation of widely used image captioning and machine translations datasets.

Sound Audio and Speech Processing

Cannot find the paper you are looking for? You can Submit a new open access paper.