Search Results for author: Artem Ploujnikov

Found 4 papers, 1 papers with code

DASB -- Discrete Audio and Speech Benchmark

no code implementations20 Jun 2024 Pooneh Mousavi, Luca Della Libera, Jarod Duret, Artem Ploujnikov, Cem Subakan, Mirco Ravanelli

Discrete audio tokens have recently gained considerable attention for their potential to connect audio and language processing, enabling the creation of modern multimodal large language models.

Benchmarking Emotion Recognition +7

SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation

1 code implementation27 Jul 2022 Artem Ploujnikov, Mirco Ravanelli

End-to-end speech synthesis models directly convert the input characters into an audio representation (e. g., spectrograms).

Language Modelling Multi-Task Learning +5

Cannot find the paper you are looking for? You can Submit a new open access paper.