Search Results for author: Peter Polák

Found 9 papers, 2 papers with code

Long-form Simultaneous Speech Translation: Thesis Proposal

no code implementations • 17 Oct 2023 • Peter Polák

Simultaneous speech translation (SST) aims to provide real-time translation of spoken language, even before the speaker finishes their sentence.

Machine Translation, Segmentation, +4

Long-Form End-to-End Speech Translation via Latent Alignment Segmentation

no code implementations • 20 Sep 2023 • Peter Polák, Ondřej Bojar

On a diverse set of language pairs and in- and out-of-domain data, we show that the proposed approach achieves state-of-the-art quality at no additional computational cost.

Segmentation, Translation

Robustness of Multi-Source MT to Transcription Errors

no code implementations • 26 May 2023 • Dominik Macháček, Peter Polák, Ondřej Bojar, Raj Dabre

Automatic speech translation is sensitive to speech recognition errors, but in a multilingual scenario, the same content may be available in various languages via simultaneous interpreting, dubbing or subtitling.

Machine Translation, speech-recognition, +2

ALIGNMEET: A Comprehensive Tool for Meeting Annotation, Alignment, and Evaluation

no code implementations • LREC 2022 • Peter Polák, Muskaan Singh, Anna Nedoluzhko, Ondřej Bojar

To facilitate research in this area, we present ALIGNMEET, a comprehensive tool for meeting annotation, alignment, and evaluation.

Coarse-To-Fine And Cross-Lingual ASR Transfer

no code implementations • 2 Sep 2021 • Peter Polák, Ondřej Bojar

End-to-end neural automatic speech recognition systems have recently achieved state-of-the-art results, but they require large datasets and extensive computing resources.

Automatic Speech Recognition, Automatic Speech Recognition (ASR), +3
