no code implementations • EAMT 2020 • Ondřej Bojar, Dominik Macháček, Sangeet Sagar, Otakar Smrž, Jonáš Kratochvíl, Ebrahim Ansari, Dario Franceschini, Chiara Canton, Ivan Simonini, Thai-Son Nguyen, Felix Schneider, Sebastian Stücker, Alex Waibel, Barry Haddow, Rico Sennrich, Philip Williams
ELITR (European Live Translator) project aims to create a speech translation system for simultaneous subtitling of conferences and online meetings targetting up to 43 languages.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
1 code implementation • EACL 2021 • Ebrahim Ansari, Ond{\v{r}}ej Bojar, Barry Haddow, Mohammad Mahmoudi
SLTev reports the quality, latency, and stability of an SLT candidate output based on the time-stamped transcript and reference translation into a target language.
no code implementations • EACL 2021 • Ond{\v{r}}ej Bojar, Dominik Mach{\'a}{\v{c}}ek, Sangeet Sagar, Otakar Smr{\v{z}}, Jon{\'a}{\v{s}} Kratochv{\'\i}l, Peter Pol{\'a}k, Ebrahim Ansari, Mohammad Mahmoudi, Rishu Kumar, Dario Franceschini, Chiara Canton, Ivan Simonini, Thai-Son Nguyen, Felix Schneider, Sebastian St{\"u}ker, Alex Waibel, Barry Haddow, Rico Sennrich, Philip Williams
This paper presents an automatic speech translation system aimed at live subtitling of conference presentations.
no code implementations • WS 2020 • Ebrahim Ansari, Amittai Axelrod, Nguyen Bach, Ond{\v{r}}ej Bojar, Roldano Cattoni, Fahim Dalvi, Nadir Durrani, Marcello Federico, Christian Federmann, Jiatao Gu, Fei Huang, Kevin Knight, Xutai Ma, Ajay Nagesh, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Xing Shi, Sebastian St{\"u}ker, Marco Turchi, Alex Waibel, er, Changhan Wang
The evaluation campaign of the International Conference on Spoken Language Translation (IWSLT 2020) featured this year six challenge tracks: (i) Simultaneous speech translation, (ii) Video speech translation, (iii) Offline speech translation, (iv) Conversational speech translation, (v) Open domain translation, and (vi) Non-native speech translation.
no code implementations • LREC 2020 • Hadi Abdi Khojasteh, Ebrahim Ansari, Mahdi Bohlouli
In order to target this gap for low resource languages, we propose a "Large Scale Colloquial Persian Dataset" (LSCP).
no code implementations • 23 Feb 2020 • Hadi Abdi Khojasteh, Ebrahim Ansari, Parvin Razzaghi, Akbar Karimi
This paper considers the task of matching images and sentences by learning a visual-textual embedding space for cross-modal retrieval.
no code implementations • RANLP 2019 • Ebrahim Ansari, Zden{\v{e}}k {\v{Z}}abokrtsk{\'y}, Mohammad Mahmoudi, Hamid Haghdoost, Jon{\'a}{\v{s}} Vidra
In the experimental phase, using the hand-annotated Persian lexicon and two smaller similar lexicons for Czech and Finnish languages, we evaluated the effect of the training data size, different hyper-parameters settings as well as different RNN-based models.
no code implementations • 10 Dec 2018 • Hadi Abdi Khojasteh, Alireza Abbas Alipour, Ebrahim Ansari, Parvin Razzaghi
Nowadays, automobile manufacturers make efforts to develop ways to make cars fully safe.
no code implementations • LREC 2018 • Akbar Karimi, Ebrahim Ansari, Bahram Sadeghi Bigham
Two machine translation systems are employed to translate from Persian to English and the reverse after which an IR system is used to measure the similarity of the translated sentences.
no code implementations • 29 Jan 2017 • Ebrahim Ansari, M. H. Sadreddini, Mostafa Sheikhalishahi, Richard Wallace, Fatemeh Alimardani
Experimental results show that using the new pivot based extraction can increase the quality of bilingual corpus significantly and consequently improves the performance of the Persian-Italian SMT system.
no code implementations • 29 Jan 2017 • Ebrahim Ansari, M. H. Sadreddini, Lucio Grandinetti, Mahsa Radinmehr, Ziba Khosravan, Mehdi Sheikhalishahi
In order to combine seed dictionaries, we propose two different combination models and examine the effect of our novel combination models on various comparable corpora that have differing degrees of comparability.