no code implementations • LTEDI (ACL) 2022 • Morteza Janatdoust, Fatemeh Ehsani-Besheli, Hossein Zeinali
Depression is a common and serious mental illness that early detection can improve the patient’s symptoms and make depression easier to treat.
no code implementations • 24 Jul 2024 • Amirreza Naziri, Hossein Zeinali
Writing, as an omnipresent form of human communication, permeates nearly every aspect of contemporary life.
no code implementations • 19 Jul 2023 • Javad Peymanfard, Vahid Saeedi, Mohammad Reza Mohammadi, Hossein Zeinali, Nasser Mozayani
We evaluate our approach on various tasks, including word-level and sentence-level lip reading, and audiovisual speech recognition using the Arman-AV dataset, a largescale Persian corpus.
no code implementations • 8 Apr 2023 • Javad Peymanfard, Ali Lashini, Samin Heydarian, Hossein Zeinali, Nasser Mozayani
Lip-reading has made impressive progress in recent years, driven by advances in deep learning.
no code implementations • 7 Apr 2023 • Mohammd Hasan Shamgholi, Vahid Saeedi, Javad Peymanfard, Leila Alhabib, Hossein Zeinali
TTS, or text-to-speech, is a complicated process that can be accomplished through appropriate modeling using deep learning methods.
no code implementations • 21 Jan 2023 • Javad Peymanfard, Samin Heydarian, Ali Lashini, Hossein Zeinali, Mohammad Reza Mohammadi, Nasser Mozayani
In addition, we have proposed a technique to detect visemes (a visual equivalent of a phoneme) in Persian.
Audio-Visual Speech Recognition Automatic Speech Recognition +5
1 code implementation • 24 Jul 2022 • Hossein Mirzaee, Javad Peymanfard, Hamid Habibzadeh Moshtaghin, Hossein Zeinali
With the recent proliferation of open textual data on social media platforms, Emotion Detection (ED) from Text has received more attention over the past years.
no code implementations • SEMEVAL 2021 • Niloofar Ranjbar, Hossein Zeinali
In this paper, we describe our proposed methods for the multilingual word-in-Context disambiguation task in SemEval-2021.
no code implementations • 10 Apr 2021 • Javad Peymanfard, Mohammad Reza Mohammadi, Hossein Zeinali, Nasser Mozayani
Lip-reading is the operation of recognizing speech from lip movements.
no code implementations • 13 Dec 2019 • Hossein Zeinali, Kong Aik Lee, Jahangir Alam, Lukas Burget
This document describes the Short-duration Speaker Verification (SdSV) Challenge 2021.
no code implementations • 8 Dec 2019 • Hossein Zeinali, Lukáš Burget, Jan "Honza'' Černocký
We also provide the results of several experiments that can be considered as baselines: HMM-based i-vectors for text-dependent speaker verification, and HMM-based as well as state-of-the-art deep neural network based ASR.
1 code implementation • 19 Oct 2019 • Federico Landini, Shuai Wang, Mireia Diez, Lukáš Burget, Pavel Matějka, Kateřina Žmolíková, Ladislav Mošner, Oldřich Plchot, Ondřej Novotný, Hossein Zeinali, Johan Rohdin
This paper describes the systems developed by the BUT team for the four tracks of the second DIHARD speech diarization challenge.
no code implementations • 16 Oct 2019 • Hossein Zeinali, Shuai Wang, Anna Silnova, Pavel Matějka, Oldřich Plchot
The last two networks are one-dimensional CNN and are based on the x-vector extraction topology.
no code implementations • 13 Jul 2019 • Nooshin Maghsoodi, Hossein Sameti, Hossein Zeinali, Themos~Stafylakis
By making use of the natural partition of input features into digits, we train digit-specific i-vector extractors on top of each HMM and we extract well-localized i-vectors, each modelling merely the phonetic content corresponding to a single digit.
no code implementations • 13 Jul 2019 • Hossein Zeinali, Pavel Matějka, Ladislav Mošner, Oldřich Plchot, Anna Silnova, Ondřej Novotný, Ján Profant, Ondřej Glembek, Lukáš Burget
This is a description of our effort in VOiCES 2019 Speaker Recognition challenge.
no code implementations • 13 Jul 2019 • Hossein Zeinali, Themos Stafylakis, Georgia Athanasopoulou, Johan Rohdin, Ioannis Gkinis, Lukáš Burget, Jan "Honza'' Černocký
In this paper, we present the system description of the joint efforts of Brno University of Technology (BUT) and Omilia -- Conversational Intelligence for the ASVSpoof2019 Spoofing and Countermeasures Challenge.
no code implementations • 5 Nov 2018 • Hossein Zeinali, Lukas Burget, Johan Rohdin, Themos Stafylakis, Jan Cernocky
Recently, speaker embeddings extracted with deep neural networks became the state-of-the-art method for speaker verification.
no code implementations • 28 Sep 2018 • Hossein Zeinali, Lukas Burget, Hossein Sameti, Jan Cernocky
The task of spoken pass-phrase verification is to decide whether a test utterance contains the same phrase as given enrollment utterances.