no code implementations • LREC 2022 • Denis Ivanko, Alexandr Axyonov, Dmitry Ryumin, Alexey Kashevnik, Alexey Karpov
We present a new audio-visual speech corpus (RUSAVIC) recorded in a car environment and designed for noise-robust speech recognition.
no code implementations • 19 Mar 2024 • Elena Ryumina, Maxim Markitantov, Dmitry Ryumin, Heysem Kaya, Alexey Karpov
Our findings from the challenge demonstrate that the proposed method can potentially form a basis for developing intelligent tools for annotating audio-visual data in the context of human's basic and compound emotions.
1 code implementation • Expert Systems with Applications 2023 • Elena Ryumina, Maxim Markitantov, Dmitry Ryumin, Alexey Karpov
Psychological and neurological studies earlier suggested that a personality type can be determined by the whole face as well as by its sides.
Personality Trait Recognition Personality Trait Recognition by Face
no code implementations • The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences 2023 • Dmitry Ryumin, Denis Ivanko, Alexandr Axyonov
Automatic sign gesture recognition (GR) plays a critical role in facilitating communication between hearing-impaired individuals and the rest of society.
Ranked #5 on Sign Language Recognition on AUTSL
no code implementations • Sensors 2023 • Dmitry Ryumin, Denis Ivanko, Elena Ryumina
Audio-visual speech recognition (AVSR) is one of the most promising solutions for reliable speech recognition, particularly when audio is corrupted by noise.
Ranked #1 on Sign Language Recognition on AUTSL
no code implementations • 30th European Signal Processing Conference (EUSIPCO) 2022 • Denis Ivanko, Dmitry Ryumin, Alexey Kashevnik, Alexandr Axyonov, Alexey Karpov
After a comprehensive evaluation, we adapt the developed method and test it on the collected RUSAVIC corpus we recorded in-the-wild for vehicle driver.
Ranked #3 on Lipreading on Lip Reading in the Wild
no code implementations • LREC 2020 • Ildar Kagirov, Denis Ivanko, Dmitry Ryumin, Alex Axyonov, er, Alexey Karpov
The database includes lexical units (single words and phrases) from Russian sign language within one subject area, namely, {``}food products at the supermarket{''}, and was collected using MS Kinect 2. 0 device including both FullHD video and the depth map modes, which provides new opportunities for the lexicographical description of the Russian sign language vocabulary and enhances research in the field of automatic gesture recognition.