1 code implementation • 1 Jul 2024 • Pooya Fayyazsanavi, Antonios Anastasopoulos, Jana Košecká
Sign language translation from video to spoken text presents unique challenges owing to the distinct grammar, expression nuances, and high variation of visual appearance across different speakers and contexts.
1 code implementation • 20 Nov 2023 • Pooya Fayyazsanavi, Negar Nejatishahidin, Jana Kosecka
We also propose a novel two-stage inference approach that re-ranks the hypotheses using the language model capabilities of the decoder.
no code implementations • 17 Apr 2023 • Pooya Fayyazsanavi, Zhiqiang Wan, Will Hutchcroft, Ivaylo Boyadzhiev, Yuguang Li, Jana Kosecka, Sing Bing Kang
While existing deep learning-based room layout estimation techniques demonstrate good overall accuracy, they are less effective for distant floor-wall boundaries.
no code implementations • 4 Dec 2022 • Negar Nejatishahidin, Pooya Fayyazsanavi
The 6D object pose estimation problem has been extensively studied in the fields of computer vision and robotics.
no code implementations • 27 Nov 2022 • Kourosh T. Baghaei, Amirreza Payandeh, Pooya Fayyazsanavi, Shahram Rahimi, Zhiqian Chen, Somayeh Bakhtiari Ramezani
Machine Learning algorithms have had a profound impact on the field of computer science over the past few decades.
1 code implementation • 2 Mar 2022 • Negar Nejatishahidin, Pooya Fayyazsanavi, Jana Kosecka
Deep convolutional neural network (CNN) models for pose estimation are typically trained and evaluated on datasets specifically curated for object detection, pose estimation, or 3D reconstruction, which requires large amounts of training data.
Ranked #1 on Pose Estimation on Pix3D