Search Results for author: Huda Alamri

Found 6 papers, 4 papers with code

End-to-End Multimodal Representation Learning for Video Dialog

no code implementations26 Oct 2022 Huda Alamri, Anthony Bilic, Michael Hu, Apoorva Beedu, Irfan Essa

Video-based dialog task is a challenging multimodal learning task that has received increasing attention over the past few years with state-of-the-art obtaining new performance records.

Representation Learning Retrieval

Video based Object 6D Pose Estimation using Transformers

1 code implementation24 Oct 2022 Apoorva Beedu, Huda Alamri, Irfan Essa

We introduce a Transformer based 6D Object Pose Estimation framework VideoPose, comprising an end-to-end attention based modelling architecture, that attends to previous frames in order to estimate accurate 6D Object Poses in videos.

6D Pose Estimation 6D Pose Estimation using RGB +1

Cannot find the paper you are looking for? You can Submit a new open access paper.