1 code implementation • 27 Feb 2024 • Nguyen Nguyen, Jing Bi, Ali Vosoughi, Yapeng Tian, Pooyan Fazli, Chenliang Xu
To address these challenges, in this paper, we introduce the Object State Captioning and State Change Representation (OSCaR) dataset and benchmark.
no code implementations • 8 Nov 2023 • Cheng-Yu Chuang, Pooyan Fazli
We introduce CLearViD, a transformer-based model for video description generation that leverages curriculum learning to accomplish this task.
no code implementations • 7 Nov 2021 • Shasta Ihorn, Yue-Ting Siu, Aditya Bodi, Lothar Narins, Jose M. Castanon, Yash Kant, Abhishek Das, Ilmi Yoon, Pooyan Fazli
To overcome the increasing gaps in video accessibility, we developed a hybrid system of two tools to 1) automatically generate descriptions for videos and 2) provide answers or additional descriptions in response to user queries on a video.
no code implementations • 29 Dec 2019 • Yuxiang Sun, Pooyan Fazli
Policy distillation in deep reinforcement learning provides an effective way to transfer control policies from a larger network to a smaller untrained network without a significant degradation in performance.
no code implementations • 9 Mar 2018 • Mahmoud Hamandi, Mike D'Arcy, Pooyan Fazli
We present a novel human-aware navigation approach, where the robot learns to mimic humans to navigate safely in crowds.