no code implementations • 5 Oct 2023 • Mengyu Yang, Patrick Grady, Samarth Brahmbhatt, Arun Balajee Vasudevan, Charles C. Kemp, James Hays
How easy is it to sneak up on a robot?
no code implementations • CVPR 2022 • Arun Balajee Vasudevan, Dengxin Dai, Luc van Gool
Specifically, for this study, we investigate binaural sounds and image data in isolation.
no code implementations • 6 Sep 2021 • Dengxin Dai, Arun Balajee Vasudevan, Jiri Matas, Luc van Gool
Humans can robustly recognize and localize objects by using visual and/or auditory cues.
no code implementations • ECCV 2020 • Arun Balajee Vasudevan, Dengxin Dai, Luc van Gool
We also propose two auxiliary tasks namely, a) a novel task on Spatial Sound Super-resolution to increase the spatial resolution of sounds, and b) dense depth prediction of the scene.
no code implementations • 4 Oct 2019 • Arun Balajee Vasudevan, Dengxin Dai, Luc van Gool
Our first contribution is the creation of a large-scale dataset with verbal navigation instructions.
no code implementations • CVPR 2018 • Arun Balajee Vasudevan, Dengxin Dai, Luc van Gool
To that end, we present a new video dataset for OR, with 30, 000 objects over 5, 000 stereo video sequences annotated for their descriptions and gaze.
no code implementations • 10 Nov 2017 • Arun Balajee Vasudevan, Dengxin Dai, Luc van Gool
This paper investigates Object Referring with Spoken Language (ORSpoken) by presenting two datasets and one novel approach.
1 code implementation • 1 May 2017 • Arun Balajee Vasudevan, Michael Gygli, Anna Volokitin, Luc van Gool
Although the problem of automatic video summarization has recently received a lot of attention, the problem of creating a video summary that also highlights elements relevant to a search query has been less studied.