no code implementations • 17 Sep 2016 • Ankit Gandhi, Arjun Sharma, Arijit Biswas, Om Deshmukh
There are total M+1 (M is the number of modalities) components in the proposed network.
no code implementations • 12 Jul 2016 • Sohil Shah, Kuldeep Kulkarni, Arijit Biswas, Ankit Gandhi, Om Deshmukh, Larry Davis
Typical textual descriptions that accompany online videos are 'weak': i. e., they mention the main concepts in the video but not their corresponding spatio-temporal locations.