no code implementations • 12 Dec 2023 • Pinelopi Papalampidi, Skanda Koppula, Shreya Pathak, Justin Chiu, Joe Heyward, Viorica Patraucean, Jiajun Shen, Antoine Miech, Andrew Zisserman, Aida Nematzadeh
Understanding long, real-world videos requires modeling of long-range visual dependencies.