1 code implementation • 15 Dec 2024 • Mariam Hassan, Sebastian Stapf, Ahmad Rahimi, Pedro M B Rezende, Yasaman Haghighi, David Brüggemann, Isinsu Katircioglu, Lin Zhang, Xiaoran Chen, Suman Saha, Marco Cannici, Elie Aljalbout, Botao Ye, Xi Wang, Aram Davtyan, Mathieu Salzmann, Davide Scaramuzza, Marc Pollefeys, Paolo Favaro, Alexandre Alahi
We present GEM, a Generalizable Ego-vision Multimodal world model that predicts future frames using a reference frame, sparse features, human poses, and ego-trajectories.
no code implementations • 21 Mar 2024 • Aram Davtyan, Sepehr Sameni, Björn Ommer, Paolo Favaro
We call our model CAGE for visual Composition and Animation for video GEneration.
no code implementations • 7 Dec 2023 • Llukman Cerkezi, Aram Davtyan, Sepehr Sameni, Paolo Favaro
The growing interest in novel view synthesis, driven by Neural Radiance Field (NeRF) models, is hindered by scalability issues due to their reliance on precisely annotated multi-view images.
1 code implementation • 6 Jun 2023 • Aram Davtyan, Paolo Favaro
We propose a novel unsupervised method to autoregressively generate videos from a single frame and a sparse motion input.
no code implementations • ICCV 2023 • Aram Davtyan, Sepehr Sameni, Paolo Favaro
We call our model Random frame conditioned flow Integration for VidEo pRediction, or, in short, RIVER.
no code implementations • 13 Apr 2022 • Aram Davtyan, Paolo Favaro
We present GLASS, a method for Global and Local Action-driven Sequence Synthesis.
no code implementations • 7 Jul 2021 • Aram Davtyan, Sepehr Sameni, Llukman Cerkezi, Givi Meishvilli, Adam Bielski, Paolo Favaro
Moreover, we show that the Kalman Filter dynamical model for the evolution of the unknown parameters can be used to capture the gradient dynamics of advanced methods such as Momentum and Adam.