Search Results for author: Victor Gomes

Found 1 paper, 0 papers with code

Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities

no code implementations • 9 Nov 2023 • AJ Piergiovanni, Isaac Noble, Dahun Kim, Michael S. Ryoo, Victor Gomes, Anelia Angelova

We propose a multimodal model, called Mirasol3B, consisting of an autoregressive component for the time-synchronized modalities (audio and video), and an autoregressive component for the context modalities which are not necessarily aligned in time but are still sequential.

Action Classification, Audio Classification, +1
