Temporal/Causal QA

4 papers with code • 1 benchmark • 2 datasets

Most implemented papers

Flamingo: a Visual Language Model for Few-Shot Learning

mlfoundations/open_flamingo DeepMind 2022

Building models that can be rapidly adapted to novel tasks using only a handful of annotated examples is an open challenge for multimodal machine learning research.

PaLI-X: On Scaling up a Multilingual Vision and Language Model

kyegomez/PALI 29 May 2023

We present the training recipe and results of scaling up PaLI-X, a multilingual vision and language model, both in terms of size of the components and the breadth of its training task mixture.

Generative Pretraining in Multimodality

baaivision/emu 11 Jul 2023

We present Emu, a Transformer-based multimodal foundation model, which can seamlessly generate images and texts in multimodal context.

PaLI-3 Vision Language Models: Smaller, Faster, Stronger

kyegomez/PALI3 13 Oct 2023

This paper presents PaLI-3, a smaller, faster, and stronger vision language model (VLM) that compares favorably to similar models that are 10x larger.