Search Results for author: Jean Mercat

Found 14 papers, 6 papers with code

Should VLMs be Pre-trained with Image Data?

no code implementations10 Mar 2025 Sedrick Keh, Jean Mercat, Samir Yitzhak Gadre, Kushal Arora, Igor Vasiljevic, Benjamin Burchfiel, Shuran Song, Russ Tedrake, Thomas Kollar, Ludwig Schmidt, Achal Dave

We find that pre-training with a mixture of image and text data allows models to perform better on vision-language tasks while maintaining strong performance on text-only evaluations.

Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model

no code implementations6 Dec 2024 Keunwoo Peter Yu, Achal Dave, Rares Ambrus, Jean Mercat

Recent advances in vision-language models (VLMs) have shown great promise in connecting images and text, but extending these models to long videos remains challenging due to the rapid growth in token counts.

EgoSchema Language Modeling +2

Linearizing Large Language Models

1 code implementation10 May 2024 Jean Mercat, Igor Vasiljevic, Sedrick Keh, Kushal Arora, Achal Dave, Adrien Gaidon, Thomas Kollar

Linear transformers have emerged as a subquadratic-time alternative to softmax attention and have garnered significant interest due to their fixed-size recurrent state that lowers inference cost.

In-Context Learning Mamba

Residual Q-Learning: Offline and Online Policy Customization without Value

no code implementations NeurIPS 2023 Chenran Li, Chen Tang, Haruki Nishimura, Jean Mercat, Masayoshi Tomizuka, Wei Zhan

Specifically, we formulate the customization problem as a Markov Decision Process (MDP) with a reward function that combines 1) the inherent reward of the demonstration; and 2) the add-on reward specified by the downstream task.

Imitation Learning Q-Learning

RAP: Risk-Aware Prediction for Robust Planning

1 code implementation4 Oct 2022 Haruki Nishimura, Jean Mercat, Blake Wulfe, Rowan Mcallister, Adrien Gaidon

Robust planning in interactive scenarios requires predicting the uncertain future to make risk-aware decisions.

Prediction

Control-Aware Prediction Objectives for Autonomous Driving

no code implementations28 Apr 2022 Rowan Mcallister, Blake Wulfe, Jean Mercat, Logan Ellis, Sergey Levine, Adrien Gaidon

Autonomous vehicle software is typically structured as a modular pipeline of individual components (e. g., perception, prediction, and planning) to help separate concerns into interpretable sub-tasks.

Autonomous Driving Prediction +1

Dynamics-Aware Comparison of Learned Reward Functions

no code implementations ICLR 2022 Blake Wulfe, Ashwin Balakrishna, Logan Ellis, Jean Mercat, Rowan Mcallister, Adrien Gaidon

The ability to learn reward functions plays an important role in enabling the deployment of intelligent agents in the real world.

Higher Order Linear Transformer

no code implementations28 Oct 2020 Jean Mercat

Following up on the linear transformer part of the article from Katharopoulos et al., that takes this idea from Shen et al., the trick that produces a linear complexity for the attention mechanism is re-used and extended to a second-order approximation of the softmax normalization.

Social Attention for Autonomous Decision-Making in Dense Traffic

no code implementations27 Nov 2019 Edouard Leurent, Jean Mercat

We study the design of learning architectures for behavioural planning in a dense traffic setting.

Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.