Search Results for author: Yann Dubois

Found 13 papers, 11 papers with code

Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators

1 code implementation 6 Apr 2024 Yann Dubois, Balázs Galambosi, Percy Liang, Tatsunori B. Hashimoto

Even simple, known confounders such as preference for longer outputs remain in existing automated evaluation metrics.

Chatbot, counterfactual

Identifying the Risks of LM Agents with an LM-Emulated Sandbox

1 code implementation 25 Sep 2023 Yangjun Ruan, Honghua Dong, Andrew Wang, Silviu Pitis, Yongchao Zhou, Jimmy Ba, Yann Dubois, Chris J. Maddison, Tatsunori Hashimoto

Alongside the emulator, we develop an LM-based automatic safety evaluator that examines agent failures and quantifies associated risks.

Language Modelling, valid

AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback

2 code implementations NeurIPS 2023 Yann Dubois, Xuechen Li, Rohan Taori, Tianyi Zhang, Ishaan Gulrajani, Jimmy Ba, Carlos Guestrin, Percy Liang, Tatsunori B. Hashimoto

As a demonstration of the research possible in AlpacaFarm, we find that methods that use a reward model can substantially improve over supervised fine-tuning and that our reference PPO implementation leads to a +10% improvement in win-rate against Davinci003.

Instruction Following

Evaluating Self-Supervised Learning via Risk Decomposition

1 code implementation 6 Feb 2023 Yann Dubois, Tatsunori Hashimoto, Percy Liang

Our decomposition consists of four error components: approximation, representation usability, probe generalization, and encoder generalization.

Representation Learning, Self-Supervised Learning

Is a Caption Worth a Thousand Images? A Controlled Study for Representation Learning

no code implementations 15 Jul 2022 Shibani Santurkar, Yann Dubois, Rohan Taori, Percy Liang, Tatsunori Hashimoto

The development of CLIP [Radford et al., 2021] has sparked a debate on whether language supervision can result in vision models with more transferable representations than traditional image-only methods.

Descriptive, Representation Learning

Optimal Representations for Covariate Shift

2 code implementations ICLR 2022 Yangjun Ruan, Yann Dubois, Chris J. Maddison

Machine learning systems often experience a distribution shift between training and testing.

Ranked #38 on Image Classification on ObjectNet (using extra training data)

Domain Generalization, Image Classification, +1

Learning Optimal Representations with the Decodable Information Bottleneck

1 code implementation NeurIPS 2020 Yann Dubois, Douwe Kiela, David J. Schwab, Ramakrishna Vedantam

We address the question of characterizing and finding optimal representations for supervised learning.

Location Attention for Extrapolation to Longer Sequences

no code implementations ACL 2020 Yann Dubois, Gautier Dagan, Dieuwke Hupkes, Elia Bruni

We hypothesize that models with a separate content- and location-based attention are more likely to extrapolate than those with common attention mechanisms.

Convolutional Conditional Neural Processes

3 code implementations ICLR 2020 Jonathan Gordon, Wessel P. Bruinsma, Andrew Y. K. Foong, James Requeima, Yann Dubois, Richard E. Turner

We introduce the Convolutional Conditional Neural Process (ConvCNP), a new member of the Neural Process family that models translation equivariance in the data.

Inductive Bias, Time Series, +3
