Search Results for author: Grégoire Mialon

Found 10 papers, 9 papers with code

WorldSense: A Synthetic Benchmark for Grounded Reasoning in Large Language Models

1 code implementation27 Nov 2023 Youssef Benchekroun, Megi Dervishi, Mark Ibrahim, Jean-Baptiste Gaya, Xavier Martinet, Grégoire Mialon, Thomas Scialom, Emmanuel Dupoux, Dieuwke Hupkes, Pascal Vincent

We propose WorldSense, a benchmark designed to assess the extent to which LLMs are consistently able to sustain tacit world models, by testing how they draw simple inferences from descriptions of simple arrangements of entities.

In-Context Learning

GAIA: a benchmark for General AI Assistants

1 code implementation21 Nov 2023 Grégoire Mialon, Clémentine Fourrier, Craig Swift, Thomas Wolf, Yann Lecun, Thomas Scialom

GAIA's philosophy departs from the current trend in AI benchmarks suggesting to target tasks that are ever more difficult for humans.

Philosophy

Self-Supervised Learning with Lie Symmetries for Partial Differential Equations

1 code implementation NeurIPS 2023 Grégoire Mialon, Quentin Garrido, Hannah Lawrence, Danyal Rehman, Yann Lecun, Bobak T. Kiani

Machine learning for differential equations paves the way for computationally efficient alternatives to numerical solvers, with potentially broad impacts in science and engineering.

Representation Learning Self-Supervised Learning

On Inductive Biases for Machine Learning in Data Constrained Settings

1 code implementation21 Feb 2023 Grégoire Mialon

More precisely, we study the problem of safe sample screening, i. e, executing simple tests to discard uninformative samples from a dataset even before fitting a machine learning model, without affecting the optimal model.

Transfer Learning

Variance Covariance Regularization Enforces Pairwise Independence in Self-Supervised Representations

no code implementations29 Sep 2022 Grégoire Mialon, Randall Balestriero, Yann Lecun

Self-Supervised Learning (SSL) methods such as VICReg, Barlow Twins or W-MSE avoid collapse of their joint embedding architectures by constraining or regularizing the covariance matrix of their projector's output.

Domain Generalization Self-Supervised Learning

GraphiT: Encoding Graph Structure in Transformers

1 code implementation10 Jun 2021 Grégoire Mialon, Dexiong Chen, Margot Selosse, Julien Mairal

We show that viewing graphs as sets of node features and incorporating structural and positional information into a transformer architecture is able to outperform representations learned with classical graph neural networks (GNNs).

A Trainable Optimal Transport Embedding for Feature Aggregation and its Relationship to Attention

1 code implementation ICLR 2021 Grégoire Mialon, Dexiong Chen, Alexandre d'Aspremont, Julien Mairal

We address the problem of learning on sets of features, motivated by the need of performing pooling operations in long biological sequences of varying sizes, with long-range dependencies, and possibly few labeled data.

Screening Data Points in Empirical Risk Minimization via Ellipsoidal Regions and Safe Loss Functions

1 code implementation5 Dec 2019 Grégoire Mialon, Alexandre d'Aspremont, Julien Mairal

We design simple screening tests to automatically discard data samples in empirical risk minimization without losing optimization guarantees.

regression

A Kernel Perspective for Regularizing Deep Neural Networks

1 code implementation30 Sep 2018 Alberto Bietti, Grégoire Mialon, Dexiong Chen, Julien Mairal

We propose a new point of view for regularizing deep neural networks by using the norm of a reproducing kernel Hilbert space (RKHS).

Cannot find the paper you are looking for? You can Submit a new open access paper.