Search Results for author: Jean-Francois Ton

Found 16 papers, 4 papers with code

Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation

no code implementations8 Mar 2024 Xiaoying Zhang, Jean-Francois Ton, Wei Shen, Hongning Wang, Yang Liu

We introduce Adversarial Policy Optimization (AdvPO), a novel solution to the pervasive issue of reward over-optimization in Reinforcement Learning from Human Feedback (RLHF) for Large Language Models (LLMs).

Dataset Fairness: Achievable Fairness on Your Data With Utility Guarantees

no code implementations27 Feb 2024 Muhammad Faaiz Taufiq, Jean-Francois Ton, Yang Liu

In machine learning fairness, training models which minimize disparity across different sensitive groups often leads to diminished accuracy, a phenomenon known as the fairness-accuracy trade-off.

Fairness

Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting

no code implementations16 Feb 2024 Jiaheng Wei, Yuanshun Yao, Jean-Francois Ton, Hongyi Guo, Andrew Estornell, Yang Liu

In this work, we propose Factualness Evaluations via Weighting LLMs (FEWL), the first hallucination metric that is specifically designed for the scenario when gold-standard answers are absent.

Hallucination In-Context Learning

Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits

1 code implementation NeurIPS 2023 Muhammad Faaiz Taufiq, Arnaud Doucet, Rob Cornish, Jean-Francois Ton

Off-Policy Evaluation (OPE) in contextual bandits is crucial for assessing new policies using existing data without costly experimentation.

Causal Inference Multi-Armed Bandits +1

Deep Concept Removal

no code implementations9 Oct 2023 Yegor Klochkov, Jean-Francois Ton, Ruocheng Guo, Yang Liu, Hang Li

We address the problem of concept removal in deep neural networks, aiming to learn representations that do not encode certain specified concepts (e. g., gender etc.)

Attribute Out-of-Distribution Generalization

Invariant Learning via Probability of Sufficient and Necessary Causes

1 code implementation NeurIPS 2023 Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang

To capture the information of sufficient and necessary causes, we employ a classical concept, the probability of sufficiency and necessary causes (PNS), which indicates the probability of whether one is the necessary and sufficient cause.

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

1 code implementation10 Aug 2023 Yang Liu, Yuanshun Yao, Jean-Francois Ton, Xiaoying Zhang, Ruocheng Guo, Hao Cheng, Yegor Klochkov, Muhammad Faaiz Taufiq, Hang Li

However, a major challenge faced by practitioners is the lack of clear guidance on evaluating whether LLM outputs align with social norms, values, and regulations.

Fairness Models Alignment

Conformal Off-Policy Prediction in Contextual Bandits

no code implementations9 Jun 2022 Muhammad Faaiz Taufiq, Jean-Francois Ton, Rob Cornish, Yee Whye Teh, Arnaud Doucet

Most off-policy evaluation methods for contextual bandits have focused on the expected outcome of a policy, which is estimated via methods that at best provide only asymptotic guarantees.

Conformal Prediction Multi-Armed Bandits +1

Regularized Training of Nearest Neighbor Language Models

no code implementations NAACL (ACL) 2022 Jean-Francois Ton, Walter Talbott, Shuangfei Zhai, Josh Susskind

In particular, we find that the added L2 regularization seems to improve the performance for high-frequency words without deteriorating the performance for low frequency ones.

L2 Regularization Language Modelling

Meta Learning for Causal Direction

no code implementations6 Jul 2020 Jean-Francois Ton, Dino Sejdinovic, Kenji Fukumizu

Based on recent developments in meta learning as well as in causal inference, we introduce a novel generative model that allows distinguishing cause and effect in the small data setting.

Causal Inference Meta-Learning

Robust Pruning at Initialization

no code implementations ICLR 2021 Soufiane Hayou, Jean-Francois Ton, Arnaud Doucet, Yee Whye Teh

Overparameterized Neural Networks (NN) display state-of-the-art performance.

MetaFun: Meta-Learning with Iterative Functional Updates

1 code implementation ICML 2020 Jin Xu, Jean-Francois Ton, Hyunjik Kim, Adam R. Kosiorek, Yee Whye Teh

We develop a functional encoder-decoder approach to supervised meta-learning, where labeled data is encoded into an infinite-dimensional functional representation rather than a finite-dimensional one.

Few-Shot Image Classification Meta-Learning

Noise Contrastive Meta-Learning for Conditional Density Estimation using Kernel Mean Embeddings

no code implementations5 Jun 2019 Jean-Francois Ton, Lucian Chan, Yee Whye Teh, Dino Sejdinovic

Current meta-learning approaches focus on learning functional representations of relationships between variables, i. e. on estimating conditional expectations in regression.

Density Estimation Meta-Learning +1

Automated Model Selection with Bayesian Quadrature

no code implementations26 Feb 2019 Henry Chai, Jean-Francois Ton, Roman Garnett, Michael A. Osborne

We present a novel technique for tailoring Bayesian quadrature (BQ) to model selection.

Model Selection

Towards A Unified Analysis of Random Fourier Features

no code implementations24 Jun 2018 Zhu Li, Jean-Francois Ton, Dino Oglic, Dino Sejdinovic

We study both the standard random Fourier features method for which we improve the existing bounds on the number of features required to guarantee the corresponding minimax risk convergence rate of kernel ridge regression, as well as a data-dependent modification which samples features proportional to \emph{ridge leverage scores} and further reduces the required number of features.

Cannot find the paper you are looking for? You can Submit a new open access paper.