Search Results for author: Victor Gallego

Found 12 papers, 11 papers with code

Configurable Safety Tuning of Language Models with Synthetic Preference Data

1 code implementation • 30 Mar 2024 • Victor Gallego

State-of-the-art language model fine-tuning techniques, such as Direct Preference Optimization (DPO), restrict user control by hard-coding predefined behaviors into the model.

Language Modelling

Paper
Code

Distilled Self-Critique of LLMs with Synthetic Data: a Bayesian Perspective

1 code implementation • 4 Dec 2023 • Victor Gallego

This paper proposes an interpretation of RLAIF as Bayesian inference by introducing distilled Self-Critique (dSC), which refines the outputs of a LLM through a Gibbs sampler that is later distilled into a fine-tuned model.

Bayesian Inference

Paper
Code

ZYN: Zero-Shot Reward Models with Yes-No Questions for RLAIF

2 code implementations • 11 Aug 2023 • Victor Gallego

In this work, we address the problem of directing the text generation of a language model (LM) towards a desired behavior, aligning the generated text with the preferences of the human operator.

Attribute Language Modelling +1

Paper
Code

Fast Adaptation with Bradley-Terry Preference Models in Text-To-Image Classification and Generation

no code implementations • 15 Jul 2023 • Victor Gallego

Recently, large multimodal models, such as CLIP and Stable Diffusion have experimented tremendous successes in both foundations and applications.

Image Classification

Paper
Add Code

Personalizing Text-to-Image Generation via Aesthetic Gradients

1 code implementation • 25 Sep 2022 • Victor Gallego

This work proposes aesthetic gradients, a method to personalize a CLIP-conditioned diffusion model by guiding the generative process towards custom aesthetics defined by the user from a set of images.

Text-to-Image Generation

708

Paper
Code

Protecting Classifiers From Attacks. A Bayesian Approach

1 code implementation • 18 Apr 2020 • Victor Gallego, Roi Naveiro, Alberto Redondo, David Rios Insua, Fabrizio Ruggeri

Classification problems in security settings are usually modeled as confrontations in which an adversary tries to fool a classifier manipulating the covariates of instances to obtain a benefit.

Paper
Code

Adversarial Machine Learning: Bayesian Perspectives

1 code implementation • 7 Mar 2020 • David Rios Insua, Roi Naveiro, Victor Gallego, Jason Poulos

Adversarial Machine Learning (AML) is emerging as a major field aimed at protecting machine learning (ML) systems against security threats: in certain scenarios there may be adversaries that actively manipulate input data to fool learning systems.

Adversarial Robustness BIG-bench Machine Learning

Paper
Code

Variationally Inferred Sampling Through a Refined Bound

1 code implementation • pproximateinference AABI Symposium 2019 • Victor Gallego, David Rios Insua

A framework for efficient Bayesian inference in probabilistic programs is introduced by embedding a sampler inside a variational posterior approximation.

Bayesian Inference Density Estimation +2

Paper
Code

Variationally Inferred Sampling Through a Refined Bound for Probabilistic Programs

1 code implementation • 26 Aug 2019 • Victor Gallego, David Rios Insua

A framework to boost the efficiency of Bayesian inference in probabilistic programs is introduced by embedding a sampler inside a variational posterior approximation.

Bayesian Inference Density Estimation +2

Paper
Code

Opponent Aware Reinforcement Learning

1 code implementation • 22 Aug 2019 • Victor Gallego, Roi Naveiro, David Rios Insua, David Gomez-Ullate Oteiza

We introduce Threatened Markov Decision Processes (TMDPs) as an extension of the classical Markov Decision Process framework for Reinforcement Learning (RL).

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Stochastic Gradient MCMC with Repulsive Forces

2 code implementations • 30 Nov 2018 • Victor Gallego, David Rios Insua

We propose a unifying view of two different Bayesian inference algorithms, Stochastic Gradient Markov Chain Monte Carlo (SG-MCMC) and Stein Variational Gradient Descent (SVGD), leading to improved and efficient novel sampling schemes.

Bayesian Inference valid

Paper
Code

Reinforcement Learning under Threats

1 code implementation • 5 Sep 2018 • Victor Gallego, Roi Naveiro, David Rios Insua

In several reinforcement learning (RL) scenarios, mainly in security settings, there may be adversaries trying to interfere with the reward generating process.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.