Search Results for author: James Hensman

Found 40 papers, 15 papers with code

QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs

1 code implementation • 30 Mar 2024 • Saleh Ashkboos, Amirkeivan Mohtashami, Maximilian L. Croci, Bo Li, Martin Jaggi, Dan Alistarh, Torsten Hoefler, James Hensman

We introduce QuaRot, a new Quantization scheme based on Rotations, which is able to quantize LLMs end-to-end, including all weights, activations, and KV cache in 4 bits.

Quantization

132

Paper
Code

Structured Entity Extraction Using Large Language Models

no code implementations • 6 Feb 2024 • Haolun Wu, Ye Yuan, Liana Mikaelyan, Alexander Meulemans, Xue Liu, James Hensman, Bhaskar Mitra

Recent advances in machine learning have significantly impacted the field of information extraction, with Large Language Models (LLMs) playing a pivotal role in extracting structured information from unstructured text.

Paper
Add Code

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

1 code implementation • 26 Jan 2024 • Saleh Ashkboos, Maximilian L. Croci, Marcelo Gennari do Nascimento, Torsten Hoefler, James Hensman

Large language models have become the cornerstone of natural language processing, but their use comes with substantial costs in terms of compute and memory resources.

289

Paper
Code

DiSK: A Diffusion Model for Structured Knowledge

no code implementations • 8 Dec 2023 • Ouail Kitouni, Niklas Nolte, James Hensman, Bhaskar Mitra

We introduce Diffusion Models of Structured Knowledge (DiSK) - a new architecture and training approach specialized for structured data.

Imputation Inductive Bias

Paper
Add Code

Sparse Gaussian Processes with Spherical Harmonic Features Revisited

no code implementations • 28 Mar 2023 • Stefanos Eleftheriadis, Dominic Richards, James Hensman

Further, we introduce sparseness in the eigenbasis by variational learning of the spherical harmonic phases.

Gaussian Processes

Paper
Add Code

Additive Gaussian Processes Revisited

1 code implementation • 20 Jun 2022 • Xiaoyu Lu, Alexis Boukouvalas, James Hensman

Gaussian Process (GP) models are a class of flexible non-parametric models that have rich representational power.

Gaussian Processes

Paper
Code

Improved Inverse-Free Variational Bounds for Sparse Gaussian Processes

no code implementations • pproximateinference AABI Symposium 2022 • Mark van der Wilk, Artem Artemev, James Hensman

The need for matrix decompositions (inverses) is often named as a major impediment to scaling Gaussian process (GP) models, even in efficient approximations.

Gaussian Processes

Paper
Add Code

Deep Neural Networks as Point Estimates for Deep Gaussian Processes

no code implementations • NeurIPS 2021 • Vincent Dutordoir, James Hensman, Mark van der Wilk, Carl Henrik Ek, Zoubin Ghahramani, Nicolas Durrande

This results in models that can either be seen as neural networks with improved uncertainty prediction or deep Gaussian processes with increased prediction accuracy.

Bayesian Inference Gaussian Processes +1

Paper
Add Code

GPflux: A Library for Deep Gaussian Processes

1 code implementation • 12 Apr 2021 • Vincent Dutordoir, Hugh Salimbeni, Eric Hambro, John McLeod, Felix Leibfried, Artem Artemev, Mark van der Wilk, James Hensman, Marc P. Deisenroth, ST John

GPflux is compatible with and built on top of the Keras deep learning eco-system.

Gaussian Processes

118

Paper
Code

Sparse Gaussian Processes with Spherical Harmonic Features

no code implementations • ICML 2020 • Vincent Dutordoir, Nicolas Durrande, James Hensman

We introduce a new class of inter-domain variational Gaussian processes (GP) where data is mapped onto the unit hypersphere in order to use spherical harmonic representations.

Gaussian Processes

Paper
Add Code

Amortized variance reduction for doubly stochastic objectives

no code implementations • 9 Mar 2020 • Ayman Boustati, Sattar Vakili, James Hensman, ST John

Approximate inference in complex probabilistic models such as deep Gaussian processes requires the optimisation of doubly stochastic objective functions.

Gaussian Processes

Paper
Add Code

A Framework for Interdomain and Multioutput Gaussian Processes

1 code implementation • 2 Mar 2020 • Mark van der Wilk, Vincent Dutordoir, ST John, Artem Artemev, Vincent Adam, James Hensman

One obstacle to the use of Gaussian processes (GPs) in large-scale problems, and as a component in deep learning system, is the need for bespoke derivations and implementations for small variations in the model or inference.

Gaussian Processes

1,798

Paper
Code

Doubly Sparse Variational Gaussian Processes

no code implementations • 15 Jan 2020 • Vincent Adam, Stefanos Eleftheriadis, Nicolas Durrande, Artem Artemev, James Hensman

The use of Gaussian process models is typically limited to datasets with a few tens of thousands of observations due to their complexity and memory footprint.

Gaussian Processes valid

Paper
Add Code

Variational Gaussian Process Models without Matrix Inverses

no code implementations • pproximateinference AABI Symposium 2019 • Mark van der Wilk, ST John, Artem Artemev, James Hensman

We present a variational approximation for a wide range of GP models that does not require a matrix inverse to be performed at each optimisation step.

Paper
Add Code

Overcoming Mean-Field Approximations in Recurrent Gaussian Process Models

1 code implementation • 13 Jun 2019 • Alessandro Davide Ialongo, Mark van der Wilk, James Hensman, Carl Edward Rasmussen

As we demonstrate in our experiments, the factorisation between latent system states and transition function can lead to a miscalibrated posterior and to learning unnecessarily large noise terms.

Variational Inference

Paper
Code

Deep Gaussian Processes with Importance-Weighted Variational Inference

1 code implementation • 14 May 2019 • Hugh Salimbeni, Vincent Dutordoir, James Hensman, Marc Peter Deisenroth

Deep Gaussian processes (DGPs) can model complex marginal densities as well as complex mappings.

Gaussian Processes Variational Inference

Paper
Code

Banded Matrix Operators for Gaussian Markov Models in the Automatic Differentiation Era

no code implementations • 26 Feb 2019 • Nicolas Durrande, Vincent Adam, Lucas Bordeaux, Stefanos Eleftheriadis, James Hensman

Banded matrices can be used as precision matrices in several models including linear state-space models, some Gaussian processes, and Gaussian Markov random fields.

Gaussian Processes Variational Inference

Paper
Add Code

Bayesian Image Classification with Deep Convolutional Gaussian Processes

no code implementations • 15 Feb 2019 • Vincent Dutordoir, Mark van der Wilk, Artem Artemev, James Hensman

We also demonstrate that our fully Bayesian approach improves on dropout-based Bayesian deep learning methods in terms of uncertainty and marginal likelihood estimates.

Classification Decision Making +5

Paper
Add Code

Non-Factorised Variational Inference in Dynamical Systems

no code implementations • 14 Dec 2018 • Alessandro Davide Ialongo, Mark van der Wilk, James Hensman, Carl Edward Rasmussen

We focus on variational inference in dynamical systems where the discrete time transition function (or evolution rule) is modelled by a Gaussian process.

Variational Inference

Paper
Add Code

Infinite-Horizon Gaussian Processes

1 code implementation • NeurIPS 2018 • Arno Solin, James Hensman, Richard E. Turner

The complexity is still cubic in the state dimension $m$ which is an impediment to practical application.

Gaussian Processes

Paper
Code

Gaussian Process Conditional Density Estimation

no code implementations • NeurIPS 2018 • Vincent Dutordoir, Hugh Salimbeni, Marc Deisenroth, James Hensman

Conditional Density Estimation (CDE) models deal with estimating conditional distributions.

Density Estimation Few-Shot Learning +2

Paper
Add Code

Learning Invariances using the Marginal Likelihood

no code implementations • NeurIPS 2018 • Mark van der Wilk, Matthias Bauer, ST John, James Hensman

Generalising well in supervised learning tasks relies on correctly extrapolating the training data to a large region of the input space.

Data Augmentation Gaussian Processes +2

Paper
Add Code

Large-Scale Cox Process Inference using Variational Fourier Features

no code implementations • ICML 2018 • S. T. John, James Hensman

This allows us to formulate a grid-free approximation that scales well with the number of data points and the size of the domain.

Small Data Image Classification

Paper
Add Code

Natural Gradients in Practice: Non-Conjugate Variational Inference in Gaussian Process Models

no code implementations • 24 Mar 2018 • Hugh Salimbeni, Stefanos Eleftheriadis, James Hensman

The natural gradient method has been used effectively in conjugate Gaussian process models, but the non-conjugate case has been largely unexplored.

Variational Inference

Paper
Add Code

Convolutional Gaussian Processes

4 code implementations • NeurIPS 2017 • Mark van der Wilk, Carl Edward Rasmussen, James Hensman

We present a practical way of introducing convolutional structure into Gaussian processes, making them more suited to high-dimensional inputs like images.

Gaussian Processes

8,363

Paper
Code

Pseudo-extended Markov chain Monte Carlo

1 code implementation • NeurIPS 2019 • Christopher Nemeth, Fredrik Lindsten, Maurizio Filippone, James Hensman

In this paper, we introduce the pseudo-extended MCMC method as a simple approach for improving the mixing of the MCMC sampler for multi-modal posterior distributions.

Paper
Code

Scalable Joint Models for Reliable Uncertainty-Aware Event Prediction

no code implementations • 16 Aug 2017 • Hossein Soleimani, James Hensman, Suchi Saria

Alternatively, state-of-the-art joint modeling techniques can be used for jointly modeling the longitudinal and event data and compute event probabilities conditioned on the longitudinal observations.

Gaussian Processes Imputation +2

Paper
Add Code

Identification of Gaussian Process State Space Models

no code implementations • NeurIPS 2017 • Stefanos Eleftheriadis, Thomas F. W. Nicholson, Marc Peter Deisenroth, James Hensman

To address this challenge, we impose a structured Gaussian variational posterior distribution over the latent states, which is parameterised by a recognition model in the form of a bi-directional recurrent neural network.

Paper
Add Code

Variational Fourier features for Gaussian processes

1 code implementation • 21 Nov 2016 • James Hensman, Nicolas Durrande, Arno Solin

This work brings together two powerful concepts in Gaussian processes: the variational approach to sparse approximation and the spectral representation of Gaussian processes.

Gaussian Processes

Paper
Code

GPflow: A Gaussian process library using TensorFlow

1 code implementation • 27 Oct 2016 • Alexander G. de G. Matthews, Mark van der Wilk, Tom Nickson, Keisuke Fujii, Alexis Boukouvalas, Pablo León-Villagrá, Zoubin Ghahramani, James Hensman

GPflow is a Gaussian process library that uses TensorFlow for its core computations and Python for its front end.

Gaussian Processes Variational Inference

1,798

Paper
Code

Chained Gaussian Processes

1 code implementation • 18 Apr 2016 • Alan D. Saul, James Hensman, Aki Vehtari, Neil D. Lawrence

Gaussian process models are flexible, Bayesian non-parametric approaches to regression.

Additive models Gaussian Processes

Paper
Code

MCMC for Variationally Sparse Gaussian Processes

no code implementations • NeurIPS 2015 • James Hensman, Alexander G. de G. Matthews, Maurizio Filippone, Zoubin Ghahramani

This paper simultaneously addresses these, using a variational approximation to the posterior which is sparse in support of the function but otherwise free-form.

Gaussian Processes

Paper
Add Code

Spike and Slab Gaussian Process Latent Variable Models

no code implementations • 10 May 2015 • Zhenwen Dai, James Hensman, Neil Lawrence

The Gaussian process latent variable model (GP-LVM) is a popular approach to non-linear probabilistic dimensionality reduction.

Dimensionality Reduction Gaussian Processes +2

Paper
Add Code

On Sparse variational methods and the Kullback-Leibler divergence between stochastic processes

no code implementations • 27 Apr 2015 • Alexander G. de G. Matthews, James Hensman, Richard E. Turner, Zoubin Ghahramani

We then discuss augmented index sets and show that, contrary to previous works, marginal consistency of augmentation is not enough to guarantee consistency of variational inference with the original model.

Variational Inference

Paper
Add Code

Nested Variational Compression in Deep Gaussian Processes

no code implementations • 3 Dec 2014 • James Hensman, Neil D. Lawrence

Deep Gaussian processes provide a flexible approach to probabilistic modelling of data using either supervised or unsupervised learning.

Gaussian Processes Variational Inference

Paper
Add Code

Scalable Variational Gaussian Process Classification

1 code implementation • 7 Nov 2014 • James Hensman, Alex Matthews, Zoubin Ghahramani

Gaussian process classification is a popular method with a number of appealing properties.

Classification General Classification

Paper
Code

Gaussian Process Models with Parallelization and GPU acceleration

no code implementations • 18 Oct 2014 • Zhenwen Dai, Andreas Damianou, James Hensman, Neil Lawrence

In this work, we present an extension of Gaussian process (GP) models with sophisticated parallelization and GPU acceleration.

Paper
Add Code

Fast nonparametric clustering of structured time-series

no code implementations • 8 Jan 2014 • James Hensman, Magnus Rattray, Neil D. Lawrence

In this publication, we combine two Bayesian non-parametric models: the Gaussian Process (GP) and the Dirichlet Process (DP).

Clustering Nonparametric Clustering +3

Paper
Add Code

Gaussian Processes for Big Data

8 code implementations • 26 Sep 2013 • James Hensman, Nicolo Fusi, Neil D. Lawrence

We introduce stochastic variational inference for Gaussian process models.

Gaussian Processes Variational Inference

3,407

Paper
Code

Fast Variational Inference in the Conjugate Exponential Family

no code implementations • NeurIPS 2012 • James Hensman, Magnus Rattray, Neil D. Lawrence

We present a general method for deriving collapsed variational inference algorithms for probabilistic models in the conjugate exponential family.

Variational Inference

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.