High-Dimensional Bayesian Optimisation with Variational Autoencoders and Deep Metric Learning

We introduce a method combining variational autoencoders (VAEs) and deep metric learning to perform Bayesian optimisation (BO) over high-dimensional and structured input spaces. By adapting ideas from deep metric learning, we use label guidance from the blackbox function to structure the VAE latent space, facilitating the Gaussian process fit and yielding improved BO performance. Importantly for BO problem settings, our method operates in semi-supervised regimes where only few labelled data points are available. We run experiments on three real-world tasks, achieving state-of-the-art results on the penalised logP molecule generation benchmark using just 3% of the labelled data required by previous approaches. As a theoretical contribution, we present a proof of vanishing regret for VAE BO.

PDF Abstract

Datasets


Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Molecular Graph Generation ZINC T-LBO-2 Reconstruction 76.7% # 1
Validty 100% # 1
PlogP Top-3 34.83, 31.1, 29.21 # 1
function evaluations 3450 # 3
Molecular Graph Generation ZINC T-LBO-1 Reconstruction 76.7% # 1
Validty 100% # 1
PlogP Top-3 22.84, 24.06, 21.26 # 1
function evaluations 2300 # 1
Molecular Graph Generation ZINC T-LBO-3 Reconstruction 76.7% # 1
Validty 100% # 1
PlogP Top-3 38.57, 34.83, 34.63 # 1
function evaluations 7750 # 4

Methods