Black Box Recursive Translations for Molecular Optimization

21 Dec 2019  ·  Farhan Damani, Vishnu Sresht, Stephen Ra ·

Machine learning algorithms for generating molecular structures offer a promising new approach to drug discovery. We cast molecular optimization as a translation problem, where the goal is to map an input compound to a target compound with improved biochemical properties. Remarkably, we observe that when generated molecules are iteratively fed back into the translator, molecular compound attributes improve with each step. We show that this finding is invariant to the choice of translation model, making this a "black box" algorithm. We call this method Black Box Recursive Translation (BBRT), a new inference method for molecular property optimization. This simple, powerful technique operates strictly on the inputs and outputs of any translation model. We obtain new state-of-the-art results for molecular property optimization tasks using our simple drop-in replacement with well-known sequence and graph-based models. Our method provides a significant boost in performance relative to its non-recursive peers with just a simple "for" loop. Further, BBRT is highly interpretable, allowing users to map the evolution of newly discovered compounds from known starting points.

PDF Abstract

Datasets


Results from the Paper


 Ranked #1 on Molecular Graph Generation on ZINC (QED Top-3 metric)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Molecular Graph Generation ZINC BBRT-Seq2Seq QED Top-3 0.948, 0.948, 0.948 # 1
PlogP Top-3 6.74, 6.47, 6,42 # 1
Molecular Graph Generation ZINC BBRT-JTNN QED Top-3 0.948, 0.948, 0.948 # 1
PlogP Top-3 10.13, 10.13, 9.91 # 1

Methods


No methods listed for this paper. Add relevant methods here