REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models

NeurIPS 2017 George TuckerAndriy MnihChris J. MaddisonDieterich LawsonJascha Sohl-Dickstein

Learning in models with discrete latent variables is challenging due to high variance gradient estimators. Generally, approaches have relied on control variates to reduce the variance of the REINFORCE estimator... (read more)

PDF Abstract

Evaluation Results from the Paper

  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.