In this paper, we study generative models of sequential discrete data. To tackle the exposure bias problem inherent in maximum likelihood estimation (MLE), generative adversarial networks (GANs) are introduced to penalize unrealistic generated samples...
| METHOD | TYPE |
|---|---|
| | Policy Gradient Methods |