Paper tables with annotated results for A Better Variant of Self-Critical Sequence Training

Paper

A Better Variant of Self-Critical Sequence Training

In this work, we present a simple yet better variant of Self-Critical Sequence Training. We make a simple change in the choice of baseline function in REINFORCE algorithm. The new baseline can bring better performance with no extra cost, compared to the greedy decoding baseline.

PDF Paper record

Results in Papers With Code

(↓ scroll down to see all results)

A Better Variant of Self-Critical Sequence Training

Reader Guidelines

Editor Guidelines