no code implementations • 10 Feb 2022 • Max Cohen, Guillaume Quispe, Sylvain Le Corff, Charles Ollion, Eric Moulines
In this work, we propose a new model to train the prior and the encoder/decoder networks simultaneously.
no code implementations • 20 Sep 2021 • Alice Martin Donati, Guillaume Quispe, Charles Ollion, Sylvain Le Corff, Florian Strub, Olivier Pietquin
This paper introduces TRUncated ReinForcement Learning for Language (TrufLL), an original ap-proach to train conditional language models from scratch by only using reinforcement learning (RL).