The Lenta Short Sentences dataset is a Russian-language text dataset for language modelling. It consists of 236K sentences sampled from the Lenta News dataset.

Source: https://arxiv.org/pdf/2005.02470.pdf

Tasks


  • Language modelling

License


  • Unknown

Modalities


  • Text

Languages


  • Russian