The Natural Stories dataset consists of English texts edited to contain many low-frequency syntactic constructions while still sounding fluent to native speakers. The corpus is annotated with hand-corrected parse trees and includes self-paced reading time data.
Source: The Natural Stories CorpusPaper | Code | Results | Date | Stars |
---|