no code implementations • LREC 2016 • Michal K{\v{r}}en, V{\'a}clav Cvr{\v{c}}ek, Tom{\'a}{\v{s}} {\v{C}}apka, Anna {\v{C}}erm{\'a}kov{\'a}, Milena Hn{\'a}tkov{\'a}, Lucie Chlumsk{\'a}, Tom{\'a}{\v{s}} Jel{\'\i}nek, Dominika Kov{\'a}{\v{r}}{\'\i}kov{\'a}, Vladim{\'\i}r Petkevi{\v{c}}, Pavel Proch{\'a}zka, Hana Skoumalov{\'a}, Michal {\v{S}}krabal, Petr Trune{\v{c}}ek, Pavel Vond{\v{r}}i{\v{c}}ka, Adrian Jan Zasina
The paper concentrates on the design, composition and annotation of SYN2015, a new 100-million representative corpus of contemporary written Czech.