no code implementations • 14 Sep 2020 • Kumiko Tanaka-Ishii, Shuntaro Takahashi
This article considers the fluctuation analysis methods of Taylor and Ebeling & Neiman.
no code implementations • CL 2019 • Shuntaro Takahashi, Kumiko Tanaka-Ishii
Statistical mechanical analyses have revealed that natural language text is characterized by scaling properties, which quantify the global structure in the vocabulary population and the long memory of a text.
no code implementations • 24 Apr 2018 • Shuntaro Takahashi, Kumiko Tanaka-Ishii
Five such tests are considered, with the first two accounting for the vocabulary population and the other three for the long memory of natural language.
no code implementations • 16 Jul 2017 • Shuntaro Takahashi, Kumiko Tanaka-Ishii
Precisely, we demonstrate that a neural language model based on long short-term memory (LSTM) effectively reproduces Zipf's law and Heaps' law, two representative statistical properties underlying natural language.