Search Results for author: Kumiko Tanaka-Ishii

Found 14 papers, 1 paper with code

Strahler Number of Natural Language Sentences in Comparison with Random Trees

no code implementations · 6 Jul 2023 · Kumiko Tanaka-Ishii, Akira Tanaka

The Strahler number was originally proposed to characterize the complexity of river bifurcation and has found various applications.

Sentence
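
The Strahler number has a simple recursive definition on trees. The sketch below is a minimal Python illustration, assuming a nested-list tree encoding that is not from the paper.

```python
# Minimal sketch: Strahler number of an ordered tree encoded as nested
# lists of children (a leaf is []). This encoding is an illustrative
# assumption, not the paper's data format.
def strahler(children):
    if not children:
        return 1  # a leaf has Strahler number 1
    nums = [strahler(c) for c in children]
    top = max(nums)
    # The number grows only when two or more children tie at the maximum.
    return top + 1 if nums.count(top) >= 2 else top

# A small balanced example: ((a b) (c (d e))) -> 3
tree = [[[], []], [[], [[], []]]]
print(strahler(tree))
```

Chain-like trees keep a small Strahler number however deep they grow, which is what makes the measure an index of branching complexity rather than of size.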

Extraction of Templates from Phrases Using Sequence Binary Decision Diagrams

no code implementations · 28 Jan 2020 · Daiki Hirano, Kumiko Tanaka-Ishii, Andrew Finch

The extraction of templates such as "regard X as Y" from a set of related phrases requires the identification of their internal structures.
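
The paper builds sequence binary decision diagrams for this; purely as intuition, here is a much simpler hedged sketch that abstracts the differing spans of two phrases into variable slots. The phrases and the `template` helper are hypothetical examples, not the paper's method.

```python
# Rough illustration only -- not the paper's sequence-BDD construction.
# Shared token runs of two phrases are kept; differing runs become slots.
from difflib import SequenceMatcher

def template(a, b):
    out, slot = [], 0
    for tag, i1, i2, j1, j2 in SequenceMatcher(None, a, b).get_opcodes():
        if tag == "equal":
            out.extend(a[i1:i2])
        else:
            out.append(f"X{slot}")  # abstract the differing span into a slot
            slot += 1
    return out

p1 = "regard him as a friend".split()
p2 = "regard the plan as hopeless".split()
print(" ".join(template(p1, p2)))  # -> regard X0 as X1
```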

Evaluating Computational Language Models with Scaling Properties of Natural Language

no code implementations · CL 2019 · Shuntaro Takahashi, Kumiko Tanaka-Ishii

Statistical mechanical analyses have revealed that natural language text is characterized by scaling properties, which quantify the global structure in the vocabulary population and the long memory of a text.

Text Generation

Word Familiarity and Frequency

no code implementations · 9 Jun 2018 · Kumiko Tanaka-Ishii, Hiroshi Terada

Word frequency is assumed to correlate with word familiarity, but the strength of this correlation has not been thoroughly investigated.
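
As a sketch of what such an investigation measures, one can correlate log corpus frequency against familiarity ratings. Every number in the snippet below is an invented placeholder, not the paper's data.

```python
# Sketch of the measurement: Pearson correlation between log corpus
# frequency and familiarity ratings. All values are invented placeholders.
from math import log
from statistics import correlation  # Python 3.10+

freq = {"the": 69971, "time": 1598, "house": 591, "zebra": 7}
familiarity = {"the": 6.8, "time": 6.5, "house": 6.4, "zebra": 5.9}  # hypothetical 1-7 scale

words = sorted(freq)
x = [log(freq[w]) for w in words]
y = [familiarity[w] for w in words]
print(f"Pearson r (log frequency vs. familiarity) = {correlation(x, y):.3f}")
```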

Assessing Language Models with Scaling Properties

no code implementations · 24 Apr 2018 · Shuntaro Takahashi, Kumiko Tanaka-Ishii

Five such tests are considered, with the first two accounting for the vocabulary population and the other three for the long memory of natural language.

Taylor's law for Human Linguistic Sequences

1 code implementation · ACL 2018 · Tatsuru Kobayashi, Kumiko Tanaka-Ishii

Taylor's law describes the fluctuation characteristics of a system in which the variance of an event within a time span grows as a power law of the mean.

Time Series · Time Series Analysis
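
Concretely, the law takes the form σ ∝ μ^α over per-window word counts, with α = 0.5 for an i.i.d. process and larger values indicating burstiness. Below is a minimal estimation sketch; the window size and toy text are arbitrary choices made for illustration.

```python
# Sketch of a Taylor-exponent estimate: count each word in fixed-size
# windows, then fit the slope of log(std) against log(mean) by least
# squares. Window size and toy text are arbitrary choices.
from math import log

def taylor_exponent(tokens, window=20):
    windows = [tokens[i:i + window] for i in range(0, len(tokens) - window + 1, window)]
    pts = []
    for w in set(tokens):
        counts = [win.count(w) for win in windows]
        mu = sum(counts) / len(counts)
        var = sum((c - mu) ** 2 for c in counts) / len(counts)
        if mu > 0 and var > 0:
            pts.append((log(mu), 0.5 * log(var)))  # log sigma = 0.5 * log var
    mx = sum(x for x, _ in pts) / len(pts)
    my = sum(y for _, y in pts) / len(pts)
    return sum((x - mx) * (y - my) for x, y in pts) / sum((x - mx) ** 2 for x, _ in pts)

tokens = ("the cat sat on the mat and the dog lay on the rug " * 50).split()
print(f"estimated Taylor exponent: {taylor_exponent(tokens):.2f}")
```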

Long-Range Correlation Underlying Childhood Language and Generative Models

no code implementations · 11 Dec 2017 · Kumiko Tanaka-Ishii

Long-range correlation, a property of time series exhibiting long-term memory, is mainly studied in the statistical physics domain and has been reported to exist in natural language.

Time Series · Time Series Analysis
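
A common diagnostic, shown below in simplified form, is the autocorrelation function at increasing lags: long memory appears as a slow, power-law-like decay. The toy i.i.d. series here shows no correlation by construction, which is exactly the baseline natural language is reported to depart from.

```python
# Simplified diagnostic: the autocorrelation function of a numerical
# series at increasing lags. For an i.i.d. series (as below) it sits
# near zero; long memory would show a slow, power-law-like decay.
import random

def autocorrelation(series, lag):
    n = len(series) - lag
    mu = sum(series) / len(series)
    var = sum((x - mu) ** 2 for x in series) / len(series)
    cov = sum((series[i] - mu) * (series[i + lag] - mu) for i in range(n)) / n
    return cov / var

# Toy binary indicator series (a stand-in for, e.g., rare-word occurrences).
random.seed(0)
series = [1 if random.random() < 0.1 else 0 for _ in range(10000)]
for lag in (1, 10, 100):
    print(f"lag {lag:>3}: {autocorrelation(series, lag):+.4f}")
```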

Do Neural Nets Learn Statistical Laws behind Natural Language?

no code implementations · 16 Jul 2017 · Shuntaro Takahashi, Kumiko Tanaka-Ishii

Specifically, we demonstrate that a neural language model based on long short-term memory (LSTM) effectively reproduces Zipf's law and Heaps' law, two representative statistical properties underlying natural language.

Language Modelling
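
For reference, Zipf's law states that the frequency of the rank-r word scales as r^(−α) (α ≈ 1 in natural text), and Heaps' law that vocabulary size grows as n^β with β < 1; both are straight lines in log-log coordinates. Below is a rough measurement sketch, with corpus.txt as a placeholder path for any plain-text corpus.

```python
# Both laws are straight lines in log-log space; the slopes are the
# exponents. "corpus.txt" is a placeholder for any plain-text corpus.
from collections import Counter
from math import log

def slope(pts):
    mx = sum(x for x, _ in pts) / len(pts)
    my = sum(y for _, y in pts) / len(pts)
    return sum((x - mx) * (y - my) for x, y in pts) / sum((x - mx) ** 2 for x, _ in pts)

tokens = open("corpus.txt", encoding="utf-8").read().split()

# Zipf: log frequency vs. log rank.
freqs = sorted(Counter(tokens).values(), reverse=True)
zipf = [(log(r), log(f)) for r, f in enumerate(freqs, start=1)]

# Heaps: log vocabulary size vs. log number of tokens read so far.
seen, heaps = set(), []
for n, w in enumerate(tokens, start=1):
    seen.add(w)
    if n % 1000 == 0:
        heaps.append((log(n), log(len(seen))))

print(f"Zipf exponent  ~ {-slope(zipf):.2f}")   # ~1 for natural text
print(f"Heaps exponent ~ {slope(heaps):.2f}")   # < 1 for natural text
```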

Upper Bound of Entropy Rate Revisited: A New Extrapolation of Compressed Large-Scale Corpora

no code implementations · WS 2016 · Ryosuke Takahira, Kumiko Tanaka-Ishii, Łukasz Dębowski

The article presents entropy rate estimates for six human languages, obtained using large, state-of-the-art corpora of up to 7.8 gigabytes.
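
The approach can be sketched compactly: compress ever larger prefixes of a text, record the code length per character, and extrapolate the decreasing estimates to infinite length. The snippet below is only a simplified illustration; bz2 and corpus.txt are stand-ins for the paper's stronger compressors and corpora, and the paper fits a specific extrapolation function to such points.

```python
# Simplified version of the measurement: compress growing prefixes and
# record the code length per character. bz2 and "corpus.txt" are
# illustrative stand-ins only.
import bz2

def rate_bits_per_char(text):
    return 8 * len(bz2.compress(text.encode("utf-8"))) / len(text)

text = open("corpus.txt", encoding="utf-8").read()
for n in (10_000, 100_000, 1_000_000):
    if n <= len(text):
        print(f"n = {n:>9,}: {rate_bits_per_char(text[:n]):.3f} bits/char")
# The estimates fall as n grows; the entropy rate is their extrapolated limit.
```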
