1 code implementation • 10 Nov 2023 • Uyen Lai, Gurjit S. Randhawa, Paul Sheridan
Heaps' law is an empirical relation in text analysis that predicts vocabulary growth as a function of corpus size.
1 code implementation • 24 Oct 2023 • Samuel Sarria Hurtado, Todd Mullen, Taku Onodera, Paul Sheridan
However, the document frequency of a term (i. e., the proportion of documents within a corpus in which a specific term occurs) is exploited by certain other widely used term burstiness measures.
1 code implementation • 26 Feb 2020 • Paul Sheridan, Mikael Onsjö
Term frequency-inverse document frequency, or TF-IDF for short, and its many variants form a class of term weighting functions the members of which are widely used in text analysis applications.
no code implementations • 1 May 2019 • Paul Sheridan, Mikael Onsjö, Janna Hastings
Literary theme identification and interpretation is a focal point of literary studies scholarship.
1 code implementation • 31 Jul 2018 • Paul Sheridan, Mikael Onsjö, Claudia Becerra, Sergio Jimenez, George Dueñas
As a study case, we evaluated the proposed method against other approaches by performing the classical rating prediction task on a collection of Star Trek television series episodes in an item cold-start scenario.
1 code implementation • 20 Apr 2017 • Thong Pham, Paul Sheridan, Hidetoshi Shimodaira
This paper introduces the R package PAFit, which implements non-parametric procedures for estimating the preferential attachment function and node fitnesses in a growing network, as well as a number of functions for generating complex networks from these two mechanisms.
Data Analysis, Statistics and Probability Social and Information Networks Physics and Society Computation