Search Results for author: Paul Sheridan

Found 6 papers, 5 papers with code

Heaps' Law in GPT-Neo Large Language Model Emulated Corpora

1 code implementation10 Nov 2023 Uyen Lai, Gurjit S. Randhawa, Paul Sheridan

Heaps' law is an empirical relation in text analysis that predicts vocabulary growth as a function of corpus size.

Language Modelling Large Language Model

A statistical significance testing approach for measuring term burstiness with applications to domain-specific terminology extraction

1 code implementation24 Oct 2023 Samuel Sarria Hurtado, Todd Mullen, Taku Onodera, Paul Sheridan

However, the document frequency of a term (i. e., the proportion of documents within a corpus in which a specific term occurs) is exploited by certain other widely used term burstiness measures.

Language Modelling

The hypergeometric test performs comparably to TF-IDF on standard text analysis tasks

1 code implementation26 Feb 2020 Paul Sheridan, Mikael Onsjö

Term frequency-inverse document frequency, or TF-IDF for short, and its many variants form a class of term weighting functions the members of which are widely used in text analysis applications.

Information Retrieval Retrieval +1

An Ontology-Based Recommender System with an Application to the Star Trek Television Franchise

1 code implementation31 Jul 2018 Paul Sheridan, Mikael Onsjö, Claudia Becerra, Sergio Jimenez, George Dueñas

As a study case, we evaluated the proposed method against other approaches by performing the classical rating prediction task on a collection of Star Trek television series episodes in an item cold-start scenario.

Collaborative Filtering Recommendation Systems +1

PAFit: an R Package for the Non-Parametric Estimation of Preferential Attachment and Node Fitness in Temporal Complex Networks

1 code implementation20 Apr 2017 Thong Pham, Paul Sheridan, Hidetoshi Shimodaira

This paper introduces the R package PAFit, which implements non-parametric procedures for estimating the preferential attachment function and node fitnesses in a growing network, as well as a number of functions for generating complex networks from these two mechanisms.

Data Analysis, Statistics and Probability Social and Information Networks Physics and Society Computation

Cannot find the paper you are looking for? You can Submit a new open access paper.