Search Results for author: Christopher M. Danforth

Found 37 papers, 10 papers with code

The Resume Paradox: Greater Language Differences, Smaller Pay Gaps

no code implementations17 Jul 2023 Joshua R. Minot, Marc Maier, Bradford Demarest, Nicholas Cheney, Christopher M. Danforth, Peter Sheridan Dodds, Morgan R. Frank

This suggests that females' resumes that are semantically similar to males' resumes may have greater wage parity.

A blind spot for large language models: Supradiegetic linguistic information

no code implementations11 Jun 2023 Julia Witte Zimmerman, Denis Hudon, Kathryn Cramer, Jonathan St. Onge, Mikaela Fudolig, Milo Z. Trujillo, Christopher M. Danforth, Peter Sheridan Dodds

We propose that considering what it is like to be an LLM like ChatGPT, as Nagel might have put it, can help us gain insight into its capabilities in general, and in particular, that its exposure to linguistic training data can be productively reframed as exposure to the diegetic information encoded in language, and its deficits can be reframed as ignorance of extradiegetic information, including supradiegetic linguistic information.

An assessment of measuring local levels of homelessness through proxy social media signals

no code implementations15 May 2023 Yoshi Meke Bird, Sarah E. Grobe, Michael V. Arnold, Sean P. Rogers, Mikaela I. Fudolig, Julia Witte Zimmerman, Christopher M. Danforth, Peter Sheridan Dodds

An increase to the log-odds of ``homeless'' appearing in an English-language tweet, as well as an acceleration in the increase in average tweet sentiment, suggest that tweets about homelessness are also affected by trends at the nation-scale.

Curating corpora with classifiers: A case study of clean energy sentiment online

no code implementations4 May 2023 Michael V. Arnold, Peter Sheridan Dodds, Christopher M. Danforth

Both of these drawbacks could be overcome with a real-time, high volume data stream and fast analysis pipeline.

Binary Classification

A decomposition of book structure through ousiometric fluctuations in cumulative word-time

no code implementations19 Aug 2022 Mikaela Irene Fudolig, Thayer Alshaabi, Kathryn Cramer, Christopher M. Danforth, Peter Sheridan Dodds

Our findings suggest that, in the ousiometric sense, longer books are not expanded versions of shorter books, but are more similar in structure to a concatenation of shorter texts.

Denoising Time Series +1

Sentiment and structure in word co-occurrence networks on Twitter

no code implementations1 Oct 2021 Mikaela Irene Fudolig, Thayer Alshaabi, Michael V. Arnold, Christopher M. Danforth, Peter Sheridan Dodds

We explore the relationship between context and happiness scores in political tweets using word co-occurrence networks, where nodes in the network are the words, and the weight of an edge is the number of tweets in the corpus for which the two connected words co-occur.

Community Detection

Sirius: Visualization of Mixed Features as a Mutual Information Network Graph

1 code implementation9 Jun 2021 Jane L. Adams, Todd F. Deluca, Christopher M. Danforth, Peter S. Dodds, Yuhang Zheng, Konstantinos Anastasakis, Boyoon Choi, Allison Min, Michael M. Bessey

Data scientists across disciplines are increasingly in need of exploratory analysis tools for data sets with a high volume of features of mixed data type (quantitative continuous and discrete categorical).

Dimensionality Reduction feature selection +1

Quantifying language changes surrounding mental health on Twitter

no code implementations2 Jun 2021 Anne Marie Stupinski, Thayer Alshaabi, Michael V. Arnold, Jane Lydia Adams, Joshua R. Minot, Matthew Price, Peter Sheridan Dodds, Christopher M. Danforth

Mental health challenges are thought to afflict around 10% of the global population each year, with many going untreated due to stigma and limited access to services.

The incel lexicon: Deciphering the emergent cryptolect of a global misogynistic community

no code implementations25 May 2021 Kelly Gothard, David Rushing Dewhurst, Joshua R. Minot, Jane Lydia Adams, Christopher M. Danforth, Peter Sheridan Dodds

Evolving out of a gender-neutral framing of an involuntary celibate identity, the concept of `incels' has come to refer to an online community of men who bear antipathy towards themselves, women, and society-at-large for their perceived inability to find and maintain sexual relationships.

Generalized Word Shift Graphs: A Method for Visualizing and Explaining Pairwise Comparisons Between Texts

3 code implementations5 Aug 2020 Ryan J. Gallagher, Morgan R. Frank, Lewis Mitchell, Aaron J. Schwartz, Andrew J. Reagan, Christopher M. Danforth, Peter Sheridan Dodds

A common task in computational text analyses is to quantify how two corpora differ according to a measurement like word frequency, sentiment, or information content.

The sociospatial factors of death: Analyzing effects of geospatially-distributed variables in a Bayesian mortality model for Hong Kong

1 code implementation15 Jun 2020 Thayer Alshaabi, David Rushing Dewhurst, James P. Bagrow, Peter Sheridan Dodds, Christopher M. Danforth

However, the extent to which mortality in a geographical region is a function of socioeconomic factors in both that region and its neighbors is unclear.

Physics and Society Social and Information Networks Applications

Hahahahaha, Duuuuude, Yeeessss!: A two-parameter characterization of stretchable words and the dynamics of mistypings and misspellings

no code implementations9 Jul 2019 Tyler J. Gray, Christopher M. Danforth, Peter Sheridan Dodds

Stretched words like `heellllp' or `heyyyyy' are a regular feature of spoken language, often used to emphasize or exaggerate the underlying meaning of the root word.

The shocklet transform: A decomposition method for the identification of local, mechanism-driven dynamics in sociotechnical time series

2 code implementations27 Jun 2019 David Rushing Dewhurst, Thayer Alshaabi, Dilan Kiley, Michael V. Arnold, Joshua R. Minot, Christopher M. Danforth, Peter Sheridan Dodds

We introduce a qualitative, shape-based, timescale-independent time-domain transform used to extract local dynamics from sociotechnical time series---termed the Discrete Shocklet Transform (DST)---and an associated similarity search routine, the Shocklet Transform And Ranking (STAR) algorithm, that indicates time windows during which panels of time series display qualitatively-similar anomalous behavior.

Physics and Society Data Structures and Algorithms Signal Processing Data Analysis, Statistics and Probability

Scaling of inefficiencies in the U.S. equity markets: Evidence from three market indices and more than 2900 securities

no code implementations13 Feb 2019 John H. Ring IV, Colin M. Van Oort, David R. Dewhurst, Tyler J. Gray, Christopher M. Danforth, Brian F. Tivnan

Using the most comprehensive, commercially-available dataset of trading activity in U. S. equity markets, we catalog and analyze quote dislocations between the SIP National Best Bid and Offer (NBBO) and a synthetic BBO constructed from direct feeds.

Fragmentation and inefficiencies in US equity markets: Evidence from the Dow 30

no code implementations13 Feb 2019 Brian F. Tivnan, David Rushing Dewhurst, Colin M. Van Oort, John H. Ring IV, Tyler J. Gray, Brendan F. Tivnan, Matthew T. K. Koehler, Matthew T. McMahon, David Slater, Jason Veneman, Christopher M. Danforth

Using the most comprehensive source of commercially available data on the US National Market System, we analyze all quotes and trades associated with Dow 30 stocks in 2016 from the vantage point of a single and fixed frame of reference.

A Sentiment Analysis of Breast Cancer Treatment Experiences and Healthcare Perceptions Across Twitter

no code implementations25 May 2018 Eric M. Clark, Ted James, Chris A. Jones, Amulya Alapati, Promise Ukandu, Christopher M. Danforth, Peter Sheridan Dodds

Conclusions: Social media can provide a positive outlet for patients to discuss their needs and concerns regarding their healthcare coverage and treatment needs.

Decision Making Sentiment Analysis

English verb regularization in books and tweets

no code implementations26 Mar 2018 Tyler J. Gray, Andrew J. Reagan, Peter Sheridan Dodds, Christopher M. Danforth

We find that the extent of verb regularization is greater on Twitter, taken as a whole, than in English Fiction books.

Translation

Forecasting the onset and course of mental illness with Twitter data

1 code implementation27 Aug 2016 Andrew G. Reece, Andrew J. Reagan, Katharina L. M. Lix, Peter Sheridan Dodds, Christopher M. Danforth, Ellen J. Langer

Twitter data and details of depression history were collected from 204 individuals (105 depressed, 99 healthy).

Physics and Society Social and Information Networks

Instagram photos reveal predictive markers of depression

no code implementations10 Aug 2016 Andrew G. Reece, Christopher M. Danforth

Statistical features were computationally extracted from 43, 950 participant Instagram photos, using color analysis, metadata components, and algorithmic face detection.

Social and Information Networks Physics and Society

The emotional arcs of stories are dominated by six basic shapes

2 code implementations24 Jun 2016 Andrew J. Reagan, Lewis Mitchell, Dilan Kiley, Christopher M. Danforth, Peter Sheridan Dodds

Advances in computing power, natural language processing, and digitization of text now make it possible to study a culture's evolution through its texts using a "big data" lens.

Divergent discourse between protests and counter-protests: #BlackLivesMatter and #AllLivesMatter

no code implementations22 Jun 2016 Ryan J. Gallagher, Andrew J. Reagan, Christopher M. Danforth, Peter Sheridan Dodds

Since the shooting of Black teenager Michael Brown by White police officer Darren Wilson in Ferguson, Missouri, the protest hashtag #BlackLivesMatter has amplified critiques of extrajudicial killings of Black Americans.

What we write about when we write about causality: Features of causal statements across large-scale social discourse

no code implementations20 Apr 2016 Thomas C. McAndrew, Joshua C. Bongard, Christopher M. Danforth, Peter S. Dodds, Paul D. H. Hines, James P. Bagrow

Identifying and communicating relationships between causes and effects is important for understanding our world, but is affected by language structure, cognitive and emotional biases, and the properties of the communication medium.

Sentiment Analysis Topic Models

Benchmarking sentiment analysis methods for large-scale texts: A case for using continuum-scored words and word shift graphs

2 code implementations2 Dec 2015 Andrew J. Reagan, Brian Tivnan, Jake Ryland Williams, Christopher M. Danforth, Peter Sheridan Dodds

The emergence and global adoption of social media has rendered possible the real-time estimation of population-scale sentiment, bearing profound implications for our understanding of human behavior.

Benchmarking Sentiment Analysis

Nonlinear functional mapping of the human brain

no code implementations8 Sep 2015 Nicholas Allgaier, Tobias Banaschewski, Gareth Barker, Arun L. W. Bokde, Josh C. Bongard, Uli Bromberg, Christian Büchel, Anna Cattrell, Patricia J. Conrod, Christopher M. Danforth, Sylvane Desrivières, Peter S. Dodds, Herta Flor, Vincent Frouin, Jürgen Gallinat, Penny Gowland, Andreas Heinz, Bernd Ittermann, Scott Mackey, Jean-Luc Martinot, Kevin Murphy, Frauke Nees, Dimitri Papadopoulos-Orfanos, Luise Poustka, Michael N. Smolka, Henrik Walter, Robert Whelan, Gunter Schumann, Hugh Garavan, IMAGEN Consortium

In the present study, we introduce just such a method, called nonlinear functional mapping (NFM), and demonstrate its application in the analysis of resting state fMRI from a 242-subject subset of the IMAGEN project, a European study of adolescents that includes longitudinal phenotypic, behavioral, genetic, and neuroimaging data.

Sifting Robotic from Organic Text: A Natural Language Approach for Detecting Automation on Twitter

no code implementations17 May 2015 Eric M. Clark, Jake Ryland Williams, Chris A. Jones, Richard A. Galbraith, Christopher M. Danforth, Peter Sheridan Dodds

Twitter, a popular social media outlet, has evolved into a vast source of linguistic data, rich with opinion, sentiment, and discussion.

Is language evolution grinding to a halt? The scaling of lexical turbulence in English fiction suggests it is not

no code implementations11 Mar 2015 Eitan Adam Pechenick, Christopher M. Danforth, Peter Sheridan Dodds

Of basic interest is the quantification of the long term growth of a language's lexicon as it develops to more completely cover both a culture's communication requirements and knowledge space.

Identifying missing dictionary entries with frequency-conserving context models

no code implementations7 Mar 2015 Jake Ryland Williams, Eric M. Clark, James P. Bagrow, Christopher M. Danforth, Peter Sheridan Dodds

With our predictions we then engage the editorial community of the Wiktionary and propose short lists of potential missing entries for definition, developing a breakthrough, lexical extraction technique, and expanding our knowledge of the defined English lexicon of phrases.

Characterizing the Google Books corpus: Strong limits to inferences of socio-cultural and linguistic evolution

no code implementations5 Jan 2015 Eitan Adam Pechenick, Christopher M. Danforth, Peter Sheridan Dodds

However, the Google Books corpus suffers from a number of limitations which make it an obscure mask of cultural popularity.

Zipf's law holds for phrases, not words

no code implementations19 Jun 2014 Jake Ryland Williams, Paul R. Lessard, Suma Desu, Eric Clark, James P. Bagrow, Christopher M. Danforth, Peter Sheridan Dodds

With Zipf's law being originally and most famously observed for word frequency, it is surprisingly limited in its applicability to human language, holding over no more than three to four orders of magnitude before hitting a clear break in scaling.

Human language reveals a universal positivity bias

no code implementations15 Jun 2014 Peter Sheridan Dodds, Eric M. Clark, Suma Desu, Morgan R. Frank, Andrew J. Reagan, Jake Ryland Williams, Lewis Mitchell, Kameron Decker Harris, Isabel M. Kloumann, James P. Bagrow, Karine Megerdoomian, Matthew T. McMahon, Brian F. Tivnan, Christopher M. Danforth

Using human evaluation of 100, 000 words spread across 24 corpora in 10 languages diverse in origin and culture, we present evidence of a deep imprint of human sociality in language, observing that (1) the words of natural human language possess a universal positivity bias; (2) the estimated emotional content of words is consistent between languages under translation; and (3) this positivity bias is strongly independent of frequency of word usage.

Cultural Vocal Bursts Intensity Prediction Translation

Cannot find the paper you are looking for? You can Submit a new open access paper.