Search Results for author: Gregory Yauney

Found 8 papers, 5 papers with code

Stronger Random Baselines for In-Context Learning

2 code implementations19 Apr 2024 Gregory Yauney, David Mimno

Evaluating the in-context learning classification performance of language models poses challenges due to small dataset sizes, extensive prompt-selection using the validation set, and intentionally difficult tasks that lead to near-random performance.

In-Context Learning

The Afterlives of Shakespeare and Company in Online Social Readership

1 code implementation14 Jan 2024 Maria Antoniak, David Mimno, Rosamond Thalken, Melanie Walsh, Matthew Wilkens, Gregory Yauney

Meanwhile, the digitization of the lending library records of Shakespeare and Company provides a window into the reading activity of an earlier, smaller community in interwar Paris.

Data Similarity is Not Enough to Explain Language Model Performance

1 code implementation15 Nov 2023 Gregory Yauney, Emily Reif, David Mimno

Large language models achieve high performance on many but not all downstream tasks.

Language Modelling

A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity

no code implementations22 May 2023 Shayne Longpre, Gregory Yauney, Emily Reif, Katherine Lee, Adam Roberts, Barret Zoph, Denny Zhou, Jason Wei, Kevin Robinson, David Mimno, Daphne Ippolito

Second, we explore the effect of quality and toxicity filters, showing a trade-off between performance on standard benchmarks and risk of toxic generations.

Comparing Text Representations: A Theory-Driven Approach

1 code implementation EMNLP 2021 Gregory Yauney, David Mimno

Much of the progress in contemporary NLP has come from learning representations, such as masked language model (MLM) contextual embeddings, that turn challenging problems into simple classification tasks.

Language Modelling Learning Theory +1

Domain-Specific Lexical Grounding in Noisy Visual-Textual Documents

1 code implementation EMNLP 2020 Gregory Yauney, Jack Hessel, David Mimno

Images can give us insights into the contextual meanings of words, but current image-text grounding approaches require detailed annotations.

Clustering object-detection +2

Computational Histological Staining and Destaining of Prostate Core Biopsy RGB Images with Generative Adversarial Neural Networks

no code implementations26 Oct 2018 Aman Rana, Gregory Yauney, Alarice Lowe, Pratik Shah

The staining model uses a conditional generative adversarial network that learns hierarchical non-linear mappings between whole slide RGB image (WSRI) pairs of prostate core biopsy before and after H&E staining.

Generative Adversarial Network

Automated Process Incorporating Machine Learning Segmentation and Correlation of Oral Diseases with Systemic Health

no code implementations25 Oct 2018 Gregory Yauney, Aman Rana, Lawrence C. Wong, Perikumar Javia, Ali Muftu, Pratik Shah

We report an automated process that combines intraoral fluorescent porphyrin biomarker imaging, clinical examinations and machine learning for correlation of systemic health conditions with periodontal disease.

BIG-bench Machine Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.