Search Results for author: Robert West

Found 37 papers, 24 papers with code

Better than Average: Paired Evaluation of NLP systems

no code implementations ACL 2021 Maxime Peyrard, Wei Zhao, Steffen Eger, Robert West

Evaluation in NLP is usually done by comparing the scores of competing systems independently averaged over a common set of test instances.

Laughing Heads: Can Transformers Detect What Makes a Sentence Funny?

1 code implementation 19 May 2021 Maxime Peyrard, Beatriz Borges, Kristina Gligorić, Robert West

We make progress in both respects by training and analyzing transformer-based humor recognition models on a recently introduced dataset consisting of minimal pairs of aligned sentences, one serious, the other humorous.

Low-rank Subspaces for Unsupervised Entity Linking

1 code implementation 18 Apr 2021 Akhil Arora, Alberto Garcia-Duran, Robert West

Geometrically speaking, when representing entities as vectors via some given embedding, the gold entities tend to lie in a low-rank subspace of the full embedding space.

Entity Linking

Broccoli: Sprinkling Lightweight Vocabulary Learning into Everyday Information Diets

1 code implementation 16 Apr 2021 Roland Aydin, Lars Klein, Arnaud Miribel, Robert West

Thus, by seeing words in context, the user can assimilate new vocabulary without much conscious effort.

Language Acquisition

Are Anti-Feminist Communities Gateways to the Far Right? Evidence from Reddit and YouTube

no code implementations 25 Feb 2021 Robin Mamié, Manoel Horta Ribeiro, Robert West

Our results suggest that there is a large overlap between the user bases of the Alt-right and of the Manosphere and that members of the Manosphere have a bigger chance to engage with far right content than carefully chosen counterparts.

Computers and Society

Volunteer contributions to Wikipedia increased during COVID-19 mobility restrictions

1 code implementation 19 Feb 2021 Thorsten Ruprechter, Manoel Horta Ribeiro, Tiago Santos, Florian Lemmerich, Markus Strohmaier, Robert West, Denis Helic

Wikipedia, the largest encyclopedia ever created, is a global initiative driven by volunteer contributions.

Computers and Society

Formation of Social Ties Influences Food Choice: A Campus-Wide Longitudinal Study

no code implementations 17 Feb 2021 Kristina Gligorić, Ryen W. White, Emre Kiciman, Eric Horvitz, Arnaud Chiolero, Robert West

To estimate causal effects from the passively observed log data, we control confounds in a matched quasi-experimental design: we identify focal users who at first do not have any regular eating partners but then start eating with a fixed partner regularly, and we match focal users into comparison pairs such that paired users are nearly identical with respect to covariates measured before acquiring the partner, where the two focal users' new eating partners diverge in the healthiness of their respective food choice.

Experimental Design

YouNiverse: Large-Scale Channel and Video Metadata from English-Speaking YouTube

1 code implementation 18 Dec 2020 Manoel Horta Ribeiro, Robert West

YouTube plays a key role in entertaining and informing people around the globe.

Time Series, Social and Information Networks, Computers and Society

KLearn: Background Knowledge Inference from Summarization Data

1 code implementation Findings of the Association for Computational Linguistics 2020 Maxime Peyrard, Robert West

The goal of text summarization is to compress documents to the relevant information while excluding background information already known to the receiver.

Text Summarization

Crosslingual Topic Modeling with WikiPDA

1 code implementation 23 Sep 2020 Tiziano Piccardi, Robert West

We present Wikipedia-based Polyglot Dirichlet Allocation (WikiPDA), a crosslingual topic model that learns to represent Wikipedia articles written in any language as distributions over a common set of language-independent topics.

Matrix Completion

Adoption of Twitter's New Length Limit: Is 280 the New 140?

no code implementations 16 Sep 2020 Kristina Gligorić, Ashton Anderson, Robert West

The prevalence of tweets around 140 characters before the switch in a given language is strongly correlated with the prevalence of tweets around 280 characters after the switch in the same language, and very long tweets are vastly more popular on Web clients than on mobile clients.

Calibration of Google Trends Time Series

1 code implementation 27 Jul 2020 Robert West

In the offline preprocessing phase, an "anchor bank" is constructed, a set of queries spanning the full spectrum of popularity, all calibrated against a common reference query by carefully chaining together multiple Google Trends requests.

Time Series

A Ladder of Causal Distances

1 code implementation 5 May 2020 Maxime Peyrard, Robert West

Causal discovery, the task of automatically constructing a causal model from data, is of major significance across the sciences.

Causal Discovery

On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation

1 code implementation ACL 2020 Wei Zhao, Goran Glavaš, Maxime Peyrard, Yang Gao, Robert West, Steffen Eger

We systematically investigate a range of metrics based on state-of-the-art cross-lingual semantic representations obtained with pretrained M-BERT and LASER.

Language Modelling, Machine Translation +3

Behavior Cloning in OpenAI using Case Based Reasoning

no code implementations 23 Feb 2020 Chad Peters, Babak Esfandiari, Mohamad Zalat, Robert West

Learning from Observation (LfO), also known as Behavioral Cloning, is an approach for building software agents by recording the behavior of an expert (human or artificial) and using the recorded data to generate the required behavior.

OpenAI Gym

Learning High Order Feature Interactions with Fine Control Kernels

no code implementations 9 Feb 2020 Hristo Paskov, Alex Paskov, Robert West

We provide a methodology for learning sparse statistical models that use as features all possible multiplicative interactions among an underlying atomic set of features.

Sparse Learning

WikiHist.html: English Wikipedia's Full Revision History in HTML Format

1 code implementation 28 Jan 2020 Blagoj Mitrevski, Tiziano Piccardi, Robert West

Wikipedia is written in the wikitext markup language.

Computers and Society

Quantifying Engagement with Citations on Wikipedia

1 code implementation 23 Jan 2020 Tiziano Piccardi, Miriam Redi, Giovanni Colavizza, Robert West

Wikipedia, the free online encyclopedia that anyone can edit, is one of the most visited sites on the Web and a common source of information for many users.

Computers and Society

Robust Cross-lingual Embeddings from Parallel Sentences

2 code implementations 28 Dec 2019 Ali Sabet, Prakhar Gupta, Jean-Baptiste Cordonnier, Robert West, Martin Jaggi

Recent advances in cross-lingual word embeddings have primarily relied on mapping-based methods, which project pretrained word embeddings from different languages into a shared space through a linear transformation.

Cross-Lingual Document Classification, Document Classification +2

Deep Learning for Prostate Pathology

no code implementations 11 Oct 2019 Okyaz Eminaga, Yuri Tolkach, Christian Kunder, Mahmood Abbas, Ryan Han, Rosalie Nolley, Axel Semjonow, Martin Boegemann, Sebastian Huss, Andreas Loening, Robert West, Geoffrey Sonn, Richard Fan, Olaf Bettendorf, James Brook, Daniel Rubin

For case usage, these models were applied for the annotation tasks in clinician-oriented pathology reports for prostatectomy specimens.

Auditing Radicalization Pathways on YouTube

1 code implementation 22 Aug 2019 Manoel Horta Ribeiro, Raphael Ottoni, Robert West, Virgílio A. F. Almeida, Wagner Meira

Non-profits, as well as the media, have hypothesized the existence of a radicalization pipeline on YouTube, claiming that users systematically progress towards more extreme content on the platform.

Computers and Society, Social and Information Networks

Privacy-Preserving Classification with Secret Vector Machines

1 code implementation 8 Jul 2019 Valentin Hartmann, Konark Modi, Josep M. Pujol, Robert West

Second, we implement SecVM's distributed framework for the Cliqz web browser and deploy it for predicting user gender in a large-scale online evaluation with thousands of clients, outperforming baselines by a large margin and thus showcasing that SecVM is suitable for production environments.

Classification, Federated Learning +1

Privacy-Preserving Distributed Learning with Secret Gradient Descent

no code implementations 27 Jun 2019 Valentin Hartmann, Robert West

In many important application domains of machine learning, data is a privacy-sensitive resource.

Crosslingual Document Embedding as Reduced-Rank Ridge Regression

1 code implementation 8 Apr 2019 Martin Josifoski, Ivan S. Paskov, Hristo S. Paskov, Martin Jaggi, Robert West

Finally, although not trained for embedding sentences and words, it also achieves competitive performance on crosslingual sentence and word retrieval tasks.

Document Embedding, Document-level

Eliciting New Wikipedia Users' Interests via Automatically Mined Questionnaires: For a Warm Welcome, Not a Cold Start

1 code implementation 8 Apr 2019 Ramtin Yazdanian, Leila Zia, Jonathan Morgan, Bahodir Mansurov, Robert West

As such, these systems cannot make high-quality recommendations to newcomers without any previous interactions -- the so-called cold-start problem.

Recommendation Systems

Hot Streaks on Social Media

1 code implementation 5 Apr 2019 Kiran Garimella, Robert West

We show that user impact tends to have certain characteristics: First, impact is clustered in time, such that the most impactful tweets of a user appear close to each other.

Social and Information Networks

Expanding the Text Classification Toolbox with Cross-Lingual Embeddings

no code implementations 23 Mar 2019 Meryem M'hamdi, Robert West, Andreea Hossmann, Michael Baeriswyl, Claudiu Musat

In particular, we test the hypothesis that embeddings with context are more effective, by multi-tasking the learning of multilingual word embeddings and text classification; we explore neural architectures for CLTC; and we move from bi- to multi-lingual word embeddings.

Classification, General Classification +4

Reverse-Engineering Satire, or "Paper on Computational Humor Accepted Despite Making Serious Advances"

1 code implementation 10 Jan 2019 Robert West, Eric Horvitz

Starting from the observation that satirical news headlines tend to resemble serious news headlines, we build and analyze a corpus of satirical headlines paired with nearly identical but serious headlines.

Humor Detection

Measuring Societal Biases from Text Corpora with Smoothed First-Order Co-occurrence

no code implementations 13 Dec 2018 Navid Rekabsaz, Robert West, James Henderson, Allan Hanbury

The common approach to measuring such biases using a corpus is by calculating the similarities between the embedding vector of a word (like nurse) and the vectors of the representative words of the concepts of interest (such as genders).

Word Embeddings

Quootstrap: Scalable Unsupervised Extraction of Quotation-Speaker Pairs from Large News Corpora via Bootstrapping

1 code implementation 7 Apr 2018 Dario Pavllo, Tiziano Piccardi, Robert West

We propose Quootstrap, a method for extracting quotations, as well as the names of the speakers who uttered them, from large news corpora.

Growing Wikipedia Across Languages via Recommendation

2 code implementations 12 Apr 2016 Ellery Wulczyn, Robert West, Leila Zia, Jure Leskovec

The system involves identifying missing articles, ranking the missing articles according to their importance, and recommending important missing articles to editors based on their interests.

Social and Information Networks, Digital Libraries

Exploiting Social Network Structure for Person-to-Person Sentiment Analysis

no code implementations TACL 2014 Robert West, Hristo S. Paskov, Jure Leskovec, Christopher Potts

Person-to-person evaluations are prevalent in all kinds of discourse and important for establishing reputations, building social bonds, and shaping public opinion.

Decision Making, Sentiment Analysis
