Search Results for author: Shay B. Cohen

Found 97 papers, 49 papers with code

Split and Rephrase

2 code implementations • EMNLP 2017 • Shashi Narayan, Claire Gardent, Shay B. Cohen, Anastasia Shimorina

We propose a new sentence simplification task (Split-and-Rephrase) where the aim is to split a complex sentence into a meaning preserving sequence of shorter sentences.

Machine Translation Sentence +2

974

Paper
Code

Stock Movement Prediction from Tweets and Historical Prices

1 code implementation • ACL 2018 • Yumo Xu, Shay B. Cohen

Stock movement prediction is a challenging problem: the market is highly stochastic, and we make temporally-dependent predictions from chaotic data.

Ranked #2 on Stock Market Prediction on stocknet (using extra training data)

Feature Engineering Time Series Analysis +1

530

Paper
Code

Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization

3 code implementations • EMNLP 2018 • Shashi Narayan, Shay B. Cohen, Mirella Lapata

We introduce extreme summarization, a new single-document summarization task which does not favor extractive strategies and calls for an abstractive modeling approach.

Ranked #9 on Text Summarization on X-Sum

Document Summarization Extreme Summarization +1

339

Paper
Code

What is this Article about? Extreme Summarization with Topic-aware Convolutional Neural Networks

1 code implementation • 19 Jul 2019 • Shashi Narayan, Shay B. Cohen, Mirella Lapata

We introduce 'extreme summarization', a new single-document summarization task which aims at creating a short, one-sentence news summary answering the question ``What is the article about?''.

Document Summarization Extreme Summarization +1

339

Paper
Code

Ranking Sentences for Extractive Summarization with Reinforcement Learning

1 code implementation • NAACL 2018 • Shashi Narayan, Shay B. Cohen, Mirella Lapata

In this paper we conceptualize extractive summarization as a sentence ranking task and propose a novel training algorithm which globally optimizes the ROUGE evaluation metric through a reinforcement learning objective.

Ranked #13 on Extractive Text Summarization on CNN / Daily Mail

Document Summarization Extractive Summarization +4

271

Paper
Code

Neural Extractive Summarization with Side Information

1 code implementation • 14 Apr 2017 • Shashi Narayan, Nikos Papasarantopoulos, Shay B. Cohen, Mirella Lapata

Most extractive summarization methods focus on the main body of the document from which sentences need to be extracted.

Document Summarization Extractive Summarization +2

Paper
Code

Whodunnit? Crime Drama as a Case for Natural Language Understanding

1 code implementation • TACL 2018 • Lea Frermann, Shay B. Cohen, Mirella Lapata

In this paper we argue that crime drama exemplified in television programs such as CSI:Crime Scene Investigation is an ideal testbed for approximating real-world natural language understanding and the complex inferences associated with it.

Natural Language Understanding

Paper
Code

An Incremental Parser for Abstract Meaning Representation

4 code implementations • EACL 2017 • Marco Damonte, Shay B. Cohen, Giorgio Satta

We describe a transition-based parser for AMR that parses sentences left-to-right, in linear time.

Ranked #5 on AMR Parsing on LDC2015E86

AMR Parsing Named Entity Recognition +3

Paper
Code

Discourse Representation Structure Parsing

1 code implementation • ACL 2018 • Jiangming Liu, Shay B. Cohen, Mirella Lapata

We introduce an open-domain neural semantic parser which generates formal meaning representations in the style of Discourse Representation Theory (DRT; Kamp and Reyle 1993).

Question Answering Semantic Parsing

Paper
Code

Multilingual Clustering of Streaming News

2 code implementations • EMNLP 2018 • Sebastião Miranda, Artūrs Znotiņš, Shay B. Cohen, Guntis Barzdins

Clustering news across languages enables efficient media monitoring by aggregating articles from multilingual sources into coherent stories.

Clustering

Paper
Code

Structural Neural Encoders for AMR-to-text Generation

2 code implementations • NAACL 2019 • Marco Damonte, Shay B. Cohen

AMR-to-text generation is a problem recently introduced to the NLP community, in which the goal is to generate sentences from Abstract Meaning Representation (AMR) graphs.

Ranked #2 on Graph-to-Sequence on LDC2015E86:

AMR-to-Text Generation Graph-to-Sequence +1

Paper
Code

The Larger They Are, the Harder They Fail: Language Models do not Recognize Identifier Swaps in Python

1 code implementation • 24 May 2023 • Antonio Valerio Miceli-Barone, Fazl Barez, Ioannis Konstas, Shay B. Cohen

Large Language Models (LLMs) have successfully been applied to code generation tasks, raising the question of how well these models understand programming.

Code Generation

Paper
Code

Cross-lingual Abstract Meaning Representation Parsing

1 code implementation • NAACL 2018 • Marco Damonte, Shay B. Cohen

Abstract Meaning Representation (AMR) annotation efforts have mostly focused on English.

Paper
Code

Jointly Extracting and Compressing Documents with Summary State Representations

1 code implementation • NAACL 2019 • Afonso Mendes, Shashi Narayan, Sebastião Miranda, Zita Marinho, André F. T. Martins, Shay B. Cohen

We present a new neural model for text summarization that first extracts sentences from a document and then compresses them.

Extractive Summarization Text Summarization

Paper
Code

A Root of a Problem: Optimizing Single-Root Dependency Parsing

1 code implementation • EMNLP 2021 • Miloš Stanojević, Shay B. Cohen

We describe two approaches to single-root dependency parsing that yield significant speed ups in such parsing.

Dependency Parsing

Paper
Code

Privacy-preserving Neural Representations of Text

1 code implementation • EMNLP 2018 • Maximin Coavoux, Shashi Narayan, Shay B. Cohen

This article deals with adversarial attacks towards deep learning systems for Natural Language Processing (NLP), in the context of privacy protection.

Privacy Preserving

Paper
Code

Compositional Languages Emerge in a Neural Iterated Learning Model

1 code implementation • ICLR 2020 • Yi Ren, Shangmin Guo, Matthieu Labeau, Shay B. Cohen, Simon Kirby

The principle of compositionality, which enables natural language to represent complex concepts via a structured combination of simpler ones, allows us to convey an open-ended set of messages using a limited vocabulary.

Paper
Code

Abstractive Summarization Guided by Latent Hierarchical Document Structure

1 code implementation • 17 Nov 2022 • Yifu Qiu, Shay B. Cohen

Sequential abstractive neural summarizers often do not use the underlying structure in the input article or dependencies between the input sentences.

Abstractive Text Summarization Sentence

Paper
Code

Large Language Models Relearn Removed Concepts

1 code implementation • 3 Jan 2024 • Michelle Lo, Shay B. Cohen, Fazl Barez

This demonstrates that models exhibit polysemantic capacities and can blend old and new concepts in individual neurons.

Model Editing

Paper
Code

Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents

1 code implementation • 25 May 2022 • Marcio Fonseca, Yftah Ziser, Shay B. Cohen

We argue that disentangling content selection from the budget used to cover salient content improves the performance and applicability of abstractive summarizers.

Ranked #1 on Text Summarization on GovReport

Abstractive Text Summarization Disentanglement +2

Paper
Code

Latent-Variable Synchronous CFGs for Hierarchical Translation

1 code implementation • EMNLP 2014 • Avneesh Saluja, Chris Dyer, Shay B. Cohen

Translation

Paper
Code

Lightweight, Dynamic Graph Convolutional Networks for AMR-to-Text Generation

1 code implementation • EMNLP 2020 • Yan Zhang, Zhijiang Guo, Zhiyang Teng, Wei Lu, Shay B. Cohen, Zuozhu Liu, Lidong Bing

With the help of these strategies, we are able to train a model with fewer parameters while maintaining the model capacity.

AMR-to-Text Generation Text Generation

Paper
Code

LeanReasoner: Boosting Complex Logical Reasoning with Lean

1 code implementation • 20 Mar 2024 • Dongwei Jiang, Marcio Fonseca, Shay B. Cohen

Large language models (LLMs) often struggle with complex logical reasoning due to logical inconsistencies and the inherent difficulty of such reasoning.

Automated Theorem Proving Logical Reasoning

Paper
Code

Machine Reading of Historical Events

1 code implementation • ACL 2020 • Or Honovich, Lucas Torroba Hennigen, Omri Abend, Shay B. Cohen

Machine reading is an ambitious goal in NLP that subsumes a wide range of text understanding capabilities.

Information Retrieval Reading Comprehension +1

Paper
Code

A Closer Look into the Robustness of Neural Dependency Parsers Using Better Adversarial Examples

1 code implementation • Findings (ACL) 2021 • Yuxuan Wang, Wanxiang Che, Ivan Titov, Shay B. Cohen, Zhilin Lei, Ting Liu

Paper
Code

Document Modeling with External Attention for Sentence Extraction

1 code implementation • ACL 2018 • Shashi Narayan, Ronald Cardenas, Nikos Papasarantopoulos, Shay B. Cohen, Mirella Lapata, Jiangsheng Yu, Yi Chang

Document modeling is essential to a variety of natural language understanding tasks.

Answer Selection Document Summarization +8

Paper
Code

Learning Typed Entailment Graphs with Global Soft Constraints

1 code implementation • TACL 2018 • Mohammad Javad Hosseini, Nathanael Chambers, Siva Reddy, Xavier R. Holt, Shay B. Cohen, Mark Johnson, Mark Steedman

We instead propose a scalable method that learns globally consistent similarity scores based on new soft constraints that consider both the structures across typed entailment graphs and inside each graph.

Graph Learning

Paper
Code

Detecting and Mitigating Hallucinations in Multilingual Summarisation

1 code implementation • 23 May 2023 • Yifu Qiu, Yftah Ziser, Anna Korhonen, Edoardo M. Ponti, Shay B. Cohen

With the existing faithful metrics focusing on English, even measuring the extent of this phenomenon in cross-lingual settings is hard.

Cross-Lingual Transfer

Paper
Code

Are Large Language Models Temporally Grounded?

1 code implementation • 14 Nov 2023 • Yifu Qiu, Zheng Zhao, Yftah Ziser, Anna Korhonen, Edoardo M. Ponti, Shay B. Cohen

Instead, we provide LLMs with textual narratives and probe them with respect to their common-sense knowledge of the structure and duration of events, their ability to order events along a timeline, and self-consistency within their temporal model (e. g., temporal relations such as after and before are mutually exclusive for any pair of events).

Common Sense Reasoning In-Context Learning +2

Paper
Code

Duality of Link Prediction and Entailment Graph Induction

1 code implementation • ACL 2019 • Mohammad Javad Hosseini, Shay B. Cohen, Mark Johnson, Mark Steedman

The new entailment score outperforms prior state-of-the-art results on a standard entialment dataset and the new link prediction scores show improvements over the raw link prediction scores.

Link Prediction

Paper
Code

Co-training an Unsupervised Constituency Parser with Weak Supervision

1 code implementation • Findings (ACL) 2022 • Nickil Maveli, Shay B. Cohen

We introduce a method for unsupervised parsing that relies on bootstrapping classifiers to identify if a node dominates a specific span in a sentence.

Ranked #5 on Constituency Grammar Induction on PTB Diagnostic ECG Database (using extra training data)

Constituency Grammar Induction Inductive Bias +1

Paper
Code

Open-Domain Contextual Link Prediction and its Complementarity with Entailment Graphs

1 code implementation • Findings (EMNLP) 2021 • Mohammad Javad Hosseini, Shay B. Cohen, Mark Johnson, Mark Steedman

In this paper, we introduce the new task of open-domain contextual link prediction which has access to both the textual context and the KG structure to perform link prediction.

Link Prediction

Paper
Code

Gold Doesn't Always Glitter: Spectral Removal of Linear and Nonlinear Guarded Attribute Information

1 code implementation • 15 Mar 2022 • Shun Shao, Yftah Ziser, Shay B. Cohen

We describe a simple and effective method (Spectral Attribute removaL; SAL) to remove private or guarded information from neural representations.

Attribute

Paper
Code

Obfuscation for Privacy-preserving Syntactic Parsing

1 code implementation • WS 2020 • Zhifeng Hu, Serhii Havrylov, Ivan Titov, Shay B. Cohen

We introduce an idea for a privacy-preserving transformation on natural language data, inspired by homomorphic encryption.

Privacy Preserving Sentence

Paper
Code

Learning to Match Mathematical Statements with Proofs

2 code implementations • 3 Feb 2021 • Maximin Coavoux, Shay B. Cohen

The task is designed to improve the processing of research-level mathematical texts.

Automated Theorem Proving Information Retrieval +1

Paper
Code

BERT is not The Count: Learning to Match Mathematical Statements with Proofs

1 code implementation • 18 Feb 2023 • Weixian Waylon Li, Yftah Ziser, Maximin Coavoux, Shay B. Cohen

While the first decoding method matches a proof to a statement without being aware of other statements or proofs, the second method treats the task as a global matching problem.

Information Retrieval Retrieval

Paper
Code

Knowledge Base Question Answering for Space Debris Queries

1 code implementation • 31 May 2023 • Paul Darm, Antonio Valerio Miceli-Barone, Shay B. Cohen, Annalisa Riccardi

In this work we present a system, developed for the European Space Agency (ESA), that can answer complex natural language queries, to support engineers in accessing the information contained in a KB that models the orbital space debris environment.

Knowledge Base Question Answering Natural Language Queries

Paper
Code

The Role of Reentrancies in Abstract Meaning Representation Parsing

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Ida Szubert, Marco Damonte, Shay B. Cohen, Mark Steedman

Abstract Meaning Representation (AMR) parsing aims at converting sentences into AMR representations.

AMR Parsing

Paper
Code

Causal Explanations for Sequential Decision-Making in Multi-Agent Systems

1 code implementation • 21 Feb 2023 • Balint Gyevnar, Cheng Wang, Christopher G. Lucas, Shay B. Cohen, Stefano V. Albrecht

We present CEMA: Causal Explanations in Multi-Agent systems; a framework for creating causal natural language explanations of an agent's decisions in dynamic sequential multi-agent systems to build more trustworthy autonomous agents.

Autonomous Driving counterfactual +2

Paper
Code

PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in India

1 code implementation • 15 May 2023 • Ashok Urlana, Pinzhen Chen, Zheng Zhao, Shay B. Cohen, Manish Shrivastava, Barry Haddow

This paper introduces PMIndiaSum, a multilingual and massively parallel summarization corpus focused on languages in India.

Cross-Lingual Abstractive Summarization Multilingual NLP +1

Paper
Code

Sentence-Incremental Neural Coreference Resolution

1 code implementation • 26 May 2023 • Matt Grenander, Shay B. Cohen, Mark Steedman

We propose a sentence-incremental neural coreference resolution system which incrementally builds clusters after marking mention boundaries in a shift-reduce method.

coreference-resolution Sentence

Paper
Code

A Joint Matrix Factorization Analysis of Multilingual Representations

1 code implementation • 24 Oct 2023 • Zheng Zhao, Yftah Ziser, Bonnie Webber, Shay B. Cohen

Using this tool, we study to what extent and how morphosyntactic features are reflected in the representations learned by multilingual pre-trained models.

Paper
Code

Unlexicalized Transition-based Discontinuous Constituency Parsing

1 code implementation • TACL 2019 • Maximin Coavoux, Benoît Crabbé, Shay B. Cohen

Lexicalized parsing models are based on the assumptions that (i) constituents are organized around a lexical head (ii) bilexical statistics are crucial to solve ambiguities.

Constituency Parsing

Paper
Code

Discontinuous Constituency Parsing with a Stack-Free Transition System and a Dynamic Oracle

1 code implementation • NAACL 2019 • Maximin Coavoux, Shay B. Cohen

We introduce a novel transition system for discontinuous constituency parsing.

Constituency Parsing Sentence

Paper
Code

Semantic Role Labeling with Iterative Structure Refinement

1 code implementation • IJCNLP 2019 • Chunchuan Lyu, Shay B. Cohen, Ivan Titov

Modern state-of-the-art Semantic Role Labeling (SRL) methods rely on expressive sentence encoders (e. g., multi-layer LSTMs) but tend to model only local (if any) interactions between individual argument labeling decisions.

Semantic Role Labeling Sentence

Paper
Code

On the Trade-off between Redundancy and Local Coherence in Summarization

1 code implementation • 20 May 2022 • Ronald Cardenas, Matthias Galle, Shay B. Cohen

Extractive summarization systems are known to produce poorly coherent and, if not accounted for, highly redundant text.

Extractive Summarization Reading Comprehension +1

Paper
Code

AMR Parsing is Far from Solved: GrAPES, the Granular AMR Parsing Evaluation Suite

1 code implementation • 6 Dec 2023 • Jonas Groschwitz, Shay B. Cohen, Lucia Donatelli, Meaghan Fowlie

We present the Granular AMR Parsing Evaluation Suite (GrAPES), a challenge set for Abstract Meaning Representation (AMR) parsing with accompanying evaluation metrics.

AMR Parsing Sentence

Paper
Code

Canonical Correlation Inference for Mapping Abstract Scenes to Text

no code implementations • 9 Aug 2016 • Nikos Papasarantopoulos, Helen Jiang, Shay B. Cohen

We describe a technique for structured prediction, based on canonical correlation analysis.

Structured Prediction

Paper
Add Code

Paraphrase Generation from Latent-Variable PCFGs for Semantic Parsing

no code implementations • WS 2016 • Shashi Narayan, Siva Reddy, Shay B. Cohen

One of the limitations of semantic parsing approaches to open-domain question answering is the lexicosyntactic gap between natural language questions and knowledge base entries -- there are many ways to ask a question, all with the same answer.

Open-Domain Question Answering Paraphrase Generation +2

Paper
Add Code

Encoding Prior Knowledge with Eigenword Embeddings

no code implementations • TACL 2016 • Dominique Osborne, Shashi Narayan, Shay B. Cohen

Canonical correlation analysis (CCA) is a method for reducing the dimension of data represented using two views.

Word Embeddings

Paper
Add Code

Optimizing Spectral Learning for Parsing

no code implementations • ACL 2016 • Shashi Narayan, Shay B. Cohen

We describe a search algorithm for optimizing the number of latent states when estimating latent-variable PCFGs with spectral methods.

Paper
Add Code

Parsing Linear Context-Free Rewriting Systems with Fast Matrix Multiplication

no code implementations • CL 2016 • Shay B. Cohen, Daniel Gildea

Our result provides another proof for the best known result for parsing mildly context sensitive formalisms such as combinatory categorial grammars, head grammars, linear indexed grammars, and tree adjoining grammars, which can be parsed in time $O(n^{4. 76})$.

Paper
Add Code

Low-Rank Approximation of Weighted Tree Automata

no code implementations • 4 Nov 2015 • Guillaume Rabusseau, Borja Balle, Shay B. Cohen

We describe a technique to minimize weighted tree automata (WTA), a powerful formalisms that subsumes probabilistic context-free grammars (PCFGs) and latent-variable PCFGs.

Paper
Add Code

Diversity in Spectral Learning for Natural Language Parsing

no code implementations • EMNLP 2015 • Shashi Narayan, Shay B. Cohen

We describe an approach to create a diverse set of predictions with spectral learning of latent-variable PCFGs (L-PCFGs).

Paper
Add Code

The Visualization of Change in Word Meaning over Time using Temporal Word Embeddings

no code implementations • 18 Oct 2014 • Chiraag Lala, Shay B. Cohen

We describe a visualization tool that can be used to view the change in meaning of words over time.

Word Embeddings

Paper
Add Code

The SUMMA Platform Prototype

no code implementations • EACL 2017 • Renars Liepins, Ulrich Germann, Guntis Barzdins, Alex Birch, ra, Steve Renals, Susanne Weber, Peggy van der Kreeft, Herv{\'e} Bourlard, Jo{\~a}o Prieto, Ond{\v{r}}ej Klejch, Peter Bell, Alex Lazaridis, ros, Alfonso Mendes, Sebastian Riedel, Mariana S. C. Almeida, Pedro Balage, Shay B. Cohen, Tomasz Dwojak, Philip N. Garner, Andreas Giefer, Marcin Junczys-Dowmunt, Hina Imran, David Nogueira, Ahmed Ali, Mir, Sebasti{\~a}o a, Andrei Popescu-Belis, Lesly Miculicich Werlen, Nikos Papasarantopoulos, Abiola Obamuyide, Clive Jones, Fahim Dalvi, Andreas Vlachos, Yang Wang, Sibo Tong, Rico Sennrich, Nikolaos Pappas, Shashi Narayan, Marco Damonte, Nadir Durrani, Sameer Khurana, Ahmed Abdelali, Hassan Sajjad, Stephan Vogel, David Sheppey, Chris Hernon, Jeff Mitchell

We present the first prototype of the SUMMA Platform: an integrated platform for multilingual media monitoring.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

Abstract Meaning Representation for Paraphrase Detection

no code implementations • NAACL 2018 • Fuad Issa, Marco Damonte, Shay B. Cohen, Xiaohui Yan, Yi Chang

Abstract Meaning Representation (AMR) parsing aims at abstracting away from the syntactic realization of a sentence, and denote only its meaning in a canonical form.

AMR Parsing Sentence

Paper
Add Code

Semi-Supervised Learning of Sequence Models with Method of Moments

no code implementations • EMNLP 2016 • Zita Marinho, Andr{\'e} F. T. Martins, Shay B. Cohen, Noah A. Smith

Part-Of-Speech Tagging

Paper
Add Code

Local String Transduction as Sequence Labeling

no code implementations • COLING 2018 • Joana Ribeiro, Shashi Narayan, Shay B. Cohen, Xavier Carreras

We show that the general problem of string transduction can be reduced to the problem of sequence labeling.

Lemmatization Machine Translation +6

Paper
Add Code

Tensor Decomposition for Fast Parsing with Latent-Variable PCFGs

no code implementations • NeurIPS 2012 • Michael Collins, Shay B. Cohen

We describe an approach to speed-up inference with latent variable PCFGs, which have been shown to be highly effective for natural language parsing.

Tensor Decomposition

Paper
Add Code

Empirical Risk Minimization with Approximations of Probabilistic Grammars

no code implementations • NeurIPS 2010 • Noah A. Smith, Shay B. Cohen

Probabilistic grammars are generative statistical models that are useful for compositional and sequential structures.

Paper
Add Code

Empirical Risk Minimization for Probabilistic Grammars: Sample Complexity and Hardness of Learning

no code implementations • CL 2012 • Shay B. Cohen, Noah A. Smith

Machine Translation Question Answering

Paper
Add Code

Online Adaptor Grammars with Hybrid Inference

no code implementations • TACL 2014 • Ke Zhai, Jordan Boyd-Graber, Shay B. Cohen

Adaptor grammars are a flexible, powerful formalism for defining nonparametric, unsupervised models of grammar productions.

Topic Models Variational Inference

Paper
Add Code

Lexical Inference over Multi-Word Predicates: A Distributional Approach

no code implementations • ACL 2014 • Omri Abend, Shay B. Cohen, Mark Steedman

Paper
Add Code

A Provably Correct Learning Algorithm for Latent-Variable PCFGs

no code implementations • ACL 2014 • Shay B. Cohen, Michael Collins

Language Modelling Topic Models

Paper
Add Code

Spectral Unsupervised Parsing with Additive Tree Metrics

no code implementations • ACL 2014 • Ankur P. Parikh, Shay B. Cohen, Eric P. Xing

Language Acquisition Matrix Completion

Paper
Add Code

The effect of non-tightness on Bayesian estimation of PCFGs

no code implementations • ACL 2013 • Mark Johnson, Shay B. Cohen

Paper
Add Code

Spectral Learning of Latent-Variable PCFGs

no code implementations • ACL 2012 • Shay B. Cohen, Karl Stratos, Michael Collins, Dean P. Foster, Lyle Ungar

Paper
Add Code

Lexical Event Ordering with an Edge-Factored Model

no code implementations • HLT 2015 • Mark Steedman, Omri Abend, Shay B. Cohen

Natural Language Inference Sentence +3

Paper
Add Code

Experiments with Spectral Learning of Latent-Variable PCFGs

no code implementations • NAACL 2013 • Shay B. Cohen, Karl Stratos, Michael Collins, Dean P. Foster, Lyle Ungar

Dependency Parsing

Paper
Add Code

Approximate PCFG Parsing Using Tensor Decomposition

no code implementations • NAACL 2013 • Shay B. Cohen, Giorgio Satta, Michael Collins

Tensor Decomposition

Paper
Add Code

Conversation Trees: A Grammar Model for Topic Structure in Forums

no code implementations • EMNLP 2015 • Annie Louis, Shay B. Cohen

Paper
Add Code

A Coactive Learning View of Online Structured Prediction in Statistical Machine Translation

no code implementations • CONLL 2015 • Artem Sokolov, Stefan Riezler, Shay B. Cohen

Machine Translation Structured Prediction +1

Paper
Add Code

Spectral Learning of Refinement HMMs

no code implementations • WS 2013 • Karl Stratos, Alex Rush, er, Shay B. Cohen, Michael Collins

Paper
Add Code

Wide-Coverage Neural A* Parsing for Minimalist Grammars

no code implementations • ACL 2019 • John Torr, Milos Stanojevic, Mark Steedman, Shay B. Cohen

Minimalist Grammars (Stabler, 1997) are a computationally oriented, and rigorous formalisation of many aspects of Chomsky{'}s (1995) Minimalist Program.

Sentence

Paper
Add Code

Discourse Representation Parsing for Sentences and Documents

no code implementations • ACL 2019 • Jiangming Liu, Shay B. Cohen, Mirella Lapata

We introduce a novel semantic parsing task based on Discourse Representation Theory (DRT; Kamp and Reyle 1993).

Semantic Parsing Sentence

Paper
Add Code

Partners in Crime: Multi-view Sequential Inference for Movie Understanding

no code implementations • IJCNLP 2019 • Nikos Papasarantopoulos, Lea Frermann, Mirella Lapata, Shay B. Cohen

Multi-view learning algorithms are powerful representation learning tools, often exploited in the context of multimodal problems.

MULTI-VIEW LEARNING Representation Learning

Paper
Add Code

Experimenting with Power Divergences for Language Modeling

no code implementations • IJCNLP 2019 • Matthieu Labeau, Shay B. Cohen

In this paper, we experiment with several families (alpha, beta and gamma) of power divergences, generalized from the KL divergence, for learning language models with an objective different than standard MLE.

Language Modelling

Paper
Add Code

Discourse Representation Structure Parsing with Recurrent Neural Networks and the Transformer Model

no code implementations • WS 2019 • Jiangming Liu, Shay B. Cohen, Mirella Lapata

Our best system achieves a score of 84. 8{\%} F1 in the DRS parsing shared task.

Ranked #2 on DRS Parsing on PMB-2.2.0

DRS Parsing Machine Translation +1

Paper
Add Code

Bottom-Up Unranked Tree-to-Graph Transducers for Translation into Semantic Graphs

no code implementations • WS 2019 • Johanna Bj{\"o}rklund, Shay B. Cohen, Frank Drewes, Giorgio Satta

We propose a formal model for translating unranked syntactic trees, such as dependency trees, into semantic graphs.

Semantic Parsing Translation

Paper
Add Code

Multi-Step Inference for Reasoning Over Paragraphs

no code implementations • EMNLP 2020 • Jiangming Liu, Matt Gardner, Shay B. Cohen, Mirella Lapata

Complex reasoning over text requires understanding and chaining together free-form predicates and logical connectives.

Logical Reasoning

Paper
Add Code

Learning Dialog Policies from Weak Demonstrations

no code implementations • ACL 2020 • Gabriel Gordon-Hall, Philip John Gorinski, Shay B. Cohen

Deep reinforcement learning is a promising approach to training a dialog manager, but current methods struggle with the large state and action spaces of multi-domain dialog systems.

Atari Games Q-Learning +2

Paper
Add Code

Dscorer: A Fast Evaluation Metric for Discourse Representation Structure Parsing

no code implementations • ACL 2020 • Jiangming Liu, Shay B. Cohen, Mirella Lapata

Discourse representation structures (DRSs) are scoped semantic representations for texts of arbitrary length.

Paper
Add Code

Nonparametric Learning of Two-Layer ReLU Residual Units

1 code implementation • 17 Aug 2020 • Zhunxuan Wang, Linyun He, Chunchuan Lyu, Shay B. Cohen

We describe an algorithm that learns two-layer residual units using rectified linear unit (ReLU) activation: suppose the input $\mathbf{x}$ is from a distribution with support space $\mathbb{R}^d$ and the ground-truth generative model is a residual unit of this type, given by $\mathbf{y} = \boldsymbol{B}^\ast\left[\left(\boldsymbol{A}^\ast\mathbf{x}\right)^+ + \mathbf{x}\right]$, where ground-truth network parameters $\boldsymbol{A}^\ast \in \mathbb{R}^{d\times d}$ represent a full-rank matrix with nonnegative entries and $\boldsymbol{B}^\ast \in \mathbb{R}^{m\times d}$ is full-rank with $m \geq d$ and for $\boldsymbol{c} \in \mathbb{R}^d$, $[\boldsymbol{c}^{+}]_i = \max\{0, c_i\}$.

Vocal Bursts Valence Prediction

Paper
Code

Reducing Quantity Hallucinations in Abstractive Summarization

no code implementations • Findings of the Association for Computational Linguistics 2020 • Zheng Zhao, Shay B. Cohen, Bonnie Webber

It is well-known that abstractive summaries are subject to hallucination---including material that is not supported by the original text.

Abstractive Text Summarization Hallucination

Paper
Add Code

A Differentiable Relaxation of Graph Segmentation and Alignment for AMR Parsing

no code implementations • EMNLP 2021 • Chunchuan Lyu, Shay B. Cohen, Ivan Titov

In contrast, we treat both alignment and segmentation as latent variables in our model and induce them as part of end-to-end training.

Ranked #22 on AMR Parsing on LDC2017T10

AMR Parsing Segmentation +1

Paper
Add Code

Narration Generation for Cartoon Videos

no code implementations • 17 Jan 2021 • Nikos Papasarantopoulos, Shay B. Cohen

Research on text generation from multimodal inputs has largely focused on static images, and less on video data.

Text Generation

Paper
Add Code

Unsupervised Extractive Summarization by Human Memory Simulation

no code implementations • 16 Apr 2021 • Ronald Cardenas, Matthias Galle, Shay B. Cohen

We introduce a wide range of heuristics that leverage cognitive representations of content units and how these are retained or forgotten in human memory.

Extractive Summarization Unsupervised Extractive Summarization

Paper
Add Code

Text Generation from Discourse Representation Structures

no code implementations • NAACL 2021 • Jiangming Liu, Shay B. Cohen, Mirella Lapata

We propose neural models to generate text from formal meaning representations based on Discourse Representation Structures (DRSs).

Text Generation

Paper
Add Code

English-to-Chinese Transliteration with Phonetic Auxiliary Task

no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Yuan He, Shay B. Cohen

Approaching named entities transliteration as a Neural Machine Translation (NMT) problem is common practice.

Multi-Task Learning NMT +2

Paper
Add Code

Universal Discourse Representation Structure Parsing

no code implementations • CL (ACL) 2021 • Jiangming Liu, Shay B. Cohen, Mirella Lapata, Johan Bos

Abstract We consider the task of crosslingual semantic parsing in the style of Discourse Representation Theory (DRT) where knowledge from annotated corpora in a resource-rich language is transferred via bitext to guide learning in other languages.

Semantic Parsing

Paper
Add Code

Understanding Domain Learning in Language Models Through Subpopulation Analysis

1 code implementation • 22 Oct 2022 • Zheng Zhao, Yftah Ziser, Shay B. Cohen

We investigate how different domains are encoded in modern neural network architectures.

Language Modelling

Paper
Code

Can Large Language Models Follow Concept Annotation Guidelines? A Case Study on Scientific and Financial Domains

no code implementations • 15 Nov 2023 • Marcio Fonseca, Shay B. Cohen

Although large language models (LLMs) exhibit remarkable capacity to leverage in-context demonstrations, it is still unclear to what extent they can learn new concepts or facts from ground-truth labels.

counterfactual Sentence +1

Paper
Add Code

Think While You Write: Hypothesis Verification Promotes Faithful Knowledge-to-Text Generation

no code implementations • 16 Nov 2023 • Yifu Qiu, Varun Embar, Shay B. Cohen, Benjamin Han

Knowledge-to-text generators often struggle to faithfully generate descriptions for the input facts: they may produce hallucinations that contradict the input, or describe facts not present in the input.

Natural Language Inference Text Generation

Paper
Add Code

Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals?

no code implementations • 18 Jan 2024 • Marcio Fonseca, Shay B. Cohen

Also, we show that we can improve the controllability of LLMs with keyword-based classifier-free guidance (CFG) while achieving lexical overlap comparable to strong fine-tuned baselines on arXiv and PubMed.

Language Modelling Large Language Model +1

Paper
Add Code

`Keep it Together': Enforcing Cohesion in Extractive Summaries by Simulating Human Memory

no code implementations • 16 Feb 2024 • Ronald Cardenas, Matthias Galle, Shay B. Cohen

Extractive summaries are usually presented as lists of sentences with no expected cohesion between them.

Informativeness Sentence

Paper
Add Code

Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions

no code implementations • 23 Feb 2024 • Clement Neo, Shay B. Cohen, Fazl Barez

In this paper, we investigate the interplay between attention heads and specialized "next-token" neurons in the Multilayer Perceptron that predict specific tokens.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.