Coherence Evaluation

13 papers with code • 2 benchmarks • 1 dataset

Coherence evaluation assesses the overall coherence of a text, as measured by its readability and the flow of its ideas.

Benchmarks

These leaderboards are used to track progress in Coherence Evaluation.

Dataset	Metric	Best Model
GCDC + RST	Accuracy	MTL with Transformer
GCDC + RST	F1	RST-Ensemble

Datasets

GCDC

Most implemented papers

Transformer Models for Text Coherence Assessment

tushar117/Transformer-Models-for-Text-Coherence-Assessment • 5 Sep 2021

Coherence is an important aspect of text quality and is crucial for ensuring its readability.

Discourse Coherence in the Wild: A Dataset, Evaluation and Methods

aylai/GCDC-corpus • WS 2018

To date there has been very little work on assessing discourse coherence methods on real-world data.

Neural RST-based Evaluation of Discourse Coherence

grig-guz/coherence-rst • Asian Chapter of the Association for Computational Linguistics 2020

We evaluate our approach on the Grammarly Corpus for Discourse Coherence (GCDC) and show that when ensembled with the current state of the art, we can achieve the new state of the art accuracy on this benchmark.

Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models

AnneBeyer/coherencegym • NAACL 2021

Coherent discourse is distinguished from a mere collection of utterances by the satisfaction of a diverse set of constraints, for example choice of expression, logical relation between denoted events, and implicit compatibility with world-knowledge.

Towards Quantifiable Dialogue Coherence Evaluation

James-Yip/QuantiDCE • ACL 2021

To address these limitations, we propose Quantifiable Dialogue Coherence Evaluation (QuantiDCE), a novel framework aiming to train a quantifiable dialogue coherence metric that can reflect the actual human rating standards.

DEAM: Dialogue Coherence Evaluation using AMR-based Semantic Manipulations

pluslabnlp/deam • ACL 2022

We also show that DEAM can distinguish between coherent and incoherent dialogues generated by baseline manipulations, whereas those baseline models cannot detect incoherent examples generated by DEAM.

SNaC: Coherence Error Detection for Narrative Summarization

tagoyal/snac • 19 May 2022

In this work, we introduce SNaC, a narrative coherence evaluation framework rooted in fine-grained annotations for long summaries.

GisPy: A Tool for Measuring Gist Inference Score in Text

phosseini/GisPy • NAACL (WNU) 2022

Decision making theories such as Fuzzy-Trace Theory (FTT) suggest that individuals tend to rely on gist, or bottom-line meaning, in the text when making decisions.

Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image Captioning

jingqiangchen/concaps • 4 Feb 2023

Experiments also show that the generated captions are more coherent than those of baselines according to caption entity scores, caption Rouge scores, the two proposed coherence evaluation metrics, and human evaluations.

The Problem of Coherence in Natural Language Explanations of Recommendations

jmraczynski/cer • 18 Dec 2023

Providing natural language explanations for recommendations is particularly useful from the perspective of a non-expert user.

