TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Sentence Fusion	DiscoFuse	LaserTaggerAR	Exact	53.8	# 1
Sentence Fusion	DiscoFuse	LaserTaggerAR	SARI	85.5	# 1
Split and Rephrase	WikiSplit	LaserTaggerAR	Exact	15.2	# 5
Split and Rephrase	WikiSplit	LaserTaggerAR	BLEU	76.3	# 7
Split and Rephrase	WikiSplit	LaserTaggerAR	SARI	61.7	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/encode-tag-realize-high-precision-text/sentence-fusion-on-discofuse)](https://paperswithcode.com/sota/sentence-fusion-on-discofuse?p=encode-tag-realize-high-precision-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/encode-tag-realize-high-precision-text/split-and-rephrase-on-wikisplit)](https://paperswithcode.com/sota/split-and-rephrase-on-wikisplit?p=encode-tag-realize-high-precision-text)`

Encode, Tag, Realize: High-Precision Text Editing

IJCNLP 2019 · Eric Malmi, Sebastian Krause, Sascha Rothe, Daniil Mirylenka, Aliaksei Severyn ·

We propose LaserTagger - a sequence tagging approach that casts text generation as a text editing task. Target texts are reconstructed from the inputs using three main edit operations: keeping a token, deleting it, and adding a phrase before the token. To predict the edit operations, we propose a novel model, which combines a BERT encoder with an autoregressive Transformer decoder. This approach is evaluated on English text on four tasks: sentence fusion, sentence splitting, abstractive summarization, and grammar correction. LaserTagger achieves new state-of-the-art results on three of these tasks, performs comparably to a set of strong seq2seq baselines with a large number of training examples, and outperforms them when the number of examples is limited. Furthermore, we show that at inference time tagging can be more than two orders of magnitude faster than comparable seq2seq models, making it more attractive for running in a live environment.

PDF Abstract IJCNLP 2019 PDF IJCNLP 2019 Abstract

Code

Add Remove Mark official

google-research/lasertagger official

602

Mleader2/text_scalpel

207

a414351664/my_git_laser

googlx/lasertagger

leshanbog/lasertagger

Tasks

Add Remove

Abstractive Text Summarization

Sentence

Sentence Fusion

Split and Rephrase

TAG

Text Generation

Vocal Bursts Intensity Prediction

Datasets

WikiSplit

DiscoFuse

Results from the Paper

Edit

Ranked #1 on Sentence Fusion on DiscoFuse

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Sentence Fusion	DiscoFuse	LaserTaggerAR	Exact	53.8	# 1	Compare
Sentence Fusion	DiscoFuse	LaserTaggerAR	SARI	85.5	# 1	Compare
Split and Rephrase	WikiSplit	LaserTaggerAR	Exact	15.2	# 5	Compare
			BLEU	76.3	# 7	Compare
			SARI	61.7	# 1	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • Attention Dropout • BERT • BPE • Dense Connections • Dropout • GELU • Label Smoothing • Layer Normalization • Linear Layer • Linear Warmup With Linear Decay • LSTM • Multi-Head Attention • Position-Wise Feed-Forward Layer • ReLU • Residual Connection • Scaled Dot-Product Attention • Seq2Seq • Sigmoid Activation • Softmax • Tanh Activation • Transformer • Weight Decay • WordPiece

Edit Social Preview

Encode, Tag, Realize: High-Precision Text Editing

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove