TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Question Answering	ConditionalQA	ETC-Pipeline	Conditional (answers)	39.4 / 41.8	# 3
Question Answering	ConditionalQA	ETC-Pipeline	Conditional (w/ conditions)	2.5 / 3.4	# 3
Question Answering	ConditionalQA	ETC-Pipeline	Overall (answers)	35.6 / 39.8	# 3
Question Answering	ConditionalQA	ETC-Pipeline	Overall (w/ conditions)	26.9 / 30.8	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/etc-encoding-long-and-structured-data-in/question-answering-on-conditionalqa)](https://paperswithcode.com/sota/question-answering-on-conditionalqa?p=etc-encoding-long-and-structured-data-in)`

ETC: Encoding Long and Structured Inputs in Transformers

EMNLP 2020 · Joshua Ainslie, Santiago Ontanon, Chris Alberti, Vaclav Cvicek, Zachary Fisher, Philip Pham, Anirudh Ravula, Sumit Sanghai, Qifan Wang, Li Yang ·

Transformer models have advanced the state of the art in many Natural Language Processing (NLP) tasks. In this paper, we present a new Transformer architecture, Extended Transformer Construction (ETC), that addresses two key challenges of standard Transformer architectures, namely scaling input length and encoding structured inputs. To scale attention to longer inputs, we introduce a novel global-local attention mechanism between global tokens and regular input tokens. We also show that combining global-local attention with relative position encodings and a Contrastive Predictive Coding (CPC) pre-training objective allows ETC to encode structured inputs. We achieve state-of-the-art results on four natural language datasets requiring long and/or structured inputs.

PDF Abstract EMNLP 2020 PDF EMNLP 2020 Abstract

Code

Add Remove Mark official

google-research/google-research official

32,804

google-research/longt5

169

Tasks

Add Remove

Position

Question Answering

Datasets

Natural Questions

HotpotQA

WikiHop

ConditionalQA

Results from the Paper

Edit

Ranked #3 on Question Answering on ConditionalQA

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Question Answering	ConditionalQA	ETC-Pipeline	Conditional (answers)	39.4 / 41.8	# 3	Compare
			Conditional (w/ conditions)	2.5 / 3.4	# 3	Compare
			Overall (answers)	35.6 / 39.8	# 3	Compare
			Overall (w/ conditions)	26.9 / 30.8	# 3	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Contrastive Predictive Coding • Dense Connections • Dropout • ETC • Global-Local Attention • InfoNCE • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Relative Position Encodings • ReLU • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer

Edit Social Preview

ETC: Encoding Long and Structured Inputs in Transformers

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove