Linguistic Acceptability

47 papers with code • 5 benchmarks • 5 datasets

Linguistic Acceptability is the task of determining whether a sentence is grammatical or ungrammatical.

Image Source: Warstadt et al.
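
In practice, acceptability judgement is usually framed as binary sentence classification (as in the CoLA benchmark). The sketch below assumes the Hugging Face transformers library and a CoLA-style checkpoint; the model name is illustrative and may need to be replaced with an actual acceptability classifier available on the Hub.

```python
# Minimal sketch: linguistic acceptability as binary sentence classification.
# Assumes the Hugging Face `transformers` library; the checkpoint name below is
# illustrative, not an endorsement of a specific model.
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="textattack/bert-base-uncased-CoLA",  # illustrative CoLA-style checkpoint
)

sentences = [
    "The cat sat on the mat.",      # acceptable
    "The cat sat mat on the the.",  # unacceptable
]

for sentence in sentences:
    result = classifier(sentence)[0]
    print(f"{sentence!r} -> {result['label']} ({result['score']:.3f})")
```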

Most implemented papers

TinyBERT: Distilling BERT for Natural Language Understanding

huawei-noah/Pretrained-Language-Model • Findings of the Association for Computational Linguistics 2020

To accelerate inference and reduce model size while maintaining accuracy, we first propose a novel Transformer distillation method specially designed for knowledge distillation (KD) of Transformer-based models.
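
As a rough illustration of the idea (not the paper's full objective), Transformer distillation can combine hidden-state matching between student and teacher with soft-target prediction distillation. The sketch below assumes PyTorch and toy tensor shapes; TinyBERT additionally distills embeddings and attention maps.

```python
# Simplified sketch of Transformer-layer knowledge distillation: match student
# hidden states to teacher hidden states (through a learned projection when the
# widths differ) and match output distributions with a soft cross-entropy.
# This is an illustration, not the paper's exact objective.
import torch
import torch.nn.functional as F

def distillation_loss(student_hidden, teacher_hidden, student_logits, teacher_logits,
                      projection, temperature=1.0):
    # Hidden-state matching via MSE (the full method also matches attention maps).
    hidden_loss = F.mse_loss(projection(student_hidden), teacher_hidden)
    # Prediction-layer distillation: soft cross-entropy over temperature-scaled logits.
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_probs = F.log_softmax(student_logits / temperature, dim=-1)
    pred_loss = -(soft_targets * log_probs).sum(dim=-1).mean()
    return hidden_loss + pred_loss

# Toy shapes: batch 2, sequence length 8, student width 128, teacher width 256, 2 classes.
projection = torch.nn.Linear(128, 256)
loss = distillation_loss(
    student_hidden=torch.randn(2, 8, 128),
    teacher_hidden=torch.randn(2, 8, 256),
    student_logits=torch.randn(2, 2),
    teacher_logits=torch.randn(2, 2),
    projection=projection,
)
print(loss.item())
```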

SpanBERT: Improving Pre-training by Representing and Predicting Spans

facebookresearch/SpanBERT • TACL 2020

We present SpanBERT, a pre-training method that is designed to better represent and predict spans of text.
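
A minimal sketch of the contiguous span-masking idea is shown below, assuming whitespace-tokenized input; SpanBERT's actual recipe works on subwords, clips geometrically sampled span lengths, and adds a span-boundary objective that is omitted here.

```python
# Simplified sketch of contiguous span masking: repeatedly sample a span length
# from a (clipped) geometric distribution and mask that many consecutive tokens
# until a masking budget is reached.
import random

def sample_span_length(p=0.2, max_span=10):
    # Geometric distribution over span lengths, clipped at max_span.
    length = 1
    while random.random() > p and length < max_span:
        length += 1
    return length

def mask_spans(tokens, mask_token="[MASK]", mask_ratio=0.15):
    tokens = list(tokens)
    budget = max(1, int(len(tokens) * mask_ratio))
    masked = set()
    while len(masked) < budget:
        length = sample_span_length()
        start = random.randrange(0, max(1, len(tokens) - length + 1))
        for i in range(start, min(start + length, len(tokens))):
            masked.add(i)
    return [mask_token if i in masked else t for i, t in enumerate(tokens)]

print(mask_spans("the quick brown fox jumps over the lazy dog".split()))
```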

Masked Language Model Scoring

awslabs/mlm-scoring • ACL 2020

We evaluate MLMs out of the box via their pseudo-log-likelihood scores (PLLs), which are computed by masking tokens one by one.
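
The PLL computation itself is straightforward to sketch: mask each position in turn and sum the log-probability the masked LM assigns to the original token. The example below assumes the Hugging Face transformers library and bert-base-uncased; the awslabs/mlm-scoring package provides an optimized implementation of the same idea.

```python
# Sketch of pseudo-log-likelihood (PLL) scoring with a masked language model.
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased").eval()

def pseudo_log_likelihood(sentence):
    ids = tokenizer(sentence, return_tensors="pt")["input_ids"][0]
    total = 0.0
    # Skip [CLS] and [SEP]; mask one position at a time.
    for pos in range(1, len(ids) - 1):
        masked = ids.clone()
        masked[pos] = tokenizer.mask_token_id
        with torch.no_grad():
            logits = model(masked.unsqueeze(0)).logits[0, pos]
        log_probs = torch.log_softmax(logits, dim=-1)
        total += log_probs[ids[pos]].item()
    return total

# Higher (less negative) PLL roughly tracks acceptability.
print(pseudo_log_likelihood("The cat sat on the mat."))
print(pseudo_log_likelihood("The cat sat mat on the the."))
```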

SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization

namisan/mt-dnn • ACL 2020

Due to the limited data of downstream tasks and the extremely large capacity of pre-trained models, aggressive fine-tuning often causes the adapted model to overfit the downstream data and forget the knowledge of the pre-trained model.
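
SMART counters this with smoothness-inducing regularization; a minimal sketch of that idea, assuming PyTorch and a toy classifier, is shown below (the paper's adversarial inner loop and Bregman proximal point optimization are omitted).

```python
# Simplified sketch of smoothness-inducing regularization: perturb the input
# embeddings slightly and penalize the divergence between predictions on the
# clean and perturbed inputs, so fine-tuning stays smooth around each example.
import torch
import torch.nn.functional as F

def smoothness_penalty(model_fn, embeddings, epsilon=1e-3):
    # model_fn maps embeddings -> logits (e.g. a classifier head over BERT).
    noise = epsilon * torch.randn_like(embeddings)
    clean_logits = model_fn(embeddings)
    noisy_logits = model_fn(embeddings + noise)
    p = F.log_softmax(clean_logits, dim=-1)
    q = F.log_softmax(noisy_logits, dim=-1)
    # Symmetric KL between the two predictive distributions.
    return (F.kl_div(q, p, reduction="batchmean", log_target=True)
            + F.kl_div(p, q, reduction="batchmean", log_target=True))

# Toy usage: a linear "model" over 16-dim embeddings with 2 classes.
linear = torch.nn.Linear(16, 2)
penalty = smoothness_penalty(lambda e: linear(e), torch.randn(4, 16))
print(penalty.item())
```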

Q8BERT: Quantized 8Bit BERT

NervanaSystems/nlp-architect • 14 Oct 2019

Recently, pre-trained Transformer-based language models such as BERT and GPT have shown great improvements on many Natural Language Processing (NLP) tasks.
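
The basic operation behind Q8BERT-style quantization-aware training is symmetric linear 8-bit quantization of weights and activations. The sketch below, assuming PyTorch, shows only the quantize-dequantize step, not the straight-through-estimator training loop the paper uses during fine-tuning.

```python
# Sketch of symmetric linear 8-bit quantization of a weight tensor.
import torch

def quantize_dequantize_int8(weights):
    # Scale so the largest magnitude maps to 127, round to int8 levels, map back.
    scale = weights.abs().max() / 127.0
    q = torch.clamp(torch.round(weights / scale), -127, 127)
    return q * scale

w = torch.randn(4, 4)
w_q = quantize_dequantize_int8(w)
print((w - w_q).abs().max().item())  # per-element error stays within ~scale/2
```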

How to Train BERT with an Academic Budget

peteriz/academic-budget-bert • EMNLP 2021

While large language models à la BERT are used ubiquitously in NLP, pretraining them is considered a luxury that only a few well-funded industry labs can afford.

ERNIE 2.0: A Continual Pre-training Framework for Language Understanding

PaddlePaddle/ERNIE • 29 Jul 2019

Recently, pre-trained models have achieved state-of-the-art results in various language understanding tasks, which indicates that pre-training on large-scale corpora may play a crucial role in natural language processing.

GeDi: Generative Discriminator Guided Sequence Generation

salesforce/GeDi • Findings (EMNLP) 2021

While large-scale language models (LMs) are able to imitate the distribution of natural language well enough to generate realistic text, it is difficult to control which regions of the distribution they generate.
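
GeDi exerts this control by using a class-conditional LM as a generative discriminator. The sketch below is a heavily simplified, single-step illustration of the Bayes-rule reweighting idea, assuming PyTorch; the omega parameter, uniform class prior, and per-step (rather than prefix-accumulated) posterior are illustrative simplifications, not the paper's exact formulation.

```python
# Simplified sketch of discriminator-guided decoding: next-token distributions
# from a class-conditional LM for a desired and an undesired class are turned
# into a per-token posterior via Bayes' rule, which then reweights the base
# LM's next-token logits toward the desired class.
import torch

def guided_next_token_logits(base_logits, desired_logits, undesired_logits, omega=10.0):
    # Per-token posterior P(desired | token) under a uniform class prior.
    p_desired = torch.softmax(desired_logits, dim=-1)
    p_undesired = torch.softmax(undesired_logits, dim=-1)
    posterior = p_desired / (p_desired + p_undesired + 1e-10)
    # Bias the base distribution toward tokens attributed to the desired class.
    return base_logits + omega * torch.log(posterior + 1e-10)

vocab_size = 50
guided = guided_next_token_logits(
    torch.randn(vocab_size), torch.randn(vocab_size), torch.randn(vocab_size)
)
print(torch.argmax(guided).item())  # index of the reweighted top token
```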