Linguistic Acceptability
47 papers with code • 5 benchmarks • 5 datasets
Linguistic Acceptability is the task of determining whether a sentence is grammatical or ungrammatical, i.e. whether a native speaker would judge it well-formed. For example, "The cat sat on the mat." is acceptable, while "The cat sat the on mat." is not; a minimal classification sketch is given below.
Image Source: Warstadt et al.
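As a sketch of how such judgments are produced in practice, a CoLA-finetuned classifier can be queried through the Hugging Face transformers pipeline. The checkpoint name below is an assumption; any model fine-tuned on CoLA would serve:

```python
# Minimal sketch: score sentences with a CoLA-finetuned classifier.
# The checkpoint name is an assumption; substitute any model fine-tuned
# on CoLA (by convention, label 1 = acceptable, label 0 = unacceptable).
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="textattack/bert-base-uncased-CoLA",  # assumed checkpoint
)

sentences = [
    "The cat sat on the mat.",   # expected: acceptable
    "The cat sat the on mat.",   # expected: unacceptable
]
for sentence, result in zip(sentences, classifier(sentences)):
    print(f"{sentence!r} -> {result['label']} ({result['score']:.3f})")
```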
Latest papers
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
We develop a procedure for Int8 matrix multiplication for the feed-forward and attention projection layers in transformers, which cuts the memory needed for inference in half while retaining full-precision performance.
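This technique is exposed through the bitsandbytes integration in the transformers library; a sketch under that assumption (the model id is only an example, and a CUDA GPU plus the bitsandbytes and accelerate packages are required):

```python
# Sketch: load a transformer with LLM.int8() weight quantization via the
# bitsandbytes integration in transformers. The model id is only an example.
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "facebook/opt-1.3b"  # example checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",  # places the int8 weights on available GPUs
)

inputs = tokenizer("Linguistic acceptability is", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0]))
```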
Acceptability Judgements via Examining the Topology of Attention Maps
The role of the attention mechanism in encoding linguistic knowledge has received special interest in NLP.
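The paper builds topological features (persistent homology) on top of attention maps; the sketch below covers only the first step, extracting the per-layer, per-head attention matrices, and leaves the topological featurization out:

```python
# Sketch: extract attention maps from a BERT-style model. The topological
# featurization the paper computes on these maps is not reproduced here.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased", output_attentions=True)

inputs = tokenizer("The cat sat on the mat.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# outputs.attentions: one (batch, heads, seq, seq) tensor per layer
for layer, attn in enumerate(outputs.attentions):
    print(f"layer {layer}: attention shape {tuple(attn.shape)}")
```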
VALUE: Understanding Dialect Disparity in NLU
To understand disparities in current models and to facilitate more dialect-competent NLU systems, we introduce the VernAcular Language Understanding Evaluation (VALUE) benchmark, a challenging variant of GLUE that we created with a set of lexical and morphosyntactic transformation rules.
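Purely as an illustration of what a lexical or morphosyntactic transformation rule looks like (this toy rule is not taken from VALUE, whose rules are linguistically validated and far more careful):

```python
# Toy sketch of a rule-based morphosyntactic transformation in the spirit
# of VALUE. NOT from the VALUE codebase: a crude copula/auxiliary "be"
# deletion, a well-documented feature of several English dialects.
import re

def delete_copula(sentence: str) -> str:
    # Drop present-tense "is"/"are" before a predicate (rough heuristic).
    return re.sub(r"\b(is|are)\s+", "", sentence)

print(delete_copula("She is going to the store."))  # -> She going to the store.
```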
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
While the general idea of self-supervised learning is identical across modalities, the actual algorithms and objectives differ widely because they were developed with a single modality in mind.
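The shared recipe is compact: a student encodes a masked view of the input and regresses the layer-averaged representations that an exponential-moving-average (EMA) teacher produces from the unmasked view. A schematic PyTorch sketch, assuming encoders that expose per-layer outputs through a hypothetical return_all_layers flag (this is not the released implementation):

```python
# Schematic data2vec-style training step (not the released implementation).
# Assumes student/teacher share an architecture whose forward pass returns
# a list of per-layer outputs when called with return_all_layers=True.
import torch
import torch.nn.functional as F

def data2vec_step(student, teacher, x, mask, top_k=8, ema_decay=0.999):
    # Teacher targets: average of the top-K layer outputs on the FULL input.
    with torch.no_grad():
        layers = teacher(x, return_all_layers=True)        # list of (B, T, D)
        targets = torch.stack(layers[-top_k:]).mean(dim=0)

    # Student predicts from the MASKED view; loss only at masked positions.
    pred = student(x * (~mask).unsqueeze(-1), return_all_layers=True)[-1]
    loss = F.smooth_l1_loss(pred[mask], targets[mask])

    # EMA update: the teacher slowly tracks the student.
    with torch.no_grad():
        for p_t, p_s in zip(teacher.parameters(), student.parameters()):
            p_t.mul_(ema_decay).add_(p_s, alpha=1.0 - ema_decay)
    return loss
```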
Monolingual and Cross-Lingual Acceptability Judgments with the Italian CoLA corpus
The development of automated approaches to linguistic acceptability has been greatly fostered by the availability of the English CoLA corpus, which has also been included in the widely used GLUE benchmark.
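Both corpora share the same sentence-plus-binary-label layout; the English version ships with GLUE and loads in a few lines with the datasets library (the Italian corpus is distributed separately under its own Hub id):

```python
# Sketch: load the English CoLA corpus from GLUE with the datasets library.
# The Italian ItaCoLA corpus has the same sentence/label structure but is
# distributed under its own (separate) dataset id on the Hub.
from datasets import load_dataset

cola = load_dataset("glue", "cola")
example = cola["train"][0]
label = "acceptable" if example["label"] == 1 else "unacceptable"
print(example["sentence"], "->", label)
```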
General Cross-Architecture Distillation of Pretrained Language Models into Matrix Embeddings
We match or exceed the scores of ELMo for all tasks of the GLUE benchmark except for the sentiment analysis task SST-2 and the linguistic acceptability task CoLA.
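The generic shape of such cross-architecture distillation is easy to sketch: freeze a pretrained teacher and train a student that is nothing but an embedding matrix to reproduce the teacher's token representations. This is a schematic rendering, not the paper's exact objective:

```python
# Schematic cross-architecture distillation: a frozen transformer teacher
# supervises an embedding-matrix student via MSE on token vectors.
# Generic sketch only; not the paper's exact training objective.
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
teacher = AutoModel.from_pretrained("bert-base-uncased").eval()

# Student: one static vector per vocabulary item (a matrix embedding).
student = nn.Embedding(tokenizer.vocab_size, teacher.config.hidden_size)
optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)

batch = tokenizer(["The cat sat on the mat."], return_tensors="pt")
with torch.no_grad():
    target = teacher(**batch).last_hidden_state   # (B, T, D) contextual vectors

pred = student(batch["input_ids"])                # (B, T, D) static vectors
loss = nn.functional.mse_loss(pred, target)
loss.backward()
optimizer.step()
print(f"distillation loss: {loss.item():.4f}")
```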
Charformer: Fast Character Transformers via Gradient-based Subword Tokenization
In this paper, we propose a new model inductive bias that learns a subword tokenization end-to-end as part of the model.
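The core of that inductive bias, gradient-based subword tokenization (GBST), can be sketched as pooling candidate character blocks of several sizes and letting a learned scorer mix them per position. A simplified rendering that omits offsets, downsampling, and other details of the paper:

```python
# Simplified sketch of gradient-based subword tokenization (GBST):
# pool character embeddings into candidate blocks of several sizes,
# score each candidate, and mix them per position with a softmax.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleGBST(nn.Module):
    def __init__(self, dim: int, block_sizes=(1, 2, 3, 4)):
        super().__init__()
        self.block_sizes = block_sizes
        self.scorer = nn.Linear(dim, 1)  # one relevance score per candidate

    def forward(self, x):                          # x: (B, T, D) char embeddings
        candidates = []
        for b in self.block_sizes:
            # Mean-pool non-overlapping blocks of size b, then broadcast the
            # block representation back to every position it covers.
            pooled = F.avg_pool1d(x.transpose(1, 2), b, stride=b, ceil_mode=True)
            up = pooled.repeat_interleave(b, dim=-1)[..., : x.size(1)]
            candidates.append(up.transpose(1, 2))  # (B, T, D)
        cand = torch.stack(candidates, dim=2)      # (B, T, K, D)
        weights = F.softmax(self.scorer(cand).squeeze(-1), dim=-1)  # (B, T, K)
        return (weights.unsqueeze(-1) * cand).sum(dim=2)            # (B, T, D)

x = torch.randn(2, 16, 64)
print(SimpleGBST(64)(x).shape)  # torch.Size([2, 16, 64])
```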
Language Models Use Monotonicity to Assess NPI Licensing
We investigate the semantic knowledge of language models (LMs), focusing on (1) whether these LMs create categories of linguistic environments based on their semantic monotonicity properties, and (2) whether these categories play a similar role in LMs as in human language understanding, using negative polarity item licensing as a case study.
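The underlying probe is easy to reproduce in miniature: compare how strongly a masked LM predicts an NPI such as "any" in a downward-entailing (licensing) context versus an upward-entailing (non-licensing) one. A sketch, not the paper's full protocol:

```python
# Miniature NPI-licensing probe (not the paper's full protocol): a masked
# LM should prefer the NPI "any" after "nobody" (downward-entailing,
# licensing) more than after "somebody" (non-licensing).
from transformers import pipeline

fill = pipeline("fill-mask", model="bert-base-uncased")

for context in ["Nobody has [MASK] potatoes.", "Somebody has [MASK] potatoes."]:
    scores = {r["token_str"]: r["score"] for r in fill(context, targets=["any", "some"])}
    print(context, scores)
```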
FNet: Mixing Tokens with Fourier Transforms
At longer input lengths, our FNet model is significantly faster: when compared to the "efficient" Transformers on the Long Range Arena benchmark, FNet matches the accuracy of the most accurate models, while outpacing the fastest models across all sequence lengths on GPUs (and across relatively shorter lengths on TPUs).
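The token-mixing sublayer itself reduces to a two-dimensional discrete Fourier transform over the sequence and hidden dimensions, keeping only the real part; a minimal sketch:

```python
# Minimal FNet-style token mixing: replace self-attention with a 2-D DFT
# along the sequence and hidden dimensions, keeping only the real part.
import torch

def fourier_mix(x: torch.Tensor) -> torch.Tensor:  # x: (batch, seq, hidden)
    return torch.fft.fft(torch.fft.fft(x, dim=-1), dim=-2).real

x = torch.randn(2, 128, 256)
print(fourier_mix(x).shape)  # torch.Size([2, 128, 256])
```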
Entailment as Few-Shot Learner
Large pre-trained language models (LMs) have demonstrated remarkable ability as few-shot learners.
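The entailment reformulation is simple to try on acceptability itself: phrase each label as a natural-language hypothesis and let a pretrained NLI model score it, which is what the zero-shot classification pipeline does. A sketch; the label wording is only an example, not the paper's templates:

```python
# Sketch of entailment-as-classification: labels become natural-language
# hypotheses scored by a pretrained NLI model. The label wording here is
# an example only, not the paper's templates.
from transformers import pipeline

nli = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

result = nli(
    "The cat sat on the mat.",
    candidate_labels=["a grammatical sentence", "an ungrammatical sentence"],
    hypothesis_template="This example is {}.",
)
print(result["labels"][0], round(result["scores"][0], 3))
```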