Token Classification
29 papers with code • 10 benchmarks • 9 datasets
Benchmarks
These leaderboards are used to track progress in Token Classification
Most implemented papers
WangchanBERTa: Pretraining transformer-based Thai Language Models
However, for a relatively low-resource language such as Thai, the choices of models are limited to training a BERT-based model based on a much smaller dataset or finetuning multi-lingual models, both of which yield suboptimal downstream performance.
Detecting Label Errors in Token Classification Data
Mislabeled examples are a common issue in real-world data, particularly for tasks like token classification where many labels must be chosen on a fine-grained basis.
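One common approach to finding such mislabeled tokens is to flag tokens whose annotated label receives low predicted probability from a trained model. The following is a minimal sketch of that idea (the function name and threshold are illustrative, not the paper's actual method):

```python
# Hedged sketch: flag tokens whose given label looks unlikely under a
# trained model's predictions. `threshold` is an illustrative choice.
def flag_label_errors(pred_probs, labels, threshold=0.1):
    """pred_probs: per-token dicts mapping label -> predicted probability.
    labels: the annotated label for each token.
    Returns indices of tokens whose given label is suspicious."""
    suspects = []
    for i, (probs, label) in enumerate(zip(pred_probs, labels)):
        if probs.get(label, 0.0) < threshold:
            suspects.append(i)
    return suspects

pred_probs = [
    {"O": 0.95, "B-PER": 0.05},  # token 0: model agrees with label "O"
    {"O": 0.02, "B-PER": 0.98},  # token 1: model strongly disagrees
]
labels = ["O", "O"]
print(flag_label_errors(pred_probs, labels))  # -> [1]
```

Ranking tokens by this probability (rather than hard-thresholding) gives annotators a prioritized review queue.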
Label Supervised LLaMA Finetuning
We evaluate this approach with Label Supervised LLaMA (LS-LLaMA), which is based on LLaMA-2-7B, a relatively small-scale LLM that can be finetuned on a single GeForce RTX 4090 GPU.
Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction
However, the BIO-tagging scheme relies on the correct order of model inputs, which is not guaranteed in real-world NER on scanned VrDs, where text is recognized and arranged by OCR systems.
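The order sensitivity of BIO tagging can be seen in a minimal decoder sketch (illustrative code, not the paper's method): entity spans are recovered by scanning B-/I- tags left to right, so a scrambled OCR reading order splits or corrupts the spans.

```python
# Hedged sketch: decode BIO tags into entity spans. Correct output
# depends on token order, which OCR on scanned documents may scramble.
def decode_bio(tokens, tags):
    entities, current = [], None
    for tok, tag in zip(tokens, tags):
        if tag.startswith("B-"):          # start of a new entity
            if current:
                entities.append(current)
            current = (tag[2:], [tok])
        elif tag.startswith("I-") and current and current[0] == tag[2:]:
            current[1].append(tok)        # continuation of current entity
        else:                             # "O" or an invalid continuation
            if current:
                entities.append(current)
            current = None
    if current:
        entities.append(current)
    return [(label, " ".join(toks)) for label, toks in entities]

tokens = ["John", "Smith", "visited", "New", "York"]
tags = ["B-PER", "I-PER", "O", "B-LOC", "I-LOC"]
print(decode_bio(tokens, tags))
# -> [('PER', 'John Smith'), ('LOC', 'New York')]
# If OCR reorders the tokens, the same tags no longer yield these spans.
```

This is why approaches like Token Path Prediction avoid tying the tagging scheme to a fixed reading order.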
Investigating Entity Knowledge in BERT with Simple Neural End-To-End Entity Linking
We show on an entity linking benchmark that (i) this model improves the entity representations over plain BERT, (ii) that it outperforms entity linking architectures that optimize the tasks separately and (iii) that it only comes second to the current state-of-the-art that does mention detection and entity disambiguation jointly.
Common-Knowledge Concept Recognition for SEVA
We build a common-knowledge concept recognition system for a Systems Engineer's Virtual Assistant (SEVA) which can be used for downstream tasks such as relation extraction, knowledge graph construction, and question-answering.
Counterfactual Detection meets Transfer Learning
Counterfactuals can be considered part of the domain of discourse structure and semantics, a core area of Natural Language Understanding. In this paper, we introduce an approach to counterfactual detection as well as to indexing the antecedents and consequents of counterfactual statements.
On Long-Tailed Phenomena in Neural Machine Translation
State-of-the-art Neural Machine Translation (NMT) models struggle to generate low-frequency tokens, and tackling this remains a major challenge.
Indic-Transformers: An Analysis of Transformer Language Models for Indian Languages
Language models based on the Transformer architecture have achieved state-of-the-art performance on a wide range of NLP tasks such as text classification, question-answering, and token classification.
NLRG at SemEval-2021 Task 5: Toxic Spans Detection Leveraging BERT-based Token Classification and Span Prediction Techniques
In our paper, we explore simple versions of both of these approaches and their performance on the task.