Sentence segmentation

19 papers with code • 1 benchmark • 3 datasets

Most implemented papers

Creating a Universal Dependencies Treebank of Spoken Frisian-Dutch Code-switched Data

universaldependencies/ud_frisian_dutch-fame 22 Feb 2021

This paper explores the difficulties of annotating transcribed spoken Dutch-Frisian code-switched utterances into Universal Dependencies.

Mukayese: Turkish NLP Strikes Back

alisafaya/mukayese Findings (ACL) 2022

As a solution, we present Mukayese, a set of NLP benchmarks for the Turkish language that contains several NLP tasks.

SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content

slateauthors/slate 8 Nov 2022

We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard.

Prosodic features improve sentence segmentation and parsing

ekayen/prosody_nlp 23 Feb 2023

Parsing spoken dialogue presents challenges that parsing text does not, including a lack of clear sentence boundaries.

Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation

bminixhofer/nnsplit 30 May 2023

Many NLP pipelines split text into sentences as one of the crucial preprocessing steps.

KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using Large Language Models

jiho283/kg-gpt 17 Oct 2023

While large language models (LLMs) have made considerable advancements in understanding and generating unstructured text, their application in structured data remains underexplored.