ViNLI (Vietnamese Natural Language Inference Dataset)

Introduced by Huynh et al. in ViNLI: A Vietnamese Corpus for Studies on Open-Domain Natural Language Inference

A large-scale and high-quality corpus is necessary for studies on NLI for Vietnamese, which can be considered a low-resource language. In this paper, we introduce ViNLI (Vietnamese Natural Language Inference), an open-domain and high-quality corpus for evaluating Vietnamese NLI models, which is created and evaluated with a strict process of quality control. ViNLI comprises over 30,000 human-annotated premise-hypothesis sentence pairs extracted from more than 800 online news articles on 13 distinct topics.

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

No data loaders found. You can submit your data loader here.

Tasks

Similar Datasets

UIT-ViCoV19QA

UIT-ViQuAD

OCNLI

KorNLI

Usage

License

Unknown

Modalities

Languages

Vietnamese