OCNLI (Original Chinese Natural Language Inference)

OCNLI stands for Original Chinese Natural Language Inference. It is a corpus for Chinese natural language inference, collected by closely following the procedures of MNLI, but with enhanced strategies aimed at eliciting more challenging inference pairs. No human or machine translation was used in creating the dataset, so the Chinese texts are original rather than translated.

OCNLI has roughly 50k pairs for training, 3k for development, and 3k for test. For the test set, only the data is released; its labels are withheld.
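
Because the test labels are not released, code that handles all three splits needs to tolerate pairs without a label. The following is a minimal sketch of one way to do that, not the dataset's official format or tooling: the `NLIPair` class, its field names, the `accuracy` helper, and the example sentences are all hypothetical, and the three-way label set is assumed to be the one used by MNLI.

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumed three-way label set, as in MNLI.
LABELS = ("entailment", "neutral", "contradiction")


@dataclass
class NLIPair:
    """One premise/hypothesis pair (hypothetical container, not the official schema)."""
    premise: str
    hypothesis: str
    label: Optional[str] = None  # None models a test pair whose label is withheld


def accuracy(pairs: List[NLIPair], predictions: List[str]) -> Optional[float]:
    """Accuracy over the labeled pairs only; None if no pair carries a label."""
    scored = [(p.label, y) for p, y in zip(pairs, predictions) if p.label is not None]
    if not scored:
        return None
    return sum(gold == pred for gold, pred in scored) / len(scored)


# Made-up examples for illustration; these are not taken from OCNLI.
dev_pairs = [
    NLIPair("他在看书。", "他在阅读。", "entailment"),
    NLIPair("她去了北京。", "她很喜欢北京。", "neutral"),
]
test_pairs = [NLIPair("今天下雨了。", "今天是晴天。")]  # label withheld

dev_score = accuracy(dev_pairs, ["entailment", "contradiction"])
test_score = accuracy(test_pairs, ["contradiction"])
```

With this layout, development pairs can be scored locally, while predictions on the test pairs can only be evaluated by whoever holds the withheld labels.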

OCNLI is part of the CLUE benchmark.

