Multi-CPR (Multi Domain Chinese Dataset for Passage Retrieval)

Introduced by Long et al. in Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval

Multi-CPR is a multi-domain Chinese dataset for passage retrieval. The data is collected from three domains: E-commerce, Entertainment video, and Medical. The dataset for each domain contains millions of passages and a set of human-annotated relevant query-passage pairs.

License


  • Unknown
