An expert-annotated word similarity dataset which provides a highly reliable, yet challenging, benchmark for rare word representation techniques.

Source: Card-660: Cambridge Rare Word Dataset - a Reliable Benchmark for Infrequent Word Representation Models

License

  • Unknown

Modalities

Languages

Tasks