KAMEL (Knowledge Analysis with Multitoken Entities in Language Models)

Introduced by Kalo et al. in KAMEL : Knowledge Analysis with Multitoken Entities in Language Models

KAMEL comprises knowledge about 234 relations from Wikidata with a large training, validation, and test dataset. We make sure that all facts are also present in Wikipedia so that they have been seen during the pre-training procedure of the LMs we are probing. Most importantly we overcome the limitations of existing probing datasets by (1) having a larger variety of knowledge graph relations, (2) it contains single- and multi-token entities, (3) we use relations with literals, and (4) have alternative labels for entities. (5) Furthermore, we created an evaluation procedure for higher cardinality relations, which was missing in previous works, and (6) make sure that the dataset can be used for causal LMs.

Homepage

Benchmarks

Add a new result Link an existing benchmark

Trend	Task	Dataset Variant	Best Model	Paper	Code
	Probing Language Models	KAMEL	OPT-13b

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

JanKalo/KAMEL

KAMEL (Knowledge Analysis with Multitoken Entities in Language Models)

Benchmarks

Add a new result Link an existing benchmark

Papers

Dataset Loaders

Add Remove

Tasks

Similar Datasets

BEAR-big

BioLAMA

KMIR

Usage

License

Modalities

Languages

KAMEL (Knowledge Analysis with Multitoken Entities in Language Models)

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit

Similar Datasets

BEAR-big

BioLAMA

KMIR

Usage

License Edit

Modalities Edit

Languages Edit

Benchmarks

Add a new result Link an existing benchmark

Dataset Loaders

Add Remove

Tasks

License

Modalities

Languages