Texts

CoWeSe (Corpus Web Salud Espanol)

Introduced by Carrino et al. in Spanish Biomedical Crawled Corpus: A Large, Diverse Dataset for Spanish Biomedical Language Models

CoWeSe is a Spanish biomedical corpus consisting of 4.5GB (about 750M tokens) of clean plain text. CoWeSe is the result of a massive crawler on 3000 Spanish domains executed in 2020.

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

No data loaders found. You can submit your data loader here.

Tasks

Behavioural cloning

Similar Datasets

2018 n2c2 (Track 2) - Adverse Drug Events and Medication Extraction

Usage

License

Unknown

Modalities

Texts

Languages

Spanish

CoWeSe (Corpus Web Salud Espanol)

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit