2 code implementations • 22 Jan 2024 • Karel D'Oosterlinck, Omar Khattab, François Remy, Thomas Demeester, Chris Develder, Christopher Potts
Multi-label classification problems with thousands of classes are hard to solve with in-context learning alone, as language models (LMs) might lack prior knowledge about the precise classes or how to assign them, and it is generally infeasible to demonstrate every class in a prompt.
no code implementations • 27 Nov 2023 • François Remy, Kris Demuynck, Thomas Demeester
Our new multilingual model enables a range of languages to benefit from our advancements in biomedical semantic representation learning, opening a new avenue for bioinformatics researchers around the world.
no code implementations • 5 Oct 2023 • François Remy, Pieter Delobelle, Bettina Berendt, Kris Demuynck, Thomas Demeester
This one-to-many token mapping tremendously improves the initialization of the embedding table for the target language.
no code implementations • 31 Jul 2023 • François Remy, Simone Scaboro, Beatrice Portelli
Biomedical entity linking, also known as biomedical concept normalization, has recently witnessed the rise to prominence of zero-shot contrastive models.
no code implementations • 1 Jun 2023 • François Remy, Thomas Demeester
Conclusion: AGCT is a novel and valuable resource for biomedical tasks that require human-readable definitions for SnomedCT concepts.
1 code implementation • 22 May 2023 • Karel D'Oosterlinck, François Remy, Johannes Deleu, Thomas Demeester, Chris Develder, Klim Zaporojets, Aneiss Ghodsi, Simon Ellershaw, Jack Collins, Christopher Potts
We introduce BioDEX, a large-scale resource for Biomedical adverse Drug Event Extraction, rooted in the historical output of drug safety reporting in the U.S. BioDEX consists of 65k abstracts and 19k full-text biomedical papers with 256k associated document-level safety reports created by medical experts.
no code implementations • 11 May 2023 • François Remy, Alfiya Khabibullina, Thomas Demeester
This paper shines a light on the potential of definition-based semantic models for detecting idiomatic and semi-idiomatic multiword expressions (MWEs) in clinical terminology.
no code implementations • 21 Oct 2022 • François Remy, Kris Demuynck, Thomas Demeester
This work introduces BioLORD, a new pre-training strategy for producing meaningful representations for clinical sentences and biomedical concepts.