TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Sentence Embeddings For Biomedical Texts	BIOSSES	BioSentVec (PubMed + MIMIC-III)	Pearson Correlation	0.795	# 7
Sentence Embeddings For Biomedical Texts	BIOSSES	BioSentVec (MIMIC-III)	Pearson Correlation	0.350	# 12
Sentence Embeddings For Biomedical Texts	BIOSSES	BioSentVec (PubMed)	Pearson Correlation	0.817	# 4
Sentence Embeddings For Biomedical Texts	BIOSSES	Universal Sentence Encoder	Pearson Correlation	0.345	# 13
Sentence Embeddings For Biomedical Texts	MedSTS	BioSentVec (PubMed + MIMIC-III)	Pearson Correlation	0.767	# 1
Sentence Embeddings For Biomedical Texts	MedSTS	BioSentVec (MIMIC-III)	Pearson Correlation	0.759	# 2
Sentence Embeddings For Biomedical Texts	MedSTS	BioSentVec (PubMed)	Pearson Correlation	0.750	# 3
Sentence Embeddings For Biomedical Texts	MedSTS	Universal Sentence Encoder	Pearson Correlation	0.714	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/biosentvec-creating-sentence-embeddings-for/sentence-embeddings-for-biomedical-texts-on-2)](https://paperswithcode.com/sota/sentence-embeddings-for-biomedical-texts-on-2?p=biosentvec-creating-sentence-embeddings-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/biosentvec-creating-sentence-embeddings-for/sentence-embeddings-for-biomedical-texts-on)](https://paperswithcode.com/sota/sentence-embeddings-for-biomedical-texts-on?p=biosentvec-creating-sentence-embeddings-for)`

BioSentVec: creating sentence embeddings for biomedical texts

22 Oct 2018 · Qingyu Chen, Yifan Peng, Zhiyong Lu ·

Sentence embeddings have become an essential part of today's natural language processing (NLP) systems, especially together advanced deep learning methods. Although pre-trained sentence encoders are available in the general domain, none exists for biomedical texts to date. In this work, we introduce BioSentVec: the first open set of sentence embeddings trained with over 30 million documents from both scholarly articles in PubMed and clinical notes in the MIMIC-III Clinical Database. We evaluate BioSentVec embeddings in two sentence pair similarity tasks in different text genres. Our benchmarking results demonstrate that the BioSentVec embeddings can better capture sentence semantics compared to the other competitive alternatives and achieve state-of-the-art performance in both tasks. We expect BioSentVec to facilitate the research and development in biomedical text mining and to complement the existing resources in biomedical word embeddings. BioSentVec is publicly available at https://github.com/ncbi-nlp/BioSentVec

PDF Abstract

Code

Add Remove Mark official

ncbi-nlp/BioSentVec official

553

ncbi-nlp/BioWordVec official

138

ncbi-nlp/BLUE_Benchmark

274

ESBigeard/paper_graph

Tasks

Add Remove

Benchmarking

Sentence

Sentence Embeddings

Sentence Embeddings For Biomedical Texts

Word Embeddings

Datasets

BIOSSES

Results from the Paper

Edit

Ranked #1 on Sentence Embeddings For Biomedical Texts on MedSTS (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Sentence Embeddings For Biomedical Texts	BIOSSES	BioSentVec (PubMed + MIMIC-III)	Pearson Correlation	0.795	# 7	Compare
Sentence Embeddings For Biomedical Texts	BIOSSES	BioSentVec (MIMIC-III)	Pearson Correlation	0.350	# 12	Compare
Sentence Embeddings For Biomedical Texts	BIOSSES	BioSentVec (PubMed)	Pearson Correlation	0.817	# 4	Compare
Sentence Embeddings For Biomedical Texts	BIOSSES	Universal Sentence Encoder	Pearson Correlation	0.345	# 13	Compare
Sentence Embeddings For Biomedical Texts	MedSTS	BioSentVec (PubMed + MIMIC-III)	Pearson Correlation	0.767	# 1	Compare
Sentence Embeddings For Biomedical Texts	MedSTS	BioSentVec (MIMIC-III)	Pearson Correlation	0.759	# 2	Compare
Sentence Embeddings For Biomedical Texts	MedSTS	BioSentVec (PubMed)	Pearson Correlation	0.750	# 3	Compare
Sentence Embeddings For Biomedical Texts	MedSTS	Universal Sentence Encoder	Pearson Correlation	0.714	# 4	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

BioSentVec: creating sentence embeddings for biomedical texts

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove