PubMedCite is a domain-specific dataset with about 192K biomedical scientific papers and a large citation graph preserving 917K citation relationships between them.
3 PAPERS • NO BENCHMARKS YET
…Given a text query and list of molecules without any reference textual information (represented, for example, as SMILES strings, graphs, or other equivalent representations) retrieve the molecule corresponding This requires the integration of two very different types of information: the structured knowledge represented by text and the chemical properties present in molecular graphs.
24 PAPERS • 4 BENCHMARKS
…Models trained with SourceData-NLP will furthermore enable the development of tools able to extract causal hypotheses from the literature and assemble them into knowledge graphs.
0 PAPER • NO BENCHMARKS YET