GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text

14 Aug 2023  ยท  PengFei Liu, Yiming Ren, Jun Tao, Zhixiang Ren ยท

Large language models have made significant strides in natural language processing, enabling innovative applications in molecular science by processing textual representations of molecules. However, most existing language models cannot capture the rich information with complex molecular structures or images. In this paper, we introduce GIT-Mol, a multi-modal large language model that integrates the Graph, Image, and Text information. To facilitate the integration of multi-modal molecular data, we propose GIT-Former, a novel architecture that is capable of aligning all modalities into a unified latent space. We achieve a 5%-10% accuracy increase in properties prediction and a 20.2% boost in molecule generation validity compared to the baselines. With the any-to-language molecular translation strategy, our model has the potential to perform more downstream tasks, such as compound name recognition and chemical reaction prediction.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Drug Discovery BACE GIT-Mol(G+S) AUC 0.8108 # 6
Drug Discovery BBBP GIT-Mol(G+S) AUC 0.739 # 3
Image Captioning ChEBI-20 GIT-Mol BLEU 0.924 # 1
Exact 0.461 # 1
Levenshtein 6.575 # 1
MACCS FTS 0.962 # 1
RDK FTS 0.906 # 1
Morgan FTS 0.894 # 1
Validity 0.899 # 1
Text-based de novo Molecule Generation ChEBI-20 GIT-Mol-caption BLEU 75.6 # 14
Exact Match 5.1 # 18
Levenshtein 0.26315 # 17
MACCS FTS 73.8 # 16
RDK FTS 58.2 # 17
Morgan FTS 51.9 # 17
Validity 92.8 # 7
Exact 0.051 # 1
Drug Discovery ClinTox GIT-Mol(G+S) AUC 0.883 # 2
Drug Discovery SIDER GIT-Mol(G+S) AUC 0.634 # 3
Drug Discovery Tox21 GIT-Mol(G+S) AUC 0.759 # 10
Drug Discovery ToxCast GIT-Mol(G+S) AUC 0.668 # 4

Methods


No methods listed for this paper. Add relevant methods here