no code implementations • 16 Jul 2024 • Yuxuan Wu, Xiao Yi, Yang Tan, Huiqun Yu, Guisheng Fan
Protein retrieval, which targets the deconstruction of the relationship between sequences, structures and functions, empowers the advancing of biology.
1 code implementation • 23 Apr 2024 • Yang Tan, Mingchen Li, Bingxin Zhou, Bozitao Zhong, Lirong Zheng, Pan Tan, Ziyi Zhou, Huiqun Yu, Guisheng Fan, Liang Hong
Fine-tuning Pre-trained protein language models (PLMs) has emerged as a prominent strategy for enhancing downstream prediction tasks, often outperforming traditional supervised learning approaches.
1 code implementation • 26 Oct 2023 • Yang Tan, Mingchen Li, Pan Tan, Ziyi Zhou, Huiqun Yu, Guisheng Fan, Liang Hong
Moreover, despite the wealth of benchmarks and studies in the natural language community, there remains a lack of a comprehensive benchmark for systematically evaluating protein language model quality.
1 code implementation • 3 Sep 2023 • Yang Tan, Mingchen Li, Zijie Huang, Huiqun Yu, Guisheng Fan
Generative large language models (LLMs) have shown great success in various applications, including question-answering (QA) and dialogue systems.
no code implementations • 24 Jul 2023 • Pan Tan, Mingchen Li, Yuanxi Yu, Fan Jiang, Lirong Zheng, Banghao Wu, Xinyu Sun, Liqi Kang, Jie Song, Liang Zhang, Yi Xiong, Wanli Ouyang, Zhiqiang Hu, Guisheng Fan, Yufeng Pei, Liang Hong
Designing protein mutants of both high stability and activity is a critical yet challenging task in protein engineering.
no code implementations • 29 Dec 2022 • Mingchen Li, Liqi Kang, Yi Xiong, Yu Guang Wang, Guisheng Fan, Pan Tan, Liang Hong
Here, we develop SESNet, a supervised deep-learning model to predict the fitness for protein mutants by leveraging both sequence and structure information, and exploiting attention mechanism.