Search Results for author: Zhangyang Gao

Found 44 papers, 26 papers with code

Advances of Deep Learning in Protein Science: A Comprehensive Survey

no code implementations • 8 Mar 2024 • Bozhen Hu, Cheng Tan, Lirong Wu, Jiangbin Zheng, Jun Xia, Zhangyang Gao, Zicheng Liu, Fandi Wu, Guijun Zhang, Stan Z. Li

Protein representation learning plays a crucial role in understanding the structure and function of proteins, which are essential biomolecules involved in various biological processes.

Drug Discovery, Protein Function Prediction, +2 more

A Teacher-Free Graph Knowledge Distillation Framework with Dual Self-Distillation

1 code implementation • 6 Mar 2024 • Lirong Wu, Haitao Lin, Zhangyang Gao, Guojiang Zhao, Stan Z. Li

As a result, TGS enjoys the benefits of graph topology awareness in training but is free from data dependency in inference.

Knowledge Distillation

Decoupling Weighing and Selecting for Integrating Multiple Graph Pre-training Tasks

1 code implementation • 3 Mar 2024 • Tianyu Fan, Lirong Wu, Yufei Huang, Haitao Lin, Cheng Tan, Zhangyang Gao, Stan Z. Li

In this paper, we identify two important collaborative processes for this topic: (1) select: how to select an optimal task combination from a given task pool based on their compatibility, and (2) weigh: how to weigh the selected tasks based on their importance.

Graph Representation Learning

Re-Dock: Towards Flexible and Realistic Molecular Docking with Diffusion Bridge

no code implementations • 18 Feb 2024 • Yufei Huang, Odin Zhang, Lirong Wu, Cheng Tan, Haitao Lin, Zhangyang Gao, Siyuan Li, Stan Z. Li

Accurate prediction of protein-ligand binding structures, a task known as molecular docking, is crucial for drug design but remains challenging.

Molecular Docking

PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for Efficient and Generalizable Compound-Protein Interaction Prediction

1 code implementation • 13 Feb 2024 • Lirong Wu, Yufei Huang, Cheng Tan, Zhangyang Gao, Bozhen Hu, Haitao Lin, Zicheng Liu, Stan Z. Li

Compound-Protein Interaction (CPI) prediction aims to predict the pattern and strength of compound-protein interactions for rational drug discovery.

Drug Discovery

A Graph is Worth $K$ Words: Euclideanizing Graph using Pure Transformer

no code implementations • 4 Feb 2024 • Zhangyang Gao, Daize Dong, Cheng Tan, Jun Xia, Bozhen Hu, Stan Z. Li

Despite recent GNN and Graphformer efforts to encode graphs as Euclidean vectors, recovering the original graph from the vectors remains a challenge.

Graph Classification, Graph Generation, +1 more

FoldToken: Learning Protein Language via Vector Quantization and Beyond

no code implementations • 4 Feb 2024 • Zhangyang Gao, Cheng Tan, Jue Wang, Yufei Huang, Lirong Wu, Stan Z. Li

Is there a foreign language describing protein sequences and structures simultaneously?

Quantization

MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning

no code implementations • 3 Feb 2024 • Zhe Li, Laurence T. Yang, Bocheng Ren, Xin Nie, Zhangyang Gao, Cheng Tan, Stan Z. Li

The scarcity of annotated data has sparked significant interest in unsupervised pre-training methods that leverage medical reports as auxiliary signals for medical visual representation learning.

Contrastive Learning, Image Classification, +5 more

DCS-Net: Pioneering Leakage-Free Point Cloud Pretraining Framework with Global Insights

no code implementations • 3 Feb 2024 • Zhe Li, Zhangyang Gao, Cheng Tan, Stan Z. Li, Laurence T. Yang

Experimental results demonstrate that our method enhances the expressive capacity of existing point cloud models and effectively addresses the issue of information leakage.

MMDesign: Multi-Modality Transfer Learning for Generative Protein Design

1 code implementation • 11 Dec 2023 • Jiangbin Zheng, Siyuan Li, Yufei Huang, Zhangyang Gao, Cheng Tan, Bozhen Hu, Jun Xia, Ge Wang, Stan Z. Li

Protein design involves generating protein sequences based on their corresponding protein backbones.

Data Augmentation, Language Modelling, +2 more

Efficiently Predicting Protein Stability Changes Upon Single-point Mutation with Large Language Models

no code implementations • 7 Dec 2023 • Yijie Zhang, Zhangyang Gao, Cheng Tan, Stan Z. Li

Predicting protein stability changes induced by single-point mutations has been a persistent challenge over the years, attracting immense interest from numerous researchers.

Computational Efficiency

Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-Consistency Training

1 code implementation • 23 Nov 2023 • Cheng Tan, Jingxuan Wei, Zhangyang Gao, Linzhuang Sun, Siyuan Li, Xihong Yang, Stan Z. Li

Remarkably, we show that even smaller base models, when equipped with our proposed approach, can achieve results comparable to those of larger models, illustrating the potential of our approach in harnessing the power of rationales for improved multimodal reasoning.

Multimodal Reasoning

General Point Model with Autoencoding and Autoregressive

no code implementations • 25 Oct 2023 • Zhe Li, Zhangyang Gao, Cheng Tan, Stan Z. Li, Laurence T. Yang

This model is versatile, allowing fine-tuning for downstream point cloud representation tasks, as well as unconditional and conditional generation tasks.

Language Modelling, Large Language Model, +2 more

Protein 3D Graph Structure Learning for Robust Structure-based Protein Property Prediction

no code implementations • 14 Oct 2023 • Yufei Huang, Siyuan Li, Jin Su, Lirong Wu, Odin Zhang, Haitao Lin, Jingqi Qi, Zihan Liu, Zhangyang Gao, Yuyang Liu, Jiangbin Zheng, Stan Z. Li

To study this problem, we identify a Protein 3D Graph Structure Learning Problem for Robust Protein Property Prediction (PGSL-RP3), collect benchmark datasets, and present a protein Structure embedding Alignment Optimization framework (SAO) to mitigate the problem of structure embedding bias between the predicted and experimental protein structures.

Graph structure learning, Property Prediction, +2 more

Revisiting the Temporal Modeling in Spatio-Temporal Predictive Learning under A Unified View

no code implementations • 9 Oct 2023 • Cheng Tan, Jue Wang, Zhangyang Gao, Siyuan Li, Lirong Wu, Jun Xia, Stan Z. Li

In this paper, we re-examine the two dominant temporal modeling approaches within the realm of spatio-temporal predictive learning, offering a unified perspective.

Self-Supervised Learning

Enhancing Human-like Multi-Modal Reasoning: A New Challenging Dataset and Comprehensive Framework

1 code implementation • 24 Jul 2023 • Jingxuan Wei, Cheng Tan, Zhangyang Gao, Linzhuang Sun, Siyuan Li, Bihui Yu, Ruifeng Guo, Stan Z. Li

Multimodal reasoning is a critical component in the pursuit of artificial intelligence systems that exhibit human-like intelligence, especially when tackling complex tasks.

Contrastive Learning, Multimodal Reasoning, +2 more

OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

2 code implementations • NeurIPS 2023 • Cheng Tan, Siyuan Li, Zhangyang Gao, Wenfei Guan, Zedong Wang, Zicheng Liu, Lirong Wu, Stan Z. Li

Spatio-temporal predictive learning is a learning paradigm that enables models to learn spatial and temporal patterns by predicting future frames from given past frames in an unsupervised manner.

Weather Forecasting

567 stars

MotifRetro: Exploring the Combinability-Consistency Trade-offs in retrosynthesis via Dynamic Motif Editing

1 code implementation • 20 May 2023 • Zhangyang Gao, Xingran Chen, Cheng Tan, Stan Z. Li

Is there a unified framework for graph-based retrosynthesis prediction?

Retrosynthesis

Knowledge-Design: Pushing the Limit of Protein Design via Knowledge Refinement

1 code implementation • 20 May 2023 • Zhangyang Gao, Cheng Tan, Stan Z. Li

After witnessing the great success of pretrained models on diverse protein-related tasks and the fact that recovery is highly correlated with confidence, we wonder whether this knowledge can push the limits of protein design further.

Ranked #1 on Word Sense Disambiguation on TS50

Protein Design, Retrieval, +1 more

140 stars

Cross-Gate MLP with Protein Complex Invariant Embedding is A One-Shot Antibody Designer

1 code implementation • 21 Apr 2023 • Cheng Tan, Zhangyang Gao, Lirong Wu, Jun Xia, Jiangbin Zheng, Xihong Yang, Yue Liu, Bozhen Hu, Stan Z. Li

In this paper, we propose a simple yet effective model that can co-design 1D sequences and 3D structures of CDRs in a one-shot manner.

Specificity

PrefixMol: Target- and Chemistry-aware Molecule Design via Prefix Embedding

no code implementations • 14 Feb 2023 • Zhangyang Gao, Yuqi Hu, Cheng Tan, Stan Z. Li

Is there a unified model for generating molecules considering different conditions, such as binding pockets and chemical properties?

Multi-Task Learning

RDesign: Hierarchical Data-efficient Representation Learning for Tertiary Structure-based RNA Design

1 code implementation • 25 Jan 2023 • Cheng Tan, Yijie Zhang, Zhangyang Gao, Bozhen Hu, Siyuan Li, Zicheng Liu, Stan Z. Li

We crafted a large, well-curated benchmark dataset and designed a comprehensive structural modeling approach to represent the complex RNA tertiary structure.

Contrastive Learning, Protein Design, +2 more

DiffSDS: A language diffusion model for protein backbone inpainting under geometric conditions and constraints

1 code implementation • 22 Jan 2023 • Zhangyang Gao, Cheng Tan, Stan Z. Li

Have you ever been troubled by the complexity and computational cost of SE(3) protein structure modeling and been amazed by the simplicity and power of language modeling?

Denoising, Language Modelling

RFold: RNA Secondary Structure Prediction with Decoupled Optimization

1 code implementation • 2 Dec 2022 • Cheng Tan, Zhangyang Gao, Stan Z. Li

The secondary structure of ribonucleic acid (RNA) is more stable and accessible in the cell than its tertiary structure, making it essential for functional prediction.

SimVP: Towards Simple yet Powerful Spatiotemporal Predictive Learning

2 code implementations • 22 Nov 2022 • Cheng Tan, Zhangyang Gao, Siyuan Li, Stan Z. Li

Without introducing any extra tricks and strategies, SimVP can achieve superior performance on various benchmark datasets.

Ranked #1 on Video Prediction on Moving MNIST

Video Prediction

567 stars

Teaching Yourself: Graph Self-Distillation on Neighborhood for Node Classification

no code implementations • 5 Oct 2022 • Lirong Wu, Jun Xia, Haitao Lin, Zhangyang Gao, Zicheng Liu, Guojiang Zhao, Stan Z. Li

Despite the great academic success of Graph Neural Networks (GNNs), Multi-Layer Perceptrons (MLPs) remain the primary workhorse for practical industrial applications.

Classification, Node Classification

PiFold: Toward effective and efficient protein inverse folding

1 code implementation • 22 Sep 2022 • Zhangyang Gao, Cheng Tan, Pablo Chacón, Stan Z. Li

How can we design protein sequences folding into the desired structures effectively and efficiently?

Protein Design

153 stars

A Survey on Generative Diffusion Model

1 code implementation • 6 Sep 2022 • Hanqun Cao, Cheng Tan, Zhangyang Gao, Yilun Xu, Guangyong Chen, Pheng-Ann Heng, Stan Z. Li

Deep generative models are a prominent approach for data generation, and have been used to produce high quality samples in various domains.

Dimensionality Reduction

841 stars

Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning

2 code implementations • CVPR 2023 • Cheng Tan, Zhangyang Gao, Lirong Wu, Yongjie Xu, Jun Xia, Siyuan Li, Stan Z. Li

Spatiotemporal predictive learning aims to generate future frames by learning from historical frames.

Ranked #12 on Video Prediction on Moving MNIST

Computational Efficiency, Video Prediction

567 stars

CoSP: Co-supervised pretraining of pocket and ligand

no code implementations • 23 Jun 2022 • Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li

Can we inject the pocket-ligand interaction knowledge into the pre-trained model and jointly learn their chemical space?

Contrastive Learning, Specificity

SimVP: Simpler yet Better Video Prediction

3 code implementations • CVPR 2022 • Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li

From CNN, RNN, to ViT, we have witnessed remarkable advancements in video prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated training strategies.

Ranked #2 on Video Prediction on Human3.6M

Video Prediction

567 stars

Hyperspherical Consistency Regularization

1 code implementation • CVPR 2022 • Cheng Tan, Zhangyang Gao, Lirong Wu, Siyuan Li, Stan Z. Li

Though it benefits from both feature-dependent information from self-supervised learning and label-dependent information from supervised learning, this scheme still suffers from classifier bias.

Contrastive Learning, Self-Supervised Learning, +1 more

Generative De Novo Protein Design with Global Context

1 code implementation • 21 Apr 2022 • Cheng Tan, Zhangyang Gao, Jun Xia, Bozhen Hu, Stan Z. Li

Thus, we propose the Global-Context Aware generative de novo protein design method (GCA), consisting of local and global modules.

Protein Design, Protein Structure Prediction

SemiRetro: Semi-template framework boosts deep retrosynthesis prediction

no code implementations • 12 Feb 2022 • Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li

Experimental results show that SemiRetro significantly outperforms both existing TB and TF methods.

Graph Learning, Retrosynthesis

Target-aware Molecular Graph Generation

no code implementations • 10 Feb 2022 • Cheng Tan, Zhangyang Gao, Stan Z. Li

Building on recent advances in flow-based molecular generation models, we propose SiamFlow, which forces the flow to fit the distribution of target sequence embeddings in latent space.

Drug Discovery, Graph Generation, +1 more

AlphaDesign: A graph protein design method and benchmark on AlphaFoldDB

1 code implementation • 1 Feb 2022 • Zhangyang Gao, Cheng Tan, Stan Z. Li

While DeepMind has tentatively solved protein folding, its inverse problem, protein design, which predicts protein sequences from their 3D structures, still faces significant challenges.

Protein Design, Protein Folding

An Empirical Study: Extensive Deep Temporal Point Process

1 code implementation • 19 Oct 2021 • Haitao Lin, Cheng Tan, Lirong Wu, Zhangyang Gao, Stan Z. Li

In this paper, we first review recent research emphasis and difficulties in modeling asynchronous event sequences with deep temporal point processes, which can be grouped into four areas: encoding of the history sequence, formulation of the conditional intensity function, relational discovery of events, and learning approaches for optimization.

Graph structure learning, Variational Inference

Git: Clustering Based on Graph of Intensity Topology

2 code implementations • 4 Oct 2021 • Zhangyang Gao, Haitao Lin, Cheng Tan, Lirong Wu, Stan Z. Li

Accuracy, Robustness to noise and scale, Interpretability, Speed, and Ease of use (ARISE) are crucial requirements of a good clustering algorithm.

Ranked #1 on Clustering Algorithms Evaluation on Fashion-MNIST

Clustering, Clustering Algorithms Evaluation

GraphMixup: Improving Class-Imbalanced Node Classification on Graphs by Self-supervised Context Prediction

no code implementations • 21 Jun 2021 • Lirong Wu, Haitao Lin, Zhangyang Gao, Cheng Tan, Stan Z. Li

Recent years have witnessed great success in handling node classification tasks with Graph Neural Networks (GNNs).

Node Classification

Self-supervised Learning on Graphs: Contrastive, Generative, or Predictive

1 code implementation • 16 May 2021 • Lirong Wu, Haitao Lin, Zhangyang Gao, Cheng Tan, Stan Z. Li

In this survey, we extend the concept of SSL, which first emerged in the fields of computer vision and natural language processing, to present a timely and comprehensive review of existing SSL techniques for graph data.

Self-Supervised Learning

1,276 stars

Conditional Local Convolution for Spatio-temporal Meteorological Forecasting

1 code implementation • 4 Jan 2021 • Haitao Lin, Zhangyang Gao, Yongjie Xu, Lirong Wu, Ling Li, Stan Z. Li

We further propose the distance and orientation scaling terms to reduce the impacts of irregular spatial distribution.

Spatio-Temporal Forecasting, Weather Forecasting

Towards Robust Graph Neural Networks against Label Noise

no code implementations • 1 Jan 2021 • Jun Xia, Haitao Lin, Yongjie Xu, Lirong Wu, Zhangyang Gao, Siyuan Li, Stan Z. Li

A pseudo label is computed from the neighboring labels for each node in the training set using LP; meta learning is utilized to learn a proper aggregation of the original and pseudo label as the final label.

Attribute, Learning with noisy labels, +3 more

LookHops: light multi-order convolution and pooling for graph classification

no code implementations • 28 Dec 2020 • Zhangyang Gao, Haitao Lin, Stan Z. Li

Convolution and pooling are the key operations for learning hierarchical representations for graph classification, where more expressive $k$-order ($k>1$) methods incur higher computational cost, limiting further applications.

General Classification, Graph Classification

Clustering Based on Graph of Density Topology

1 code implementation • 24 Sep 2020 • Zhangyang Gao, Haitao Lin, Stan Z. Li

GDT jointly considers the local and global structures of data samples: it first forms local clusters based on a density growing process, with a strategy for proper noise handling as well as cluster boundary detection, and then estimates a GDT from the relationships between local clusters in terms of a connectivity measure, giving a global topological graph.

Boundary Detection, Clustering
