Search Results for author: Haitao Lin

Found 49 papers, 27 papers with code

Learning to Model Graph Structural Information on MLPs via Graph Structure Self-Contrasting

no code implementations9 Sep 2024 Lirong Wu, Haitao Lin, Guojiang Zhao, Cheng Tan, Stan Z. Li

In this paper, we rethink the roles played by graph structural information in graph data training and identify that message passing is not the only path to modeling structural information.

Teach Harder, Learn Poorer: Rethinking Hard Sample Distillation for GNN-to-MLP Knowledge Distillation

1 code implementation20 Jul 2024 Lirong Wu, Yunfan Liu, Haitao Lin, Yufei Huang, Stan Z. Li

To bridge the gaps between powerful Graph Neural Networks (GNNs) and lightweight Multi-Layer Perceptron (MLPs), GNN-to-MLP Knowledge Distillation (KD) proposes to distill knowledge from a well-trained teacher GNN into a student MLP.

Knowledge Distillation

CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph

1 code implementation16 Jun 2024 Haitao Lin, Guojiang Zhao, Odin Zhang, Yufei Huang, Lirong Wu, Zicheng Liu, Siyuan Li, Cheng Tan, Zhifeng Gao, Stan Z. Li

To broaden the scope, we have adapted these models to a range of tasks essential in drug design, which are considered sub-tasks within the graph fill-in-the-blank tasks.

Fairness

Learning to Predict Mutation Effects of Protein-Protein Interactions by Microenvironment-aware Hierarchical Prompt Learning

1 code implementation16 May 2024 Lirong Wu, Yijun Tian, Haitao Lin, Yufei Huang, Siyuan Li, Nitesh V Chawla, Stan Z. Li

Protein-protein bindings play a key role in a variety of fundamental biological processes, and thus predicting the effects of amino acid mutations on protein-protein binding is crucial.

Deep Lead Optimization: Leveraging Generative AI for Structural Modification

no code implementations30 Apr 2024 Odin Zhang, Haitao Lin, HUI ZHANG, Huifeng Zhao, Yufei Huang, Yuansheng Huang, Dejun Jiang, Chang-Yu Hsieh, Peichen Pan, Tingjun Hou

Through this lens, de novo design can incorporate strategies from lead optimization to address the challenge of generating hard-to-synthesize molecules; inversely, lead optimization can benefit from the innovations in de novo design by approaching it as a task of generating molecules conditioned on certain substructures.

LongVQ: Long Sequence Modeling with Vector Quantization on Structured Memory

no code implementations17 Apr 2024 Zicheng Liu, Li Wang, Siyuan Li, Zedong Wang, Haitao Lin, Stan Z. Li

Transformer models have been successful in various sequence processing tasks, but the self-attention mechanism's computational cost limits its practicality for long sequences.

Computational Efficiency Language Modelling +1

Deep Geometry Handling and Fragment-wise Molecular 3D Graph Generation

no code implementations15 Mar 2024 Odin Zhang, Yufei Huang, Shichen Cheng, Mengyao Yu, Xujun Zhang, Haitao Lin, Yundian Zeng, Mingyang Wang, Zhenxing Wu, Huifeng Zhao, Zaixi Zhang, Chenqing Hua, Yu Kang, Sunliang Cui, Peichen Pan, Chang-Yu Hsieh, Tingjun Hou

Most earlier 3D structure-based molecular generation approaches follow an atom-wise paradigm, incrementally adding atoms to a partially built molecular fragment within protein pockets.

Graph Generation

A Teacher-Free Graph Knowledge Distillation Framework with Dual Self-Distillation

1 code implementation6 Mar 2024 Lirong Wu, Haitao Lin, Zhangyang Gao, Guojiang Zhao, Stan Z. Li

As a result, TGS enjoys the benefits of graph topology awareness in training but is free from data dependency in inference.

Knowledge Distillation

PPFlow: Target-aware Peptide Design with Torsional Flow Matching

1 code implementation5 Mar 2024 Haitao Lin, Odin Zhang, Huifeng Zhao, Dejun Jiang, Lirong Wu, Zicheng Liu, Yufei Huang, Stan Z. Li

Therapeutic peptides have proven to have great pharmaceutical value and potential in recent decades.

Drug Discovery

Decoupling Weighing and Selecting for Integrating Multiple Graph Pre-training Tasks

1 code implementation3 Mar 2024 Tianyu Fan, Lirong Wu, Yufei Huang, Haitao Lin, Cheng Tan, Zhangyang Gao, Stan Z. Li

In this paper, we identify two important collaborative processes for this topic: (1) select: how to select an optimal task combination from a given task pool based on their compatibility, and (2) weigh: how to weigh the selected tasks based on their importance.

Graph Representation Learning

MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding

1 code implementation22 Feb 2024 Lirong Wu, Yijun Tian, Yufei Huang, Siyuan Li, Haitao Lin, Nitesh V Chawla, Stan Z. Li

In addition, microenvironments defined in previous work are largely based on experimentally assayed physicochemical properties, for which the "vocabulary" is usually extremely small.

Computational Efficiency

Re-Dock: Towards Flexible and Realistic Molecular Docking with Diffusion Bridge

no code implementations18 Feb 2024 Yufei Huang, Odin Zhang, Lirong Wu, Cheng Tan, Haitao Lin, Zhangyang Gao, Siyuan Li, Stan. Z. Li

Accurate prediction of protein-ligand binding structures, a task known as molecular docking is crucial for drug design but remains challenging.

Molecular Docking

PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for Efficient and Generalizable Compound-Protein Interaction Prediction

1 code implementation13 Feb 2024 Lirong Wu, Yufei Huang, Cheng Tan, Zhangyang Gao, Bozhen Hu, Haitao Lin, Zicheng Liu, Stan Z. Li

Compound-Protein Interaction (CPI) prediction aims to predict the pattern and strength of compound-protein interactions for rational drug discovery.

Drug Discovery

Protein 3D Graph Structure Learning for Robust Structure-based Protein Property Prediction

no code implementations14 Oct 2023 Yufei Huang, Siyuan Li, Jin Su, Lirong Wu, Odin Zhang, Haitao Lin, Jingqi Qi, Zihan Liu, Zhangyang Gao, Yuyang Liu, Jiangbin Zheng, Stan. ZQ. Li

To study this problem, we identify a Protein 3D Graph Structure Learning Problem for Robust Protein Property Prediction (PGSL-RP3), collect benchmark datasets, and present a protein Structure embedding Alignment Optimization framework (SAO) to mitigate the problem of structure embedding bias between the predicted and experimental protein structures.

Graph structure learning Property Prediction +2

WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model

no code implementations30 Aug 2023 Tianyu Wang, YiFan Li, Haitao Lin, xiangyang xue, Yanwei Fu

The target instruction is then forwarded to a visual grounding system for object pose and size estimation, following which the robot grasps the object accordingly.

Language Modelling Large Language Model +3

CFSum: A Coarse-to-Fine Contribution Network for Multimodal Summarization

1 code implementation6 Jul 2023 Min Xiao, Junnan Zhu, Haitao Lin, Yu Zhou, Chengqing Zong

Therefore, we propose a novel Coarse-to-Fine contribution network for multimodal Summarization (CFSum) to consider different contributions of images for summarization.

Quantifying the Knowledge in GNNs for Reliable Distillation into MLPs

1 code implementation9 Jun 2023 Lirong Wu, Haitao Lin, Yufei Huang, Stan Z. Li

To bridge the gaps between topology-aware Graph Neural Networks (GNNs) and inference-efficient Multi-Layer Perceptron (MLPs), GLNN proposes to distill knowledge from a well-trained teacher GNN into a student MLP.

GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation

1 code implementation CVPR 2023 Jingyang Huo, Qiang Sun, Boyan Jiang, Haitao Lin, Yanwei Fu

Technically, we introduce a two-stage module that combine local slot attention and CLIP model to produce geometry-enhanced representation from such input.

Vision and Language Navigation

Extracting Low-/High- Frequency Knowledge from Graph Neural Networks and Injecting it into MLPs: An Effective GNN-to-MLP Distillation Framework

1 code implementation18 May 2023 Lirong Wu, Haitao Lin, Yufei Huang, Tianyu Fan, Stan Z. Li

Furthermore, we identified a potential information drowning problem for existing GNN-to-MLP distillation, i. e., the high-frequency knowledge of the pre-trained GNNs may be overwhelmed by the low-frequency knowledge during distillation; we have described in detail what it represents, how it arises, what impact it has, and how to deal with it.

Data-Efficient Protein 3D Geometric Pretraining via Refinement of Diffused Protein Structure Decoy

no code implementations5 Feb 2023 Yufei Huang, Lirong Wu, Haitao Lin, Jiangbin Zheng, Ge Wang, Stan Z. Li

Learning meaningful protein representation is important for a variety of biological downstream tasks such as structure-based drug design.

Diversity

A Survey on Protein Representation Learning: Retrospect and Prospect

1 code implementation31 Dec 2022 Lirong Wu, Yufei Huang, Haitao Lin, Stan Z. Li

To pave the way for AI researchers with little bioinformatics background, we present a timely and comprehensive review of PRL formulations and existing PRL methods from the perspective of model architectures, pretext tasks, and downstream applications.

Representation Learning

DiffBP: Generative Diffusion of 3D Molecules for Target Protein Binding

1 code implementation21 Nov 2022 Haitao Lin, Yufei Huang, Odin Zhang, Siqi Ma, Meng Liu, Xuanjing Li, Lirong Wu, Jishui Wang, Tingjun Hou, Stan Z. Li

Previous works usually generate atoms in an auto-regressive way, where element types and 3D coordinates of atoms are generated one by one.

Drug Discovery

MogaNet: Multi-order Gated Aggregation Network

6 code implementations7 Nov 2022 Siyuan Li, Zedong Wang, Zicheng Liu, Cheng Tan, Haitao Lin, Di wu, ZhiYuan Chen, Jiangbin Zheng, Stan Z. Li

Notably, MogaNet hits 80. 0\% and 87. 8\% accuracy with 5. 2M and 181M parameters on ImageNet-1K, outperforming ParC-Net and ConvNeXt-L, while saving 59\% FLOPs and 17M parameters, respectively.

3D Human Pose Estimation Image Classification +6

Teaching Yourself: Graph Self-Distillation on Neighborhood for Node Classification

no code implementations5 Oct 2022 Lirong Wu, Jun Xia, Haitao Lin, Zhangyang Gao, Zicheng Liu, Guojiang Zhao, Stan Z. Li

Despite their great academic success, Multi-Layer Perceptrons (MLPs) remain the primary workhorse for practical industrial applications.

Classification Node Classification

Exploring Generative Neural Temporal Point Process

1 code implementation3 Aug 2022 Haitao Lin, Lirong Wu, Guojiang Zhao, Pai Liu, Stan Z. Li

While lots of previous works have focused on `goodness-of-fit' of TPP models by maximizing the likelihood, their predictive performance is unsatisfactory, which means the timestamps generated by models are far apart from true observations.

Denoising

I Know What You Draw: Learning Grasp Detection Conditioned on a Few Freehand Sketches

no code implementations9 May 2022 Haitao Lin, Chilam Cheang, Yanwei Fu, xiangyang xue

The physical robot experiments confirm the utility of our method in object-cluttered scenes.

Learning 6-DoF Object Poses to Grasp Category-level Objects by Language Instructions

no code implementations9 May 2022 Chilam Cheang, Haitao Lin, Yanwei Fu, xiangyang xue

This paper studies the task of any objects grasping from the known categories by free-form language instructions.

Object Object Localization +1

STONet: A Neural-Operator-Driven Spatio-temporal Network

no code implementations18 Apr 2022 Haitao Lin, Guojiang Zhao, Lirong Wu, Stan Z. Li

Graph-based spatio-temporal neural networks are effective to model the spatial dependency among discrete points sampled irregularly from unstructured grids, thanks to the great expressiveness of graph neural networks.

Missing Values Time Series +1

An Empirical Study: Extensive Deep Temporal Point Process

1 code implementation19 Oct 2021 Haitao Lin, Cheng Tan, Lirong Wu, Zhangyang Gao, Stan. Z. Li

In this paper, we first review recent research emphasis and difficulties in modeling asynchronous event sequences with deep temporal point process, which can be concluded into four fields: encoding of history sequence, formulation of conditional intensity function, relational discovery of events and learning approaches for optimization.

Graph structure learning Variational Inference

Git: Clustering Based on Graph of Intensity Topology

3 code implementations4 Oct 2021 Zhangyang Gao, Haitao Lin, Cheng Tan, Lirong Wu, Stan. Z Li

\textbf{A}ccuracy, \textbf{R}obustness to noises and scales, \textbf{I}nterpretability, \textbf{S}peed, and \textbf{E}asy to use (ARISE) are crucial requirements of a good clustering algorithm.

Clustering Clustering Algorithms Evaluation

CSDS: A Fine-Grained Chinese Dataset for Customer Service Dialogue Summarization

2 code implementations EMNLP 2021 Haitao Lin, Liqun Ma, Junnan Zhu, Lu Xiang, Yu Zhou, Jiajun Zhang, Chengqing Zong

Therefore, in this paper, we introduce a novel Chinese dataset for Customer Service Dialogue Summarization (CSDS).

SAR-Net: Shape Alignment and Recovery Network for Category-level 6D Object Pose and Size Estimation

no code implementations CVPR 2022 Haitao Lin, Zichang Liu, Chilam Cheang, Yanwei Fu, Guodong Guo, xiangyang xue

The concatenation of the observed point cloud and symmetric one reconstructs a coarse object shape, thus facilitating object center (3D translation) and 3D size estimation.

Object Optical Character Recognition (OCR)

Self-supervised Learning on Graphs: Contrastive, Generative,or Predictive

1 code implementation16 May 2021 Lirong Wu, Haitao Lin, Zhangyang Gao, Cheng Tan, Stan. Z. Li

In this survey, we extend the concept of SSL, which first emerged in the fields of computer vision and natural language processing, to present a timely and comprehensive review of existing SSL techniques for graph data.

Self-Supervised Learning

Conditional Local Convolution for Spatio-temporal Meteorological Forecasting

1 code implementation4 Jan 2021 Haitao Lin, Zhangyang Gao, Yongjie Xu, Lirong Wu, Ling Li, Stan. Z. Li

We further propose the distance and orientation scaling terms to reduce the impacts of irregular spatial distribution.

Spatio-Temporal Forecasting Weather Forecasting

Towards Robust Graph Neural Networks against Label Noise

no code implementations1 Jan 2021 Jun Xia, Haitao Lin, Yongjie Xu, Lirong Wu, Zhangyang Gao, Siyuan Li, Stan Z. Li

A pseudo label is computed from the neighboring labels for each node in the training set using LP; meta learning is utilized to learn a proper aggregation of the original and pseudo label as the final label.

Attribute Learning with noisy labels +3

LookHops: light multi-order convolution and pooling for graph classification

no code implementations28 Dec 2020 Zhangyang Gao, Haitao Lin, Stan. Z Li

Convolution and pooling are the key operations to learn hierarchical representation for graph classification, where more expressive $k$-order($k>1$) method requires more computation cost, limiting the further applications.

General Classification Graph Classification

Invertible Manifold Learning for Dimension Reduction

1 code implementation7 Oct 2020 Siyuan Li, Haitao Lin, Zelin Zang, Lirong Wu, Jun Xia, Stan Z. Li

Dimension reduction (DR) aims to learn low-dimensional representations of high-dimensional data with the preservation of essential information.

Dimensionality Reduction

Clustering Based on Graph of Density Topology

1 code implementation24 Sep 2020 Zhangyang Gao, Haitao Lin, Stan Z. Li

GDT jointly considers the local and global structures of data samples: firstly forming local clusters based on a density growing process with a strategy for properly noise handling as well as cluster boundary detection; and then estimating a GDT from relationship between local clusters in terms of a connectivity measure, givingglobal topological graph.

Boundary Detection Clustering

Pulsars Detection by Machine Learning with Very Few Features

no code implementations20 Feb 2020 Haitao Lin, Xiangru Li, Ziying Luo

In this work, two feature selection algorithms ----\textit{Grid Search} (GS) and \textit{Recursive Feature Elimination} (RFE)---- are proposed to improve the detection performance by removing the redundant and irrelevant features.

BIG-bench Machine Learning feature selection

Cannot find the paper you are looking for? You can Submit a new open access paper.