Search Results for author: Stan Z. Li

Found 221 papers, 109 papers with code

Beyond 3DMM Space: Towards Fine-grained 3D Face Reconstruction

1 code implementation ECCV 2020 Xiangyu Zhu, Fan Yang, Di Huang, Chang Yu, Hao Wang, Jianzhu Guo, Zhen Lei, Stan Z. Li

However, most of their training data is constructed by 3D Morphable Model, whose space spanned is only a small part of the shape space.

3D Face Reconstruction

Unlasting: Unpaired Single-Cell Multi-Perturbation Estimation by Dual Conditional Diffusion Implicit Bridges

no code implementations26 Jun 2025 Changxi Chi, Jun Xia, Yufei Huang, Jingbo Zhou, Siyuan Li, Yunfan Liu, Chang Yu, Stan Z. Li

In addition, we introduce a biologically grounded evaluation metric that better reflects the inherent heterogeneity in single-cell responses.

Data Augmentation

AlphaFold Database Debiasing for Robust Inverse Folding

no code implementations10 Jun 2025 Cheng Tan, Zhenxiao Cao, Zhangyang Gao, Siyuan Li, Yufei Huang, Stan Z. Li

The AlphaFold Protein Structure Database (AFDB) offers unparalleled structural coverage at near-experimental accuracy, positioning it as a valuable resource for data-driven protein design.

Protein Design

PFMBench: Protein Foundation Model Benchmark

1 code implementation1 Jun 2025 Zhangyang Gao, Hao Wang, Cheng Tan, Chenrui Xu, Mengdi Liu, Bozhen Hu, Linlin Chao, XiaoMing Zhang, Stan Z. Li

To fill this gap, we present PFMBench, a comprehensive benchmark evaluating protein foundation models across 38 tasks spanning 8 key areas of protein science.

model

Tokenizing Electron Cloud in Protein-Ligand Interaction Learning

no code implementations25 May 2025 Haitao Lin, Odin Zhang, Jia Xu, Yunfan Liu, Zheng Cheng, Lirong Wu, Yufei Huang, Zhifeng Gao, Stan Z. Li

The affinity and specificity of protein-molecule binding directly impact functional outcomes, uncovering the mechanisms underlying biological regulation and signal transduction.

Knowledge Distillation Prediction +1

GRAPE: Heterogeneous Graph Representation Learning for Genetic Perturbation with Coding and Non-Coding Biotype

no code implementations6 May 2025 Changxi Chi, Jun Xia, Jingbo Zhou, Jiabei Cheng, Chang Yu, Stan Z. Li

We propose GRAPE, a heterogeneous graph neural network (HGNN) that leverages gene representations initialized with features from descriptions and sequences, models the distinct roles of genes with different biotypes, and dynamically refines the GRN through GSL.

Graph Neural Network Graph Representation Learning +2

Adversarial Curriculum Graph-Free Knowledge Distillation for Graph Neural Networks

no code implementations1 Apr 2025 Yuang Jia, Xiaojuan Shan, Jun Xia, Guancheng Wan, Yuchen Zhang, Wenke Huang, Mang Ye, Stan Z. Li

Data-free Knowledge Distillation (DFKD) is a method that constructs pseudo-samples using a generator without real data, and transfers knowledge from a teacher model to a student by enforcing the student to overcome dimensional differences and learn to mimic the teacher's outputs on these pseudo-samples.

Data-free Knowledge Distillation

Life-Code: Central Dogma Modeling with Multi-Omics Sequence Unification

no code implementations11 Feb 2025 Zicheng Liu, Siyuan Li, ZhiYuan Chen, Fang Wu, Chang Yu, Qirong Yang, Yucheng Guo, Yujie Yang, XiaoMing Zhang, Stan Z. Li

As for data flow, we propose a unified pipeline to integrate multi-omics data by reverse-transcribing RNA and reverse-translating amino acids into nucleotide-based sequences.

Knowledge Distillation

A Simple yet Effective DDG Predictor is An Unsupervised Antibody Optimizer and Explainer

1 code implementation10 Feb 2025 Lirong Wu, Yunfan Liu, Haitao Lin, Yufei Huang, Guojiang Zhao, Zhifeng Gao, Stan Z. Li

For the target antibody, we propose a novel Mutation Explainer to learn mutation preferences, which accounts for the marginal benefit of each mutation per residue.

G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models

no code implementations7 Feb 2025 Mengdi Liu, Zhangyang Gao, Hong Chang, Stan Z. Li, Shiguang Shan, Xinlin Chen

Discovering the genotype-phenotype relationship is crucial for genetic engineering, which will facilitate advances in fields such as crop breeding, conservation biology, and personalized medicine.

Conditional Image Generation Prediction

PhyloGen: Language Model-Enhanced Phylogenetic Inference via Graph Structure Generation

no code implementations25 Dec 2024 Chenrui Duan, Zelin Zang, Siyuan Li, Yongjie Xu, Stan Z. Li

Phylogenetic trees elucidate evolutionary relationships among species, but phylogenetic inference remains challenging due to the complexity of combining continuous (branch lengths) and discrete parameters (tree topology).

Language Modeling Language Modelling +1

Relation-Aware Equivariant Graph Networks for Epitope-Unknown Antibody Design and Specificity Optimization

no code implementations14 Dec 2024 Lirong Wu, Haitao Lin, Yufei Huang, Zhangyang Gao, Cheng Tan, Yunfan Liu, Tailin Wu, Stan Z. Li

Antibodies are Y-shaped proteins that protect the host by binding to specific antigens, and their binding is mainly determined by the Complementary Determining Regions (CDRs) in the antibody.

Relation Specificity

DapPep: Domain Adaptive Peptide-agnostic Learning for Universal T-cell Receptor-antigen Binding Affinity Prediction

no code implementations26 Nov 2024 Jiangbin Zheng, Qianhui Xu, Ruichen Xia, Stan Z. Li

Identifying T-cell receptors (TCRs) that interact with antigenic peptides provides the technical basis for developing vaccines and immunotherapies.

Language Modeling Protein Language Model +1

Pan-protein Design Learning Enables Task-adaptive Generalization for Low-resource Enzyme Design

no code implementations26 Nov 2024 Jiangbin Zheng, Ge Wang, Han Zhang, Stan Z. Li

Computational protein design (CPD) offers transformative potential for bioengineering, but current deep CPD models, focused on universal domains, struggle with function-specific designs.

Decoder Protein Design

Revisiting Marr in Face: The Building of 2D--2.5D--3D Representations in Deep Neural Networks

no code implementations25 Nov 2024 Xiangyu Zhu, Chang Yu, Jiankuo Zhao, Zhaoxiang Zhang, Stan Z. Li, Zhen Lei

By injecting graphics probes into neural networks, and analyzing their behavior in reconstructing images, we find that DNNs initially encode images as 2D representations in low-level layers, and finally construct 3D representations in high-level layers.

MeToken: Uniform Micro-environment Token Boosts Post-Translational Modification Prediction

1 code implementation4 Nov 2024 Cheng Tan, Zhenxiao Cao, Zhangyang Gao, Lirong Wu, Siyuan Li, Yufei Huang, Jun Xia, Bozhen Hu, Stan Z. Li

Post-translational modifications (PTMs) profoundly expand the complexity and functionality of the proteome, regulating protein attributes and interactions that are crucial for biological processes.

FlexMol: A Flexible Toolkit for Benchmarking Molecular Relational Learning

1 code implementation19 Oct 2024 Sizhe Liu, Jun Xia, Lecheng Zhang, Yuchen Liu, Yue Liu, Wenjie Du, Zhangyang Gao, Bozhen Hu, Cheng Tan, Hongxin Xiang, Stan Z. Li

Molecular relational learning (MRL) is crucial for understanding the interaction behaviors between molecular pairs, a critical aspect of drug discovery and development.

Benchmarking Drug Discovery +1

Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning

no code implementations8 Oct 2024 Siyuan Li, Juanxi Tian, Zedong Wang, Luyuan Zhang, Zicheng Liu, Weiyang Jin, Yang Liu, Baigui Sun, Stan Z. Li

This paper delves into the interplay between vision backbones and optimizers, unvealing an inter-dependent phenomenon termed \textit{\textbf{b}ackbone-\textbf{o}ptimizer \textbf{c}oupling \textbf{b}ias} (BOCB).

Representation Learning

A Review of BioTree Construction in the Context of Information Fusion: Priors, Methods, Applications and Trends

no code implementations7 Oct 2024 Zelin Zang, Yongjie Xu, Chenrui Duan, Yue Yuan, Jinlin Wu, Zhen Lei, Stan Z. Li

By addressing the challenges of data complexity and prior knowledge integration, this review aims to inspire interdisciplinary innovation at the intersection of biology and DL.

Deep Learning

Learning to Model Graph Structural Information on MLPs via Graph Structure Self-Contrasting

1 code implementation9 Sep 2024 Lirong Wu, Haitao Lin, Guojiang Zhao, Cheng Tan, Stan Z. Li

In this paper, we rethink the roles played by graph structural information in graph data training and identify that message passing is not the only path to modeling structural information.

A Survey on Mixup Augmentations and Beyond

1 code implementation8 Sep 2024 Xin Jin, Hongyu Zhu, Siyuan Li, Zedong Wang, Zicheng Liu, Chang Yu, Huafeng Qin, Stan Z. Li

As Deep Neural Networks have achieved thrilling breakthroughs in the past decade, data augmentations have garnered increasing attention as regularization techniques when massive labeled data are unavailable.

Image Classification Self-Supervised Learning +1

MetaEnzyme: Meta Pan-Enzyme Learning for Task-Adaptive Redesign

no code implementations5 Aug 2024 Jiangbin Zheng, Han Zhang, Qianqing Xu, An-Ping Zeng, Stan Z. Li

Enzyme design plays a crucial role in both industrial production and biology.

Teach Harder, Learn Poorer: Rethinking Hard Sample Distillation for GNN-to-MLP Knowledge Distillation

1 code implementation20 Jul 2024 Lirong Wu, Yunfan Liu, Haitao Lin, Yufei Huang, Stan Z. Li

To bridge the gaps between powerful Graph Neural Networks (GNNs) and lightweight Multi-Layer Perceptron (MLPs), GNN-to-MLP Knowledge Distillation (KD) proposes to distill knowledge from a well-trained teacher GNN into a student MLP.

Knowledge Distillation

The Heterophilic Graph Learning Handbook: Benchmarks, Models, Theoretical Analysis, Applications and Challenges

no code implementations12 Jul 2024 Sitao Luan, Chenqing Hua, Qincheng Lu, Liheng Ma, Lirong Wu, Xinyu Wang, Minkai Xu, Xiao-Wen Chang, Doina Precup, Rex Ying, Stan Z. Li, Jian Tang, Guy Wolf, Stefanie Jegelka

In this survey, we provide a comprehensive review of the latest progress on heterophilic graph learning, including an extensive summary of benchmark datasets and evaluation of homophily metrics on synthetic graphs, meticulous classification of the most updated supervised and unsupervised learning methods, thorough digestion of the theoretical analysis on homophily/heterophily, and broad exploration of the heterophily-related applications.

Graph Learning Graph Representation Learning

BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science

1 code implementation29 Jun 2024 Xinna Lin, Siqi Ma, Junjie Shan, Xiaojing Zhang, Shell Xu Hu, Tiannan Guo, Stan Z. Li, Kaicheng Yu

On the widely used popular knowledge graph, we discover over 90 factual errors which provide scenarios for agents to make discoveries and demonstrate the effectiveness of our approach.

AI Agent Claim Verification +5

NovoBench: Benchmarking Deep Learning-based De Novo Peptide Sequencing Methods in Proteomics

no code implementations16 Jun 2024 Jingbo Zhou, Shaorong Chen, Jun Xia, Sizhe Liu, Tianze Ling, Wenjie Du, Yue Liu, Jianwei Yin, Stan Z. Li

In this work, we present the first unified benchmark NovoBench for \emph{de novo} peptide sequencing, which comprises diverse mass spectrum data, integrated models, and comprehensive evaluation metrics.

Benchmarking de novo peptide sequencing

CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph

1 code implementation16 Jun 2024 Haitao Lin, Guojiang Zhao, Odin Zhang, Yufei Huang, Lirong Wu, Zicheng Liu, Siyuan Li, Cheng Tan, Zhifeng Gao, Stan Z. Li

To broaden the scope, we have adapted these models to a range of tasks essential in drug design, which are considered sub-tasks within the graph fill-in-the-blank tasks.

Drug Design Fairness

Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences

no code implementations12 Jun 2024 Zicheng Liu, Siyuan Li, Li Wang, Zedong Wang, Yunfan Liu, Stan Z. Li

To mitigate the computational complexity in the self-attention mechanism on long sequences, linear attention utilizes computation tricks to achieve linear complexity, while state space models (SSMs) popularize a favorable practice of using non-data-dependent memory pattern, i. e., emphasize the near and neglect the distant, to processing sequences.

Language Modeling Long-range modeling +1

FoldToken2: Learning compact, invariant and generative protein structure language

no code implementations11 Jun 2024 Zhangyang Gao, Cheng Tan, Stan Z. Li

The equivalent nature of 3D coordinates has posed long term challenges in protein structure representation learning, alignment, and generation.

Decoder Quantization +1

Set-CLIP: Exploring Aligned Semantic From Low-Alignment Multimodal Data Through A Distribution View

no code implementations9 Jun 2024 Zijia Song, Zelin Zang, Yelin Wang, Guozheng Yang, Kaicheng Yu, Wanyu Chen, Miaoyu Wang, Stan Z. Li

Multimodal fusion breaks through the boundaries between diverse modalities and has already achieved notable performances.

Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions

1 code implementation9 Jun 2024 Cheng Tan, Dongxin Lyu, Siyuan Li, Zhangyang Gao, Jingxuan Wei, Siqi Ma, Zicheng Liu, Stan Z. Li

Large Language Models (LLMs) have demonstrated wide-ranging applications across various fields and have shown significant potential in the academic peer-review process.

Review Generation

GenBench: A Benchmarking Suite for Systematic Evaluation of Genomic Foundation Models

1 code implementation1 Jun 2024 Zicheng Liu, Jiahui Li, Siyuan Li, Zelin Zang, Cheng Tan, Yufei Huang, Yajing Bai, Stan Z. Li

The Genomic Foundation Model (GFM) paradigm is expected to facilitate the extraction of generalizable representations from massive genomic data, thereby enabling their application across a spectrum of downstream applications.

Benchmarking

Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning

no code implementations31 May 2024 Cheng Tan, Jingxuan Wei, Linzhuang Sun, Zhangyang Gao, Siyuan Li, Bihui Yu, Ruifeng Guo, Stan Z. Li

Large language models equipped with retrieval-augmented generation (RAG) represent a burgeoning field aimed at enhancing answering capabilities by leveraging external knowledge bases.

Answer Generation Multimodal Reasoning +3

UniIF: Unified Molecule Inverse Folding

no code implementations29 May 2024 Zhangyang Gao, Jue Wang, Cheng Tan, Lirong Wu, Yufei Huang, Siyuan Li, Zhirui Ye, Stan Z. Li

We do such unification in two levels: 1) Data-Level: We propose a unified block graph data form for all molecules, including the local frame building and geometric feature initialization.

All Drug Discovery +1

Learning to Predict Mutation Effects of Protein-Protein Interactions by Microenvironment-aware Hierarchical Prompt Learning

1 code implementation16 May 2024 Lirong Wu, Yijun Tian, Haitao Lin, Yufei Huang, Siyuan Li, Nitesh V Chawla, Stan Z. Li

Protein-protein bindings play a key role in a variety of fundamental biological processes, and thus predicting the effects of amino acid mutations on protein-protein binding is crucial.

Prompt Learning

VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling

no code implementations13 May 2024 Siyuan Li, Zedong Wang, Zicheng Liu, Di wu, Cheng Tan, Jiangbin Zheng, Yufei Huang, Stan Z. Li

In this paper, we introduce VQDNA, a general-purpose framework that renovates genome tokenization from the perspective of genome vocabulary learning.

Quantization

LongVQ: Long Sequence Modeling with Vector Quantization on Structured Memory

no code implementations17 Apr 2024 Zicheng Liu, Li Wang, Siyuan Li, Zedong Wang, Haitao Lin, Stan Z. Li

Transformer models have been successful in various sequence processing tasks, but the self-attention mechanism's computational cost limits its practicality for long sequences.

Computational Efficiency Language Modeling +3

Advances of Deep Learning in Protein Science: A Comprehensive Survey

no code implementations8 Mar 2024 Bozhen Hu, Cheng Tan, Lirong Wu, Jiangbin Zheng, Jun Xia, Zhangyang Gao, Zicheng Liu, Fandi Wu, Guijun Zhang, Stan Z. Li

Protein representation learning plays a crucial role in understanding the structure and function of proteins, which are essential biomolecules involved in various biological processes.

Deep Learning Drug Discovery +4

A Teacher-Free Graph Knowledge Distillation Framework with Dual Self-Distillation

1 code implementation6 Mar 2024 Lirong Wu, Haitao Lin, Zhangyang Gao, Guojiang Zhao, Stan Z. Li

As a result, TGS enjoys the benefits of graph topology awareness in training but is free from data dependency in inference.

Knowledge Distillation

Decoupling Weighing and Selecting for Integrating Multiple Graph Pre-training Tasks

1 code implementation3 Mar 2024 Tianyu Fan, Lirong Wu, Yufei Huang, Haitao Lin, Cheng Tan, Zhangyang Gao, Stan Z. Li

In this paper, we identify two important collaborative processes for this topic: (1) select: how to select an optimal task combination from a given task pool based on their compatibility, and (2) weigh: how to weigh the selected tasks based on their importance.

Graph Representation Learning

FGBERT: Function-Driven Pre-trained Gene Language Model for Metagenomics

no code implementations24 Feb 2024 Chenrui Duan, Zelin Zang, Yongjie Xu, Hang He, Zihan Liu, Siyuan Li, Zijia Song, Ju-Sheng Zheng, Stan Z. Li

Metagenomic data, comprising mixed multi-species genomes, are prevalent in diverse environments like oceans and soils, significantly impacting human health and ecological functions.

Contrastive Learning Language Modeling +2

MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding

1 code implementation22 Feb 2024 Lirong Wu, Yijun Tian, Yufei Huang, Siyuan Li, Haitao Lin, Nitesh V Chawla, Stan Z. Li

In addition, microenvironments defined in previous work are largely based on experimentally assayed physicochemical properties, for which the "vocabulary" is usually extremely small.

Computational Efficiency Prediction

Switch EMA: A Free Lunch for Better Flatness and Sharpness

3 code implementations14 Feb 2024 Siyuan Li, Zicheng Liu, Juanxi Tian, Ge Wang, Zedong Wang, Weiyang Jin, Di wu, Cheng Tan, Tao Lin, Yang Liu, Baigui Sun, Stan Z. Li

Exponential Moving Average (EMA) is a widely used weight averaging (WA) regularization to learn flat optima for better generalizations without extra cost in deep neural network (DNN) optimization.

Attribute image-classification +9

PSC-CPI: Multi-Scale Protein Sequence-Structure Contrasting for Efficient and Generalizable Compound-Protein Interaction Prediction

1 code implementation13 Feb 2024 Lirong Wu, Yufei Huang, Cheng Tan, Zhangyang Gao, Bozhen Hu, Haitao Lin, Zicheng Liu, Stan Z. Li

Compound-Protein Interaction (CPI) prediction aims to predict the pattern and strength of compound-protein interactions for rational drug discovery.

Drug Discovery

A Graph is Worth $K$ Words: Euclideanizing Graph using Pure Transformer

1 code implementation4 Feb 2024 Zhangyang Gao, Daize Dong, Cheng Tan, Jun Xia, Bozhen Hu, Stan Z. Li

(4) The edge-centric pretraining framework GraphsGPT demonstrates its efficacy in graph domain tasks, excelling in both representation and generation.

Decoder Graph Classification +2

MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning

no code implementations CVPR 2024 Zhe Li, Laurence T. Yang, Bocheng Ren, Xin Nie, Zhangyang Gao, Cheng Tan, Stan Z. Li

The scarcity of annotated data has sparked significant interest in unsupervised pre-training methods that leverage medical reports as auxiliary signals for medical visual representation learning.

Contrastive Learning image-classification +6

Must: Maximizing Latent Capacity of Spatial Transcriptomics Data

1 code implementation15 Jan 2024 Zelin Zang, Liangyu Li, Yongjie Xu, Chenrui Duan, Kai Wang, Yang You, Yi Sun, Stan Z. Li

MuST integrates the multi-modality information contained in the ST data effectively into a uniform latent space to provide a foundation for all the downstream tasks.

Deep Manifold Transformation for Protein Representation Learning

no code implementations12 Jan 2024 Bozhen Hu, Zelin Zang, Cheng Tan, Stan Z. Li

Protein representation learning is critical in various tasks in biology, such as drug design and protein structure or function prediction, which has primarily benefited from protein language models and graph neural networks.

Drug Design Representation Learning

Deep Manifold Graph Auto-Encoder for Attributed Graph Embedding

no code implementations12 Jan 2024 Bozhen Hu, Zelin Zang, Jun Xia, Lirong Wu, Cheng Tan, Stan Z. Li

Representing graph data in a low-dimensional space for subsequent tasks is the purpose of attributed graph embedding.

Graph Embedding

Graph-level Protein Representation Learning by Structure Knowledge Refinement

no code implementations5 Jan 2024 Ge Wang, Zelin Zang, Jiangbin Zheng, Jun Xia, Stan Z. Li

The mainstream method is utilizing contrastive learning to facilitate graph feature extraction, known as Graph Contrastive Learning (GCL).

Contrastive Learning Property Prediction +1

General Point Model Pretraining with Autoencoding and Autoregressive

1 code implementation CVPR 2024 Zhe Li, Zhangyang Gao, Cheng Tan, Bocheng Ren, Laurence T. Yang, Stan Z. Li

Compared to models like Point-BERT MaskPoint and PointMAE our GPM achieves superior performance in point cloud understanding tasks.

Decoder Language Modeling +5

Masked Modeling for Self-supervised Representation Learning on Vision and Beyond

1 code implementation31 Dec 2023 Siyuan Li, Luyuan Zhang, Zedong Wang, Di wu, Lirong Wu, Zicheng Liu, Jun Xia, Cheng Tan, Yang Liu, Baigui Sun, Stan Z. Li

As the deep learning revolution marches on, self-supervised learning has garnered increasing attention in recent years thanks to its remarkable representation learning ability and the low dependence on labeled data.

Representation Learning Self-Supervised Learning +1

Progressive Multi-Modality Learning for Inverse Protein Folding

1 code implementation11 Dec 2023 Jiangbin Zheng, Stan Z. Li

While deep generative models show promise for learning inverse protein folding directly from data, the lack of publicly available structure-sequence pairings limits their generalization.

cross-modal alignment Data Augmentation +5

Efficiently Predicting Protein Stability Changes Upon Single-point Mutation with Large Language Models

no code implementations7 Dec 2023 Yijie Zhang, Zhangyang Gao, Cheng Tan, Stan Z. Li

Predicting protein stability changes induced by single-point mutations has been a persistent challenge over the years, attracting immense interest from numerous researchers.

Computational Efficiency

Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-Consistency Training

1 code implementation23 Nov 2023 Cheng Tan, Jingxuan Wei, Zhangyang Gao, Linzhuang Sun, Siyuan Li, Ruifeng Guo, Bihui Yu, Stan Z. Li

Remarkably, we show that even smaller base models, when equipped with our proposed approach, can achieve results comparable to those of larger models, illustrating the potential of our approach in harnessing the power of rationales for improved multimodal reasoning.

Multimodal Reasoning Science Question Answering +1

Segment Anything in Defect Detection

no code implementations17 Nov 2023 Bozhen Hu, Bin Gao, Cheng Tan, Tongle Wu, Stan Z. Li

Defect detection plays a crucial role in infrared non-destructive testing systems, offering non-contact, safe, and efficient inspection capabilities.

Defect Detection

General Point Model with Autoencoding and Autoregressive

no code implementations25 Oct 2023 Zhe Li, Zhangyang Gao, Cheng Tan, Stan Z. Li, Laurence T. Yang

This model is versatile, allowing fine-tuning for downstream point cloud representation tasks, as well as unconditional and conditional generation tasks.

Decoder Language Modeling +5

USTEP: Spatio-Temporal Predictive Learning under A Unified View

no code implementations9 Oct 2023 Cheng Tan, Jue Wang, Zhangyang Gao, Siyuan Li, Stan Z. Li

In this paper, we re-examine the two dominant temporal modeling approaches within the realm of spatio-temporal predictive learning, offering a unified perspective.

Self-Supervised Learning

CONVERT:Contrastive Graph Clustering with Reliable Augmentation

2 code implementations17 Aug 2023 Xihong Yang, Cheng Tan, Yue Liu, Ke Liang, Siwei Wang, Sihang Zhou, Jun Xia, Stan Z. Li, Xinwang Liu, En Zhu

To address these problems, we propose a novel CONtrastiVe Graph ClustEring network with Reliable AugmenTation (CONVERT).

Clustering Contrastive Learning +4

Reinforcement Graph Clustering with Unknown Cluster Number

2 code implementations13 Aug 2023 Yue Liu, Ke Liang, Jun Xia, Xihong Yang, Sihang Zhou, Meng Liu, Xinwang Liu, Stan Z. Li

To enable the deep graph clustering algorithms to work without the guidance of the predefined cluster number, we propose a new deep graph clustering method termed Reinforcement Graph Clustering (RGC).

Clustering Graph Clustering +1

Enhancing Human-like Multi-Modal Reasoning: A New Challenging Dataset and Comprehensive Framework

1 code implementation24 Jul 2023 Jingxuan Wei, Cheng Tan, Zhangyang Gao, Linzhuang Sun, Siyuan Li, Bihui Yu, Ruifeng Guo, Stan Z. Li

Multimodal reasoning is a critical component in the pursuit of artificial intelligence systems that exhibit human-like intelligence, especially when tackling complex tasks.

Contrastive Learning Multimodal Reasoning +2

Efficient Prediction of Peptide Self-assembly through Sequential and Graphical Encoding

1 code implementation17 Jul 2023 Zihan Liu, Jiaqi Wang, Yun Luo, Shuang Zhao, Wenbin Li, Stan Z. Li

In recent years, there has been an explosion of research on the application of deep learning to the prediction of various peptide properties, due to the significant development and market potential of peptides.

Benchmarking Deep Learning +1

Why Deep Models Often cannot Beat Non-deep Counterparts on Molecular Property Prediction?

no code implementations30 Jun 2023 Jun Xia, Lecheng Zhang, Xiao Zhu, Stan Z. Li

Molecular property prediction (MPP) is a crucial task in the drug discovery pipeline, which has recently gained considerable attention thanks to advances in deep neural networks.

Drug Discovery Molecular Property Prediction +1

OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

2 code implementations NeurIPS 2023 Cheng Tan, Siyuan Li, Zhangyang Gao, Wenfei Guan, Zedong Wang, Zicheng Liu, Lirong Wu, Stan Z. Li

Spatio-temporal predictive learning is a learning paradigm that enables models to learn spatial and temporal patterns by predicting future frames from given past frames in an unsupervised manner.

Weather Forecasting

Quantifying the Knowledge in GNNs for Reliable Distillation into MLPs

2 code implementations9 Jun 2023 Lirong Wu, Haitao Lin, Yufei Huang, Stan Z. Li

To bridge the gaps between topology-aware Graph Neural Networks (GNNs) and inference-efficient Multi-Layer Perceptron (MLPs), GLNN proposes to distill knowledge from a well-trained teacher GNN into a student MLP.

Dink-Net: Neural Clustering on Large Graphs

2 code implementations28 May 2023 Yue Liu, Ke Liang, Jun Xia, Sihang Zhou, Xihong Yang, Xinwang Liu, Stan Z. Li

Subsequently, the clustering distribution is optimized by minimizing the proposed cluster dilation loss and cluster shrink loss in an adversarial manner.

Clustering Graph Clustering +1

Knowledge-Design: Pushing the Limit of Protein Design via Knowledge Refinement

1 code implementation20 May 2023 Zhangyang Gao, Cheng Tan, Stan Z. Li

After witnessing the great success of pretrained models on diverse protein-related tasks and the fact that recovery is highly correlated with confidence, we wonder whether this knowledge can push the limits of protein design further.

Protein Design Retrieval +1

Extracting Low-/High- Frequency Knowledge from Graph Neural Networks and Injecting it into MLPs: An Effective GNN-to-MLP Distillation Framework

1 code implementation18 May 2023 Lirong Wu, Haitao Lin, Yufei Huang, Tianyu Fan, Stan Z. Li

Furthermore, we identified a potential information drowning problem for existing GNN-to-MLP distillation, i. e., the high-frequency knowledge of the pre-trained GNNs may be overwhelmed by the low-frequency knowledge during distillation; we have described in detail what it represents, how it arises, what impact it has, and how to deal with it.

Deep Graph Neural Networks via Posteriori-Sampling-based Node-Adaptive Residual Module

1 code implementation9 May 2023 Jingbo Zhou, Yixuan Du, Ruqiong Zhang, Jun Xia, Zhizhi Yu, Zelin Zang, Di Jin, Carl Yang, Rui Zhang, Stan Z. Li

Additionally, we reveal the drawbacks of previous residual methods, such as the lack of node adaptability and severe loss of high-order neighborhood subgraph information, and propose a \textbf{Posterior-Sampling-based, Node-Adaptive Residual module (PSNR)}.

Node Classification

High-dimensional Clustering onto Hamiltonian Cycle

no code implementations27 Apr 2023 Tianyi Huang, Shenghui Cheng, Stan Z. Li, Zhengjun Zhang

Then, the anchors of different clusters are sorted on the optimal Hamiltonian cycle generated by the cluster similarities and mapped on the circumference of a circle.

Clustering Deep Clustering +1

Cross-Gate MLP with Protein Complex Invariant Embedding is A One-Shot Antibody Designer

1 code implementation21 Apr 2023 Cheng Tan, Zhangyang Gao, Lirong Wu, Jun Xia, Jiangbin Zheng, Xihong Yang, Yue Liu, Bozhen Hu, Stan Z. Li

In this paper, we propose a \textit{simple yet effective} model that can co-design 1D sequences and 3D structures of CDRs in a one-shot manner.

Specificity

InstructBio: A Large-scale Semi-supervised Learning Paradigm for Biochemical Problems

1 code implementation8 Apr 2023 Fang Wu, Huiling Qin, Siyuan Li, Stan Z. Li, Xianyuan Zhan, Jinbo Xu

In the field of artificial intelligence for science, it is consistently an essential challenge to face a limited amount of labeled data for real-world problems.

molecular representation Representation Learning

Towards Reasonable Budget Allocation in Untargeted Graph Structure Attacks via Gradient Debias

1 code implementation29 Mar 2023 Zihan Liu, Yun Luo, Lirong Wu, Zicheng Liu, Stan Z. Li

It has become cognitive inertia to employ cross-entropy loss function in classification related tasks.

CCPL: Cross-modal Contrastive Protein Learning

no code implementations19 Mar 2023 Jiangbin Zheng, Stan Z. Li

Effective protein representation learning is crucial for predicting protein functions.

Language Modeling Masked Language Modeling +3

CVT-SLR: Contrastive Visual-Textual Transformation for Sign Language Recognition with Variational Alignment

1 code implementation CVPR 2023 Jiangbin Zheng, Yile Wang, Cheng Tan, Siyuan Li, Ge Wang, Jun Xia, Yidong Chen, Stan Z. Li

In this work, we propose a novel contrastive visual-textual transformation for SLR, CVT-SLR, to fully explore the pretrained knowledge of both the visual and language modalities.

cross-modal alignment Sign Language Recognition

PrefixMol: Target- and Chemistry-aware Molecule Design via Prefix Embedding

no code implementations14 Feb 2023 Zhangyang Gao, Yuqi Hu, Cheng Tan, Stan Z. Li

Is there a unified model for generating molecules considering different conditions, such as binding pockets and chemical properties?

Drug Design Multi-Task Learning

Improving (Dis)agreement Detection with Inductive Social Relation Information From Comment-Reply Interactions

1 code implementation8 Feb 2023 Yun Luo, Zihan Liu, Stan Z. Li, Yue Zhang

(Dis)agreement detection aims to identify the authors' attitudes or positions (\textit{{agree, disagree, neutral}}) towards a specific text.

Knowledge Graph Embedding Language Modelling +1

Characterization and Generation of 3D Realistic Geological Particles with Metaball Descriptor based on X-Ray Computed Tomography

no code implementations5 Feb 2023 Yifeng Zhao, Xiangbo Gao, Pei Zhang, Liang Lei, S. A. Galindo-Torres, Stan Z. Li

This algorithm can capture the main contour of parental particles with a series of non-overlapping spheres and refine surface-texture details through gradient search.

Data-Efficient Protein 3D Geometric Pretraining via Refinement of Diffused Protein Structure Decoy

no code implementations5 Feb 2023 Yufei Huang, Lirong Wu, Haitao Lin, Jiangbin Zheng, Ge Wang, Stan Z. Li

Learning meaningful protein representation is important for a variety of biological downstream tasks such as structure-based drug design.

Diversity Drug Design

RDesign: Hierarchical Data-efficient Representation Learning for Tertiary Structure-based RNA Design

1 code implementation25 Jan 2023 Cheng Tan, Yijie Zhang, Zhangyang Gao, Bozhen Hu, Siyuan Li, Zicheng Liu, Stan Z. Li

We crafted a large, well-curated benchmark dataset and designed a comprehensive structural modeling approach to represent the complex RNA tertiary structure.

Contrastive Learning Protein Design +2

DiffSDS: A language diffusion model for protein backbone inpainting under geometric conditions and constraints

1 code implementation22 Jan 2023 Zhangyang Gao, Cheng Tan, Stan Z. Li

Have you ever been troubled by the complexity and computational cost of SE(3) protein structure modeling and been amazed by the simplicity and power of language modeling?

Decoder Denoising +2

Rethinking Explaining Graph Neural Networks via Non-parametric Subgraph Matching

1 code implementation7 Jan 2023 Fang Wu, Siyuan Li, Xurui Jin, Yinghui Jiang, Dragomir Radev, Zhangming Niu, Stan Z. Li

It takes advantage of MatchExplainer to fix the most informative portion of the graph and merely operates graph augmentations on the rest less informative part.

Graph Sampling

Surveillance Face Anti-spoofing

no code implementations3 Jan 2023 Hao Fang, Ajian Liu, Jun Wan, Sergio Escalera, Chenxu Zhao, Xu Zhang, Stan Z. Li, Zhen Lei

In order to promote relevant research and fill this gap in the community, we collect a large-scale Surveillance High-Fidelity Mask (SuHiFiMask) dataset captured under 40 surveillance scenes, which has 101 subjects from different age groups with 232 3D attacks (high-fidelity masks), 200 2D attacks (posters, portraits, and screens), and 2 adversarial attacks.

Contrastive Learning Face Anti-Spoofing +2

A Survey on Protein Representation Learning: Retrospect and Prospect

1 code implementation31 Dec 2022 Lirong Wu, Yufei Huang, Haitao Lin, Stan Z. Li

To pave the way for AI researchers with little bioinformatics background, we present a timely and comprehensive review of PRL formulations and existing PRL methods from the perspective of model architectures, pretext tasks, and downstream applications.

Representation Learning Survey

Federated Learning for Inference at Anytime and Anywhere

no code implementations8 Dec 2022 Zicheng Liu, Da Li, Javier Fernandez-Marques, Stefanos Laskaridis, Yan Gao, Łukasz Dudziak, Stan Z. Li, Shell Xu Hu, Timothy Hospedales

Federated learning has been predominantly concerned with collaborative training of deep networks from scratch, and especially the many challenges that arise, such as communication cost, robustness to heterogeneous data, and support for diverse device capabilities.

Federated Learning

Integration of Pre-trained Protein Language Models into Geometric Deep Learning Networks

1 code implementation7 Dec 2022 Fang Wu, Lirong Wu, Dragomir Radev, Jinbo Xu, Stan Z. Li

Geometric deep learning has recently achieved great success in non-Euclidean domains, and learning on 3D structures of large biomolecules is emerging as a distinct research area.

Protein Interface Prediction Representation Learning

Deciphering RNA Secondary Structure Prediction: A Probabilistic K-Rook Matching Perspective

1 code implementation2 Dec 2022 Cheng Tan, Zhangyang Gao, Hanqun Cao, Xingran Chen, Ge Wang, Lirong Wu, Jun Xia, Jiangbin Zheng, Stan Z. Li

In this work, we reformulate the RNA secondary structure prediction as a K-Rook problem, thereby simplifying the prediction process into probabilistic matching within a finite solution space.

Prediction

Protein Language Models and Structure Prediction: Connection and Progression

1 code implementation30 Nov 2022 Bozhen Hu, Jun Xia, Jiangbin Zheng, Cheng Tan, Yufei Huang, Yongjie Xu, Stan Z. Li

The prediction of protein structures from sequences is an important task for function prediction, drug design, and related biological processes understanding.

Drug Design Prediction +3

SimVPv2: Towards Simple yet Powerful Spatiotemporal Predictive Learning

2 code implementations22 Nov 2022 Cheng Tan, Zhangyang Gao, Siyuan Li, Stan Z. Li

Recent years have witnessed remarkable advances in spatiotemporal predictive learning, with methods incorporating auxiliary inputs, complex neural architectures, and sophisticated training strategies.

Computational Efficiency Video Prediction

DiffBP: Generative Diffusion of 3D Molecules for Target Protein Binding

1 code implementation21 Nov 2022 Haitao Lin, Yufei Huang, Odin Zhang, Siqi Ma, Meng Liu, Xuanjing Li, Lirong Wu, Jishui Wang, Tingjun Hou, Stan Z. Li

Previous works usually generate atoms in an auto-regressive way, where element types and 3D coordinates of atoms are generated one by one.

Drug Discovery

EVNet: An Explainable Deep Network for Dimension Reduction

1 code implementation21 Nov 2022 Zelin Zang, Shenghui Cheng, Linyan Lu, Hanchen Xia, Liangyu Li, Yaoting Sun, Yongjie Xu, Lei Shang, Baigui Sun, Stan Z. Li

The proposed techniques are integrated with a visual interface to help the user to adjust EVNet to achieve better DR performance and explainability.

Data Augmentation Dimensionality Reduction

MogaNet: Multi-order Gated Aggregation Network

7 code implementations7 Nov 2022 Siyuan Li, Zedong Wang, Zicheng Liu, Cheng Tan, Haitao Lin, Di wu, ZhiYuan Chen, Jiangbin Zheng, Stan Z. Li

Notably, MogaNet hits 80. 0\% and 87. 8\% accuracy with 5. 2M and 181M parameters on ImageNet-1K, outperforming ParC-Net and ConvNeXt-L, while saving 59\% FLOPs and 17M parameters, respectively.

3D Human Pose Estimation Image Classification +6

Leveraging Graph-based Cross-modal Information Fusion for Neural Sign Language Translation

no code implementations1 Nov 2022 Jiangbin Zheng, Siyuan Li, Cheng Tan, Chong Wu, Yidong Chen, Stan Z. Li

Therefore, we propose to introduce additional word-level semantic knowledge of sign language linguistics to assist in improving current end-to-end neural SLT models.

Sign Language Translation Translation

Using Context-to-Vector with Graph Retrofitting to Improve Word Embeddings

1 code implementation ACL 2022 Jiangbin Zheng, Yile Wang, Ge Wang, Jun Xia, Yufei Huang, Guojiang Zhao, Yue Zhang, Stan Z. Li

Although contextualized embeddings generated from large-scale pre-trained models perform well in many tasks, traditional static embeddings (e. g., Skip-gram, Word2Vec) still play an important role in low-resource and lightweight settings due to their low computational cost, ease of deployment, and stability.

Word Embeddings Word Similarity

A Systematic Survey of Chemical Pre-trained Models

2 code implementations29 Oct 2022 Jun Xia, Yanqiao Zhu, Yuanqi Du, Stan Z. Li

Deep learning has achieved remarkable success in learning representations for molecules, which is crucial for various biochemical applications, ranging from property prediction to drug design.

Drug Design molecular representation +2

Teaching Yourself: Graph Self-Distillation on Neighborhood for Node Classification

no code implementations5 Oct 2022 Lirong Wu, Jun Xia, Haitao Lin, Zhangyang Gao, Zicheng Liu, Guojiang Zhao, Stan Z. Li

Despite their great academic success, Multi-Layer Perceptrons (MLPs) remain the primary workhorse for practical industrial applications.

Classification Node Classification

PiFold: Toward effective and efficient protein inverse folding

1 code implementation22 Sep 2022 Zhangyang Gao, Cheng Tan, Pablo Chacón, Stan Z. Li

How can we design protein sequences folding into the desired structures effectively and efficiently?

Decoder Protein Design

OpenMixup: Open Mixup Toolbox and Benchmark for Visual Representation Learning

1 code implementation11 Sep 2022 Siyuan Li, Zedong Wang, Zicheng Liu, Juanxi Tian, Di wu, Cheng Tan, Weiyang Jin, Stan Z. Li

Mixup augmentation has emerged as a widely used technique for improving the generalization ability of deep neural networks (DNNs).

Benchmarking Classification +3

A Survey on Generative Diffusion Model

1 code implementation6 Sep 2022 Hanqun Cao, Cheng Tan, Zhangyang Gao, Yilun Xu, Guangyong Chen, Pheng-Ann Heng, Stan Z. Li

Deep generative models are a prominent approach for data generation, and have been used to produce high quality samples in various domains.

Dimensionality Reduction model +1

What Does the Gradient Tell When Attacking the Graph Structure

no code implementations26 Aug 2022 Zihan Liu, Ge Wang, Yun Luo, Stan Z. Li

To address this issue, we propose a novel surrogate model with multi-level propagation that preserves the node dissimilarity information.

Are Gradients on Graph Structure Reliable in Gray-box Attacks?

1 code implementation7 Aug 2022 Zihan Liu, Yun Luo, Lirong Wu, Siyuan Li, Zicheng Liu, Stan Z. Li

These errors arise from rough gradient usage due to the discreteness of the graph structure and from the unreliability in the meta-gradient on the graph structure.

Computational Efficiency

Exploring Generative Neural Temporal Point Process

2 code implementations3 Aug 2022 Haitao Lin, Lirong Wu, Guojiang Zhao, Pai Liu, Stan Z. Li

While lots of previous works have focused on `goodness-of-fit' of TPP models by maximizing the likelihood, their predictive performance is unsatisfactory, which means the timestamps generated by models are far apart from true observations.

Denoising

UDRN: Unified Dimensional Reduction Neural Network for Feature Selection and Feature Projection

no code implementations8 Jul 2022 Zelin Zang, Yongjie Xu, Linyan Lu, Yulan Geng, Senqiao Yang, Stan Z. Li

We propose that the ideal DR approach combines both FS and FP into a unified end-to-end manifold learning framework, simultaneously performing fundamental feature discovery while maintaining the intrinsic relationships between data samples in the latent space.

Data Augmentation Dimensionality Reduction +1

DLME: Deep Local-flatness Manifold Embedding

2 code implementations7 Jul 2022 Zelin Zang, Siyuan Li, Di wu, Ge Wang, Lei Shang, Baigui Sun, Hao Li, Stan Z. Li

To overcome the underconstrained embedding problem, we design a loss and theoretically demonstrate that it leads to a more suitable embedding based on the local flatness.

Contrastive Learning Data Augmentation +1

CoSP: Co-supervised pretraining of pocket and ligand

no code implementations23 Jun 2022 Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li

Can we inject the pocket-ligand interaction knowledge into the pre-trained model and jointly learn their chemical space?

Contrastive Learning Specificity

SimVP: Simpler yet Better Video Prediction

3 code implementations CVPR 2022 Zhangyang Gao, Cheng Tan, Lirong Wu, Stan Z. Li

From CNN, RNN, to ViT, we have witnessed remarkable advancements in video prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated training strategies.

Prediction Video Prediction

Hyperspherical Consistency Regularization

5 code implementations CVPR 2022 Cheng Tan, Zhangyang Gao, Lirong Wu, Siyuan Li, Stan Z. Li

Though it benefits from taking advantage of both feature-dependent information from self-supervised learning and label-dependent information from supervised learning, this scheme remains suffering from bias of the classifier.

Contrastive Learning Self-Supervised Learning +1

Discovering and Explaining the Representation Bottleneck of Graph Neural Networks from Multi-order Interactions

1 code implementation15 May 2022 Fang Wu, Siyuan Li, Lirong Wu, Dragomir Radev, Stan Z. Li

Graph neural networks (GNNs) mainly rely on the message-passing paradigm to propagate node features and build interactions, and different graph learning tasks require different ranges of node interactions.

graph construction Graph Learning +2

MVP-Human Dataset for 3D Human Avatar Reconstruction from Unconstrained Frames

1 code implementation24 Apr 2022 Xiangyu Zhu, Tingting Liao, Jiangjing Lyu, Xiang Yan, Yunfeng Wang, Kan Guo, Qiong Cao, Stan Z. Li, Zhen Lei

In this paper, we consider a novel problem of reconstructing a 3D human avatar from multiple unconstrained frames, independent of assumptions on camera calibration, capture space, and constrained actions.

Camera Calibration

Generative De Novo Protein Design with Global Context

1 code implementation21 Apr 2022 Cheng Tan, Zhangyang Gao, Jun Xia, Bozhen Hu, Stan Z. Li

Thus, we propose the Global-Context Aware generative de novo protein design method (GCA), consisting of local and global modules.

Protein Design Protein Structure Prediction

DiffMD: A Geometric Diffusion Model for Molecular Dynamics Simulations

no code implementations19 Apr 2022 Fang Wu, Stan Z. Li

To waive this requirement, we propose a novel model called DiffMD by directly estimating the gradient of the log density of molecular conformations.

Denoising Drug Discovery

STONet: A Neural-Operator-Driven Spatio-temporal Network

no code implementations18 Apr 2022 Haitao Lin, Guojiang Zhao, Lirong Wu, Stan Z. Li

Graph-based spatio-temporal neural networks are effective to model the spatial dependency among discrete points sampled irregularly from unstructured grids, thanks to the great expressiveness of graph neural networks.

Missing Values Time Series +1

Harnessing Hard Mixed Samples with Decoupled Regularizer

1 code implementation NeurIPS 2023 Zicheng Liu, Siyuan Li, Ge Wang, Cheng Tan, Lirong Wu, Stan Z. Li

However, we found that the extra optimizing step may be redundant because label-mismatched mixed samples are informative hard mixed samples for deep models to localize discriminative features.

Data Augmentation

A Survey of Pretraining on Graphs: Taxonomy, Methods, and Applications

3 code implementations16 Feb 2022 Jun Xia, Yanqiao Zhu, Yuanqi Du, Stan Z. Li

Pretrained Language Models (PLMs) such as BERT have revolutionized the landscape of Natural Language Processing (NLP).

Drug Discovery Graph Representation Learning +1

Target-aware Molecular Graph Generation

no code implementations10 Feb 2022 Cheng Tan, Zhangyang Gao, Stan Z. Li

Building on the recent advantages of flow-based molecular generation models, we propose SiamFlow, which forces the flow to fit the distribution of target sequence embeddings in latent space.

Drug Discovery Graph Generation +1

SimGRACE: A Simple Framework for Graph Contrastive Learning without Data Augmentation

1 code implementation7 Feb 2022 Jun Xia, Lirong Wu, Jintao Chen, Bozhen Hu, Stan Z. Li

Furthermore, we devise adversarial training scheme, dubbed \textbf{AT-SimGRACE}, to enhance the robustness of graph contrastive learning and theoretically explain the reasons.

Contrastive Learning Data Augmentation +1

AlphaDesign: A graph protein design method and benchmark on AlphaFoldDB

1 code implementation1 Feb 2022 Zhangyang Gao, Cheng Tan, Stan Z. Li

While DeepMind has tentatively solved protein folding, its inverse problem -- protein design which predicts protein sequences from their 3D structures -- still faces significant challenges.

Decoder Protein Design +1

GenURL: A General Framework for Unsupervised Representation Learning

1 code implementation27 Oct 2021 Siyuan Li, Zicheng Liu, Zelin Zang, Di wu, ZhiYuan Chen, Stan Z. Li

For example, dimension reduction methods, t-SNE, and UMAP optimize pair-wise data relationships by preserving the global geometric structure, while self-supervised learning, SimCLR, and BYOL focus on mining the local statistics of instances under specific augmentations.

Contrastive Learning Dimensionality Reduction +4

Surrogate Representation Learning with Isometric Mapping for Gray-box Graph Adversarial Attacks

no code implementations20 Oct 2021 Zihan Liu, Yun Luo, Zelin Zang, Stan Z. Li

Gray-box graph attacks aim at disrupting the performance of the victim model by using inconspicuous attacks with limited knowledge of the victim model.

Node Classification Representation Learning

ProGCL: Rethinking Hard Negative Mining in Graph Contrastive Learning

1 code implementation5 Oct 2021 Jun Xia, Lirong Wu, Ge Wang, Jintao Chen, Stan Z. Li

Contrastive Learning (CL) has emerged as a dominant technique for unsupervised representation learning which embeds augmented versions of the anchor close to each other (positive samples) and pushes the embeddings of other samples (negatives) apart.

Contrastive Learning Representation Learning

Beyond Message Passing Paradigm: Training Graph Data with Consistency Constraints

no code implementations29 Sep 2021 Lirong Wu, Stan Z. Li

Specifically, the GCL framework is optimized with three well-designed consistency constraints: neighborhood consistency, label consistency, and class-center consistency.

Improving Discriminative Visual Representation Learning via Automatic Mixup

no code implementations29 Sep 2021 Siyuan Li, Zicheng Liu, Di wu, Stan Z. Li

In this paper, we decompose mixup into two sub-tasks of mixup generation and classification and formulate it for discriminative representations as class- and instance-level mixup.

Data Augmentation Representation Learning

Co-learning: Learning from Noisy Labels with Self-supervision

1 code implementation5 Aug 2021 Cheng Tan, Jun Xia, Lirong Wu, Stan Z. Li

Noisy labels, resulting from mistakes in manual labeling or webly data collecting for supervised learning, can cause neural networks to overfit the misleading information and degrade the generalization performance.

Learning with noisy labels Self-Supervised Learning

A Data-driven feature selection and machine-learning model benchmark for the prediction of longitudinal dispersion coefficient

no code implementations16 Jul 2021 Yifeng Zhao, Pei Zhang, S. A. Galindo-Torres, Stan Z. Li

Then, a global optimal feature set (the channel width, the flow velocity, the channel slope and the cross sectional area) was proposed through numerical comparison of the distilled local optimums in performance with representative ML models.

Ensemble Learning feature selection

Unsupervised Deep Manifold Attributed Graph Embedding

1 code implementation27 Apr 2021 Zelin Zang, Siyuan Li, Di wu, Jianzhu Guo, Yongjie Xu, Stan Z. Li

Unsupervised attributed graph representation learning is challenging since both structural and feature information are required to be represented in the latent space.

Clustering Graph Embedding +3

AutoMix: Unveiling the Power of Mixup for Stronger Classifiers

3 code implementations24 Mar 2021 Zicheng Liu, Siyuan Li, Di wu, Zihan Liu, ZhiYuan Chen, Lirong Wu, Stan Z. Li

Specifically, AutoMix reformulates the mixup classification into two sub-tasks (i. e., mixed sample generation and mixup classification) with corresponding sub-networks and solves them in a bi-level optimization framework.

Classification Data Augmentation +3

Towards Robust Graph Neural Networks against Label Noise

no code implementations1 Jan 2021 Jun Xia, Haitao Lin, Yongjie Xu, Lirong Wu, Zhangyang Gao, Siyuan Li, Stan Z. Li

A pseudo label is computed from the neighboring labels for each node in the training set using LP; meta learning is utilized to learn a proper aggregation of the original and pseudo label as the final label.

Attribute Learning with noisy labels +3

Deep Manifold Computing and Visualization Using Elastic Locally Isometric Smoothness

no code implementations1 Jan 2021 Stan Z. Li, Zelin Zang, Lirong Wu

The ability to preserve local geometry of highly nonlinear manifolds in high dimensional spaces and properly unfold them into lower dimensional hyperplanes is the key to the success of manifold computing, nonlinear dimensionality reduction (NLDR) and visualization.

Dimensionality Reduction

Consistent Representation Learning for High Dimensional Data Analysis

no code implementations1 Dec 2020 Stan Z. Li, Lirong Wu, Zelin Zang

In this paper, we propose a novel neural network-based method, called Consistent Representation Learning (CRL), to accomplish the three associated tasks end-to-end and improve the consistencies.

Clustering Dimensionality Reduction +2

Face Forgery Detection by 3D Decomposition

no code implementations CVPR 2021 Xiangyu Zhu, Hao Wang, Hongyan Fei, Zhen Lei, Stan Z. Li

Detecting digital face manipulation has attracted extensive attention due to fake media's potential harms to the public.

3D geometry

Deep Manifold Transformation for Nonlinear Dimensionality Reduction

no code implementations28 Oct 2020 Stan Z. Li, Zelin Zang, Lirong Wu

The LGP constraints constitute the loss for deep manifold learning and serve as geometric regularizers for NLDR network training.

Dimensionality Reduction

Invertible Manifold Learning for Dimension Reduction

1 code implementation7 Oct 2020 Siyuan Li, Haitao Lin, Zelin Zang, Lirong Wu, Jun Xia, Stan Z. Li

Dimension reduction (DR) aims to learn low-dimensional representations of high-dimensional data with the preservation of essential information.

Dimensionality Reduction

Deep Clustering and Representation Learning that Preserves Geometric Structures

no code implementations28 Sep 2020 Lirong Wu, Zicheng Liu, Zelin Zang, Jun Xia, Siyuan Li, Stan Z. Li

To overcome the problem that clusteringoriented losses may deteriorate the geometric structure of embeddings in the latent space, an isometric loss is proposed for preserving intra-manifold structure locally and a ranking loss for inter-manifold structure globally.

Clustering Deep Clustering +1

Clustering Based on Graph of Density Topology

1 code implementation24 Sep 2020 Zhangyang Gao, Haitao Lin, Stan Z. Li

GDT jointly considers the local and global structures of data samples: firstly forming local clusters based on a density growing process with a strategy for properly noise handling as well as cluster boundary detection; and then estimating a GDT from relationship between local clusters in terms of a connectivity measure, givingglobal topological graph.

Boundary Detection Clustering

Generalized Clustering and Multi-Manifold Learning with Geometric Structure Preservation

1 code implementation21 Sep 2020 Lirong Wu, Zicheng Liu, Zelin Zang, Jun Xia, Siyuan Li, Stan Z. Li

Though manifold-based clustering has become a popular research topic, we observe that one important factor has been omitted by these works, namely that the defined clustering loss may corrupt the local and global structure of the latent space.

Clustering Deep Clustering +1

Towards Fast, Accurate and Stable 3D Dense Face Alignment

3 code implementations ECCV 2020 Jianzhu Guo, Xiangyu Zhu, Yang Yang, Fan Yang, Zhen Lei, Stan Z. Li

Firstly, on the basis of a lightweight backbone, we propose a meta-joint optimization strategy to dynamically regress a small set of 3DMM parameters, which greatly enhances speed and accuracy simultaneously.

 Ranked #1 on 3D Face Reconstruction on Florence (Mean NME metric)

3D Face Modelling 3D Face Reconstruction +2

SADet: Learning An Efficient and Accurate Pedestrian Detector

no code implementations26 Jul 2020 Chubin Zhuang, Zhen Lei, Stan Z. Li

Although the anchor-based detectors have taken a big step forward in pedestrian detection, the overall performance of algorithm still needs further improvement for practical applications, \emph{e. g.}, a good trade-off between the accuracy and efficiency.

Human Detection Pedestrian Detection +2

Markov-Lipschitz Deep Learning

2 code implementations15 Jun 2020 Stan Z. Li, Zelin Zang, Lirong Wu

We propose a novel framework, called Markov-Lipschitz deep learning (MLDL), to tackle geometric deterioration caused by collapse, twisting, or crossing in vector-based neural network transformations for manifold-based representation learning and manifold data generation.

Deep Learning Dimensionality Reduction +2

Cross-ethnicity Face Anti-spoofing Recognition Challenge: A Review

no code implementations23 Apr 2020 Ajian Liu, Xuan Li, Jun Wan, Sergio Escalera, Hugo Jair Escalante, Meysam Madadi, Yi Jin, Zhuoyuan Wu, Xiaogang Yu, Zichang Tan, Qi Yuan, Ruikun Yang, Benjia Zhou, Guodong Guo, Stan Z. Li

Although ethnic bias has been verified to severely affect the performance of face recognition systems, it still remains an open research problem in face anti-spoofing.

Face Anti-Spoofing Face Recognition

Learning Meta Face Recognition in Unseen Domains

8 code implementations CVPR 2020 Jianzhu Guo, Xiangyu Zhu, Chenxu Zhao, Dong Cao, Zhen Lei, Stan Z. Li

Face recognition systems are usually faced with unseen domains in real-world applications and show unsatisfactory performance due to their poor generalization.

Face Recognition Meta-Learning

CASIA-SURF CeFA: A Benchmark for Multi-modal Cross-ethnicity Face Anti-spoofing

no code implementations11 Mar 2020 Ajian Li, Zichang Tan, Xuan Li, Jun Wan, Sergio Escalera, Guodong Guo, Stan Z. Li

Ethnic bias has proven to negatively affect the performance of face recognition systems, and it remains an open research problem in face anti-spoofing.

Face Anti-Spoofing Face Recognition

Static and Dynamic Fusion for Multi-modal Cross-ethnicity Face Anti-spoofing

no code implementations5 Dec 2019 Ajian Liu, Zichang Tan, Xuan Li, Jun Wan, Sergio Escalera, Guodong Guo, Stan Z. Li

Regardless of the usage of deep learning and handcrafted methods, the dynamic information from videos and the effect of cross-ethnicity are rarely considered in face anti-spoofing.

Face Anti-Spoofing

Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection

13 code implementations CVPR 2020 Shifeng Zhang, Cheng Chi, Yongqiang Yao, Zhen Lei, Stan Z. Li

In this paper, we first point out that the essential difference between anchor-based and anchor-free detection is actually how to define positive and negative training samples, which leads to the performance gap between them.

Object object-detection +1

WiderPerson: A Diverse Dataset for Dense Pedestrian Detection in the Wild

no code implementations25 Sep 2019 Shifeng Zhang, Yiliang Xie, Jun Wan, Hansheng Xia, Stan Z. Li, Guodong Guo

To narrow this gap and facilitate future pedestrian detection research, we introduce a large and diverse dataset named WiderPerson for dense pedestrian detection in the wild.

Ranked #3 on Object Detection on WiderPerson (mMR metric)

Diversity Object Detection +1

Relational Learning for Joint Head and Human Detection

1 code implementation24 Sep 2019 Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

Head and human detection have been rapidly improved with the development of deep convolutional neural networks.

Head Detection Human Detection +1

PedHunter: Occlusion Robust Pedestrian Detector in Crowded Scenes

no code implementations15 Sep 2019 Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

Pedestrian detection in crowded scenes is a challenging problem, because occlusion happens frequently among different pedestrians.

Data Augmentation Occlusion Handling +2

RefineFace: Refinement Neural Network for High Performance Face Detection

no code implementations10 Sep 2019 Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li

To improve the classification ability for high recall efficiency, STC first filters out most simple negatives from low level detection layers to reduce search space for subsequent classifier, then SML is applied to better distinguish faces from background at various scales and FSM is introduced to let the backbone learn more discriminative features for classification.

Classification Face Detection +3

CASIA-SURF: A Large-scale Multi-modal Benchmark for Face Anti-spoofing

no code implementations28 Aug 2019 Shifeng Zhang, Ajian Liu, Jun Wan, Yanyan Liang, Guogong Guo, Sergio Escalera, Hugo Jair Escalante, Stan Z. Li

To facilitate face anti-spoofing research, we introduce a large-scale multi-modal dataset, namely CASIA-SURF, which is the largest publicly available dataset for face anti-spoofing in terms of both subjects and modalities.

Face Anti-Spoofing Face Recognition

ChaLearn Looking at People: IsoGD and ConGD Large-scale RGB-D Gesture Recognition

no code implementations29 Jul 2019 Jun Wan, Chi Lin, Longyin Wen, Yunan Li, Qiguang Miao, Sergio Escalera, Gholamreza Anbarjafari, Isabelle Guyon, Guodong Guo, Stan Z. Li

The ChaLearn large-scale gesture recognition challenge has been run twice in two workshops in conjunction with the International Conference on Pattern Recognition (ICPR) 2016 and International Conference on Computer Vision (ICCV) 2017, attracting more than $200$ teams round the world.

Gesture Recognition

Improved Selective Refinement Network for Face Detection

no code implementations20 Jan 2019 Shifeng Zhang, Rui Zhu, Xiaobo Wang, Hailin Shi, Tianyu Fu, Shuo Wang, Tao Mei, Stan Z. Li

With the availability of face detection benchmark WIDER FACE dataset, much of the progresses have been made by various algorithms in recent years.

Data Augmentation Face Detection +1

Improving Face Anti-Spoofing by 3D Virtual Synthesis

3 code implementations2 Jan 2019 Jianzhu Guo, Xiangyu Zhu, Jinchuan Xiao, Zhen Lei, Genxun Wan, Stan Z. Li

Specifically, we consider a printed photo as a flat surface and mesh it into a 3D object, which is then randomly bent and rotated in 3D space.

Face Anti-Spoofing Face Recognition

A Dataset and Benchmark for Large-scale Multi-modal Face Anti-spoofing

2 code implementations CVPR 2019 Shifeng Zhang, Xiaobo Wang, Ajian Liu, Chenxu Zhao, Jun Wan, Sergio Escalera, Hailin Shi, Zezheng Wang, Stan Z. Li

To facilitate face anti-spoofing research, we introduce a large-scale multi-modal dataset, namely CASIA-SURF, which is the largest publicly available dataset for face anti-spoofing in terms of both subjects and visual modalities.

Face Anti-Spoofing Face Recognition

Selective Refinement Network for High Performance Face Detection

3 code implementations7 Sep 2018 Cheng Chi, Shifeng Zhang, Junliang Xing, Zhen Lei, Stan Z. Li, Xudong Zou

In particular, the SRN consists of two modules: the Selective Two-step Classification (STC) module and the Selective Two-step Regression (STR) module.

Face Detection General Classification +2

Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd

no code implementations ECCV 2018 Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li

Pedestrian detection in crowded scenes is a challenging problem since the pedestrians often gather together and occlude each other.

Ranked #10 on Pedestrian Detection on Caltech (using extra training data)

Pedestrian Detection

Large-scale Bisample Learning on ID Versus Spot Face Recognition

no code implementations8 Jun 2018 Xiangyu Zhu, Hao liu, Zhen Lei, Hailin Shi, Fan Yang, Dong Yi, Guo-Jun Qi, Stan Z. Li

In this paper, we propose a deep learning based large-scale bisample learning (LBL) method for IvS face recognition.

Face Recognition General Classification

Face Synthesis for Eyeglass-Robust Face Recognition

1 code implementation4 Jun 2018 Jianzhu Guo, Xiangyu Zhu, Zhen Lei, Stan Z. Li

A feasible method is to collect large-scale face images with eyeglasses for training deep learning methods.

Deep Learning Face Generation +3

Face Alignment in Full Pose Range: A 3D Total Solution

2 code implementations2 Apr 2018 Xiangyu Zhu, Xiaoming Liu, Zhen Lei, Stan Z. Li

In this paper, we propose to tackle these three challenges in an new alignment framework termed 3D Dense Face Alignment (3DDFA), in which a dense 3D Morphable Model (3DMM) is fitted to the image via Cascaded Convolutional Neural Networks.

3D Pose Estimation Depth Image Estimation +3

Single-Shot Refinement Neural Network for Object Detection

16 code implementations CVPR 2018 Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li

For object detection, the two-stage approach (e. g., Faster R-CNN) has been achieving the highest accuracy, whereas the one-stage approach (e. g., SSD) has the advantage of high efficiency.

Object object-detection +1

S3FD: Single Shot Scale-Invariant Face Detector

no code implementations ICCV 2017 Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li

This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S3FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.

Face Detection

FaceBoxes: A CPU Real-time Face Detector with High Accuracy

11 code implementations17 Aug 2017 Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li

The MSCL aims at enriching the receptive fields and discretizing anchors over different layers to handle faces of various scales.

Face Detection Vocal Bursts Intensity Prediction

S$^3$FD: Single Shot Scale-invariant Face Detector

3 code implementations17 Aug 2017 Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li

This paper presents a real-time face detector, named Single Shot Scale-invariant Face Detector (S$^3$FD), which performs superiorly on various scales of faces with a single deep neural network, especially for small faces.

Face Detection

Learning Efficient Image Representation for Person Re-Identification

no code implementations7 Jul 2017 Yang Yang, Shengcai Liao, Zhen Lei, Stan Z. Li

Then, a robust image representation based on color names is obtained by concatenating the statistical descriptors in each stripe.

Diversity Person Re-Identification

Deep Person Re-Identification with Improved Embedding and Efficient Training

1 code implementation9 May 2017 Haibo Jin, Xiaobo Wang, Shengcai Liao, Stan Z. Li

However, to achieve this, existing deep models prefer to adopt image pairs or triplets to form verification loss, which is inefficient and unstable since the number of training pairs or triplets grows rapidly as the number of training data grows.

Person Re-Identification

Multi-Modality Fusion based on Consensus-Voting and 3D Convolution for Isolated Gesture Recognition

no code implementations21 Nov 2016 Jiali Duan, Shuai Zhou, Jun Wan, Xiaoyuan Guo, Stan Z. Li

Recently, the popularity of depth-sensors such as Kinect has made depth videos easily available while its advantages have not been fully exploited.

Gesture Recognition

Embedding Deep Metric for Person Re-identication A Study Against Large Variations

no code implementations1 Nov 2016 Hailin Shi, Yang Yang, Xiangyu Zhu, Shengcai Liao, Zhen Lei, Wei-Shi Zheng, Stan Z. Li

From this point of view, selecting suitable positive i. e. intra-class) training samples within a local range is critical for training the CNN embedding, especially when the data has large intra-class variations.

Person Re-Identification

CRAFT Objects from Images

1 code implementation CVPR 2016 Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li

They decompose the object detection problem into two cascaded easier tasks: 1) generating object proposals from images, 2) classifying proposals into various object categories.

Object object-detection +2

Efficient PSD Constrained Asymmetric Metric Learning for Person Re-Identification

no code implementations ICCV 2015 Shengcai Liao, Stan Z. Li

We argue that the PSD constraint provides a useful regularization to smooth the solution of the metric, and hence the learned metric is more robust than without the PSD constraint.

Metric Learning Person Re-Identification

Adaptively Unified Semi-Supervised Dictionary Learning With Active Points

no code implementations ICCV 2015 Xiaobo Wang, Xiaojie Guo, Stan Z. Li

In this paper, we present a novel semi-supervised dictionary learning method, which uses the informative coding vectors of both labeled and unlabeled data, and adaptively emphasizes the high confidence coding vectors of unlabeled data to enhance the dictionary discriminative capability simultaneously.

Dictionary Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.