Since relative positional encoding (RPE) is used by default in many state-of-the-art models, designing efficient Transformers that can incorporate RPE is appealing.
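To make this concrete, below is a minimal single-head sketch of how an RPE term can enter the attention computation: each clipped relative offset indexes a learned scalar bias that is added to the attention logits before the softmax. The module and parameter names are illustrative assumptions, not any particular model's implementation.

```python
import torch
import torch.nn as nn

class RPEAttention(nn.Module):
    """Single-head attention with a learnable relative positional bias.

    Illustrative sketch: the relative offset (i - j), clipped to
    [-max_dist, max_dist], indexes a learned scalar added to the
    attention logit for the pair (i, j).
    """

    def __init__(self, dim: int, max_dist: int = 128):
        super().__init__()
        self.qkv = nn.Linear(dim, 3 * dim)
        self.max_dist = max_dist
        # One learnable bias per clipped relative offset.
        self.rel_bias = nn.Embedding(2 * max_dist + 1, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        n, d = x.shape[-2], x.shape[-1]
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        logits = q @ k.transpose(-2, -1) / d**0.5

        pos = torch.arange(n, device=x.device)
        rel = (pos[:, None] - pos[None, :]).clamp(-self.max_dist, self.max_dist)
        logits = logits + self.rel_bias(rel + self.max_dist).squeeze(-1)

        return torch.softmax(logits, dim=-1) @ v
```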
In this technical report, we present our solution to the PCQM4M-LSC track of the KDD Cup 2021 OGB Large-Scale Challenge.
Our key insight into utilizing the Transformer on graphs is the necessity of effectively encoding the structural information of a graph into the model.
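One way such structural information can be encoded, in the spirit of Graphormer's spatial encoding, is to turn shortest-path distances between node pairs into learned attention biases. The sketch below is a simplified single-head illustration under that assumption; the class and argument names are hypothetical, not the released implementation.

```python
import torch
import torch.nn as nn

class SpatialBiasAttention(nn.Module):
    """Attention over graph nodes with a shortest-path-distance bias.

    Minimal single-head sketch: the shortest-path distance between
    nodes i and j indexes a learned scalar added to attention logit
    (i, j), so structurally close nodes can attend more strongly.
    """

    def __init__(self, dim: int, max_spd: int = 20):
        super().__init__()
        self.qkv = nn.Linear(dim, 3 * dim)
        # One learnable bias per (clipped) shortest-path distance.
        self.spd_bias = nn.Embedding(max_spd + 1, 1)
        self.max_spd = max_spd

    def forward(self, x: torch.Tensor, spd: torch.Tensor) -> torch.Tensor:
        # x:   (n_nodes, dim) node features of one graph
        # spd: (n_nodes, n_nodes) integer shortest-path distances
        d = x.shape[-1]
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        logits = q @ k.transpose(-2, -1) / d**0.5
        bias = self.spd_bias(spd.clamp(max=self.max_spd)).squeeze(-1)
        return torch.softmax(logits + bias, dim=-1) @ v
```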
The language embedding can either be added to the word embedding or attached at the beginning of the sentence.
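As a rough illustration of the two options (the embedding sizes and variable names here are assumptions for the sketch, not the paper's code):

```python
import torch
import torch.nn as nn

# Illustrative dimensions; not taken from the paper.
vocab_size, n_langs, dim = 32000, 16, 512
tok_emb = nn.Embedding(vocab_size, dim)
lang_emb = nn.Embedding(n_langs, dim)

tokens = torch.randint(vocab_size, (1, 10))  # (batch, seq_len)
lang = torch.tensor([3])                     # one language id per example

# Option 1: add the language embedding to every word embedding.
x_add = tok_emb(tokens) + lang_emb(lang)[:, None, :]          # (1, 10, dim)

# Option 2: attach it as an extra token at the start of the sentence.
x_prepend = torch.cat([lang_emb(lang)[:, None, :],
                       tok_emb(tokens)], dim=1)               # (1, 11, dim)
```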
We provide an explanation by showing that InstanceNorm serves as a preconditioner for GNNs, but this preconditioning effect is weaker for BatchNorm due to the heavy batch noise in graph datasets.
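To make the contrast concrete, here is a minimal sketch, assuming per-graph statistics for the InstanceNorm-style case and statistics pooled over all nodes in the batch for the BatchNorm-style case; the function names are hypothetical.

```python
import torch

def graph_instance_norm(x: torch.Tensor, eps: float = 1e-5) -> torch.Tensor:
    """InstanceNorm-style: normalize node features within one graph.

    x: (n_nodes, dim) features of a single graph. Statistics depend
    only on that graph's own nodes, not on the rest of the batch.
    """
    mean = x.mean(dim=0, keepdim=True)
    var = x.var(dim=0, unbiased=False, keepdim=True)
    return (x - mean) / torch.sqrt(var + eps)

def batch_norm_over_nodes(xs: list[torch.Tensor],
                          eps: float = 1e-5) -> list[torch.Tensor]:
    """BatchNorm-style: statistics pooled over all nodes in the batch.

    The mean/variance now vary with which graphs share the batch; this
    batch-to-batch variation is the "batch noise" referred to above.
    """
    all_x = torch.cat(xs, dim=0)
    mean = all_x.mean(dim=0, keepdim=True)
    var = all_x.var(dim=0, unbiased=False, keepdim=True)
    return [(x - mean) / torch.sqrt(var + eps) for x in xs]
```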