Search Results for author: Da Zheng

Found 30 papers, 17 papers with code

Active Community Detection in Massive Graphs

2 code implementations • 30 Dec 2014 • Heng Wang, Da Zheng, Randal Burns, Carey Priebe

A canonical problem in graph mining is the detection of dense communities.

Social and Information Networks Physics and Society

230

Semi-External Memory Sparse Matrix Multiplication for Billion-Node Graphs

2 code implementations • 9 Feb 2016 • Da Zheng, Disa Mhembere, Vince Lyzinski, Joshua Vogelstein, Carey E. Priebe, Randal Burns

In contrast, we scale sparse matrix multiplication beyond memory capacity by implementing sparse matrix dense matrix multiplication (SpMM) in a semi-external memory (SEM) fashion; i.e., we keep the sparse matrix on commodity SSDs and dense matrices in memory.

Distributed, Parallel, and Cluster Computing

230

FlashR: R-Programmed Parallel and Scalable Machine Learning using SSDs

2 code implementations • 21 Apr 2016 • Da Zheng, Disa Mhembere, Joshua T. Vogelstein, Carey E. Priebe, Randal Burns

R is one of the most popular programming languages for statistics and machine learning, but the R framework is relatively slow and unable to scale to large datasets.

Distributed, Parallel, and Cluster Computing

230

knor: A NUMA-Optimized In-Memory, Distributed and Semi-External-Memory k-means Library

1 code implementation • 28 Jun 2016 • Disa Mhembere, Da Zheng, Carey E. Priebe, Joshua T. Vogelstein, Randal Burns

The k-means NUMA Optimized Routine (knor) library has (i) in-memory (knori), (ii) distributed memory (knord), and (iii) semi-external memory (knors) modules that radically improve the performance of k-means for varying memory and hardware budgets.

Distributed, Parallel, and Cluster Computing

Supervised Dimensionality Reduction for Big Data

1 code implementation • 5 Sep 2017 • Joshua T. Vogelstein, Eric Bridgeford, Minh Tang, Da Zheng, Christopher Douville, Randal Burns, Mauro Maggioni

To solve key biomedical problems, experimentalists now routinely measure millions or billions of features (dimensions) per sample, with the hope that data science techniques will be able to build accurate data-driven inferences.

Computational Efficiency General Classification +2

Graphyti: A Semi-External Memory Graph Library for FlashGraph

no code implementations • 7 Jul 2019 • Disa Mhembere, Da Zheng, Carey E. Priebe, Joshua T. Vogelstein, Randal Burns

Emerging frameworks avoid the network bottleneck of distributed data with Semi-External Memory (SEM) that uses a single multicore node and operates on graphs larger than memory.

Distributed, Parallel, and Cluster Computing Databases

Deep Graph Library: A Graph-Centric, Highly-Performant Package for Graph Neural Networks

7 code implementations • 3 Sep 2019 • Minjie Wang, Da Zheng, Zihao Ye, Quan Gan, Mufei Li, Xiang Song, Jinjing Zhou, Chao Ma, Lingfan Yu, Yu Gai, Tianjun Xiao, Tong He, George Karypis, Jinyang Li, Zheng Zhang

Advancing research in the emerging field of deep graph learning requires new tools to support tensor computation over graphs.

Ranked #35 on Node Classification on Cora

Graph Learning Node Classification

12,984

DGL-KE: Training Knowledge Graph Embeddings at Scale

1 code implementation • 18 Apr 2020 • Da Zheng, Xiang Song, Chao Ma, Zeyuan Tan, Zihao Ye, Jin Dong, Hao Xiong, Zheng Zhang, George Karypis

Experiments on knowledge graphs consisting of over 86M nodes and 338M edges show that DGL-KE can compute embeddings in 100 minutes on an EC2 instance with 8 GPUs and 30 minutes on an EC2 cluster with 4 machines with 48 cores/machine.

Distributed, Parallel, and Cluster Computing

1,234

Few-shot link prediction via graph neural networks for Covid-19 drug-repurposing

1 code implementation • 20 Jul 2020 • Vassilis N. Ioannidis, Da Zheng, George Karypis

This paper proposes an inductive RGCN for learning informative relation embeddings even in the few-shot learning regime.

Drug Discovery Few-Shot Learning +5

PanRep: Graph neural networks for extracting universal node embeddings in heterogeneous graphs

1 code implementation • 20 Jul 2020 • Vassilis N. Ioannidis, Da Zheng, George Karypis

Learning unsupervised node embeddings facilitates several downstream tasks such as node classification and link prediction.

Link Prediction Node Classification

FeatGraph: A Flexible and Efficient Backend for Graph Neural Network Systems

no code implementations • 26 Aug 2020 • Yuwei Hu, Zihao Ye, Minjie Wang, Jiali Yu, Da Zheng, Mu Li, Zheng Zhang, Zhiru Zhang, Yida Wang

FeatGraph provides a flexible programming interface to express diverse GNN models by composing coarse-grained sparse templates with fine-grained user-defined functions (UDFs) on each vertex/edge.

PanRep: Universal node embeddings for heterogeneous graphs

no code implementations • 28 Sep 2020 • Vassilis N. Ioannidis, Da Zheng, George Karypis

Learning unsupervised node embeddings facilitates several downstream tasks such as node classification and link prediction.

Link Prediction Node Classification

DistDGL: Distributed Graph Neural Network Training for Billion-Scale Graphs

1 code implementation • 11 Oct 2020 • Da Zheng, Chao Ma, Minjie Wang, Jinjing Zhou, Qidong Su, Xiang Song, Quan Gan, Zheng Zhang, George Karypis

To minimize the overheads associated with distributed computations, DistDGL uses a high-quality and light-weight min-cut graph partitioning algorithm along with multiple balancing constraints.

Fraud Detection graph partitioning

12,984

Learning over Families of Sets -- Hypergraph Representation Learning for Higher Order Tasks

no code implementations • 19 Jan 2021 • Balasubramaniam Srinivasan, Da Zheng, George Karypis

In this work, we exploit the incidence structure to develop a hypergraph neural network to learn provably expressive representations of variable sized hyperedges which preserve local-isomorphism in the line graph of the hypergraph, while also being invariant to permutations of its constituent vertices.

Graph Representation Learning hyperedge classification

Schema-Aware Deep Graph Convolutional Networks for Heterogeneous Graphs

no code implementations • 3 May 2021 • Saurav Manchanda, Da Zheng, George Karypis

To address this question, we propose our GCN framework 'Deep Heterogeneous Graph Convolutional Network (DHGCN)', which takes advantage of the schema of a heterogeneous graph and uses a hierarchical approach to effectively utilize information many hops away.

Global Neighbor Sampling for Mixed CPU-GPU Training on Giant Graphs

no code implementations • 11 Jun 2021 • Jialin Dong, Da Zheng, Lin F. Yang, George Karypis

This global cache allows in-GPU importance sampling of mini-batches, which drastically reduces the number of nodes in a mini-batch, especially in the input layer, to reduce data copy between CPU and GPU and mini-batch computation without compromising the training convergence rate or model accuracy.

Fraud Detection

TraverseNet: Unifying Space and Time in Message Passing for Traffic Forecasting

1 code implementation • 25 Aug 2021 • Zonghan Wu, Da Zheng, Shirui Pan, Quan Gan, Guodong Long, George Karypis

This paper aims to unify spatial dependency and temporal dependency in a non-Euclidean space while capturing the inner spatial-temporal dependencies for traffic data.

Attribute

Dr. Top-k: Delegate-Centric Top-k on GPUs

1 code implementation • 16 Sep 2021 • Anil Gaihre, Da Zheng, Scott Weitze, Lingda Li, Shuaiwen Leon Song, Caiwen Ding, Xiaoye S Li, Hang Liu

Recent top-k computation efforts explore the possibility of revising various sorting algorithms to answer top-k queries on GPUs.

TGL: A General Framework for Temporal GNN Training on Billion-Scale Graphs

2 code implementations • 28 Mar 2022 • Hongkuan Zhou, Da Zheng, Israt Nisa, Vasileios Ioannidis, Xiang Song, George Karypis

Our temporal parallel sampler achieves an average of 173x speedup on a multi-core CPU compared with the baselines.

Link Prediction Node Classification +1

172

Nimble GNN Embedding with Tensor-Train Decomposition

no code implementations • 21 Jun 2022 • Chunxing Yin, Da Zheng, Israt Nisa, Christos Faloutsos, George Karypis, Richard Vuduc

This paper describes a new method for representing embedding tables of graph neural networks (GNNs) more compactly via tensor-train (TT) decomposition.

graph partitioning

Efficient and effective training of language and graph neural network models

no code implementations • 22 Jun 2022 • Vassilis N. Ioannidis, Xiang Song, Da Zheng, Houyu Zhang, Jun Ma, Yi Xu, Belinda Zeng, Trishul Chilimbi, George Karypis

The effectiveness of our framework is achieved by applying stage-wise fine-tuning of the BERT model, first with heterogeneous graph information and then with a GNN model.

Edge Classification Language Modelling +1

From Local to Global: Spectral-Inspired Graph Neural Networks

1 code implementation • 24 Sep 2022 • Ningyuan Huang, Soledad Villar, Carey E. Priebe, Da Zheng, Chengyue Huang, Lin Yang, Vladimir Braverman

Graph Neural Networks (GNNs) are powerful deep learning methods for Non-Euclidean data.

Graph Embedding

Hector: An Efficient Programming and Compilation Framework for Implementing Relational Graph Neural Networks in GPU Architectures

no code implementations • 16 Jan 2023 • Kun Wu, Mert Hidayetoğlu, Xiang Song, Sitao Huang, Da Zheng, Israt Nisa, Wen-mei Hwu

Relational graph neural networks (RGNNs) are graph neural networks with dedicated structures for modeling the different types of nodes and edges in heterogeneous graphs.

8k C++ code +1

OrthoReg: Improving Graph-regularized MLPs via Orthogonality Regularization

no code implementations • 31 Jan 2023 • Hengrui Zhang, Shen Wang, Vassilis N. Ioannidis, Soji Adeshina, Jiani Zhang, Xiao Qin, Christos Faloutsos, Da Zheng, George Karypis, Philip S. Yu

Graph Neural Networks (GNNs) currently dominate the modeling of graph-structured data, but their heavy reliance on graph structure for inference significantly impedes their widespread application.

Node Classification

PaGE-Link: Path-based Graph Neural Network Explanation for Heterogeneous Link Prediction

1 code implementation • 24 Feb 2023 • Shichang Zhang, Jiani Zhang, Xiang Song, Soji Adeshina, Da Zheng, Christos Faloutsos, Yizhou Sun

However, GNN explanation for link prediction (LP) is lacking in the literature.

Link Prediction

Train Your Own GNN Teacher: Graph-Aware Distillation on Textual Graphs

1 code implementation • 20 Apr 2023 • Costas Mavromatis, Vassilis N. Ioannidis, Shen Wang, Da Zheng, Soji Adeshina, Jun Ma, Han Zhao, Christos Faloutsos, George Karypis

Different from conventional knowledge distillation, GRAD jointly optimizes a GNN teacher and a graph-free student over the graph's nodes via a shared LM.

Knowledge Distillation Node Classification

Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications

no code implementations • 5 Jun 2023 • Han Xie, Da Zheng, Jun Ma, Houyu Zhang, Vassilis N. Ioannidis, Xiang Song, Qing Ping, Sheng Wang, Carl Yang, Yi Xu, Belinda Zeng, Trishul Chilimbi

Model pre-training on large text corpora has been shown to be effective for various downstream applications in the NLP domain.

Graph Mining Language Modelling

DistTGL: Distributed Memory-Based Temporal Graph Neural Network Training

no code implementations • 14 Jul 2023 • Hongkuan Zhou, Da Zheng, Xiang Song, George Karypis, Viktor Prasanna

Even worse, the tremendous overhead of synchronizing the node memory makes it impractical to deploy on distributed GPU clusters.

Graph Representation Learning

Tango: rethinking quantization for graph neural network training on GPUs

no code implementations • 2 Aug 2023 • Shiyang Chen, Da Zheng, Caiwen Ding, Chengying Huan, Yuede Ji, Hang Liu

Graph Neural Networks (GNNs) are becoming increasingly popular due to their superior performance in critical graph-related tasks.

Quantization

NetInfoF Framework: Measuring and Exploiting Network Usable Information

1 code implementation • 12 Feb 2024 • Meng-Chieh Lee, Haiyang Yu, Jian Zhang, Vassilis N. Ioannidis, Xiang Song, Soji Adeshina, Da Zheng, Christos Faloutsos

Given a node-attributed graph, and a graph task (link prediction or node classification), can we tell if a graph neural network (GNN) will perform well?

Link Prediction Node Classification
