Search Results for author: Jure Leskovec

Found 177 papers, 99 papers with code

STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases

1 code implementation • 19 Apr 2024 • Shirley Wu, Shiyu Zhao, Michihiro Yasunaga, Kexin Huang, Kaidi Cao, Qian Huang, Vassilis N. Ioannidis, Karthik Subbian, James Zou, Jure Leskovec

Answering real-world user queries, such as product search, often requires accurate retrieval of information from semi-structured knowledge bases or databases that involve blend of unstructured (e. g., textual descriptions of products) and structured (e. g., entity relations of products) information.

Benchmarking Retrieval

Paper
Code

From Similarity to Superiority: Channel Clustering for Time Series Forecasting

no code implementations • 31 Mar 2024 • Jialin Chen, Jan Eric Lenssen, Aosong Feng, Weihua Hu, Matthias Fey, Leandros Tassiulas, Jure Leskovec, Rex Ying

Motivated by our observation of a correlation between the time series model's performance boost against channel mixing and the intrinsic similarity on a pair of channels, we developed a novel and adaptable Channel Clustering Module (CCM).

Clustering Time Series +1

Paper
Add Code

PyTorch Frame: A Modular Framework for Multi-Modal Tabular Learning

1 code implementation • 31 Mar 2024 • Weihua Hu, Yiwen Yuan, Zecheng Zhang, Akihiro Nitta, Kaidi Cao, Vid Kocijan, Jure Leskovec, Matthias Fey

We present PyTorch Frame, a PyTorch-based framework for deep learning over multi-modal tabular data.

Ranked #1 on Binary Classification on kickstarter

Binary Classification Toxic Comment Classification

444

Paper
Code

Inferring Dynamic Networks from Marginals with Iterative Proportional Fitting

1 code implementation • 28 Feb 2024 • Serina Chang, Frederic Koehler, Zhaonan Qu, Jure Leskovec, Johan Ugander

A common network inference problem, arising from real-world data constraints, is how to infer a dynamic network from its time-aggregated adjacency matrix and time-varying marginals (i. e., row and column sums).

Paper
Code

Representation Learning for Frequent Subgraph Mining

no code implementations • 22 Feb 2024 • Rex Ying, Tianyu Fu, Andrew Wang, Jiaxuan You, Yu Wang, Jure Leskovec

SPMiner combines graph neural networks, order embedding space, and an efficient search strategy to identify network subgraph patterns that appear most frequently in the target graph.

Representation Learning Subgraph Counting

Paper
Add Code

Uncertainty Quantification for Forward and Inverse Problems of PDEs via Latent Global Evolution

2 code implementations • 13 Feb 2024 • Tailin Wu, Willie Neiswanger, Hongtao Zheng, Stefano Ermon, Jure Leskovec

Deep learning-based surrogate models have demonstrated remarkable advantages over classical solvers in terms of speed, often achieving speedups of 10 to 1000 times over traditional partial differential equation (PDE) solvers.

Decision Making Uncertainty Quantification

Paper
Code

Compositional Generative Inverse Design

1 code implementation • 24 Jan 2024 • Tailin Wu, Takashi Maruyama, Long Wei, Tao Zhang, Yilun Du, Gianluca Iaccarino, Jure Leskovec

In an N-body interaction task and a challenging 2D multi-airfoil design task, we demonstrate that by composing the learned diffusion model at test time, our method allows us to design initial states and boundary shapes that are more complex than those in the training data.

Paper
Code

Equivariant Graph Neural Operator for Modeling 3D Dynamics

no code implementations • 19 Jan 2024 • Minkai Xu, Jiaqi Han, Aaron Lou, Jean Kossaifi, Arvind Ramanathan, Kamyar Azizzadenesheli, Jure Leskovec, Stefano Ermon, Anima Anandkumar

Modeling the complex three-dimensional (3D) dynamics of relational systems is an important problem in the natural sciences, with applications ranging from molecular simulations to particle mechanics.

Operator learning

Paper
Add Code

TimeGraphs: Graph-based Temporal Reasoning

no code implementations • 6 Jan 2024 • Paridhi Maheshwari, Hongyu Ren, Yanan Wang, Rok Sosic, Jure Leskovec

The results demonstrate both robustness and efficiency of TimeGraphs on a range of temporal reasoning tasks.

Zero-shot Generalization

Paper
Add Code

Relational Deep Learning: Graph Representation Learning on Relational Databases

no code implementations • 7 Dec 2023 • Matthias Fey, Weihua Hu, Kexin Huang, Jan Eric Lenssen, Rishabh Ranjan, Joshua Robinson, Rex Ying, Jiaxuan You, Jure Leskovec

The core idea is to view relational databases as a temporal, heterogeneous graph, with a node for each row in each table, and edges specified by primary-foreign key links.

Feature Engineering Graph Representation Learning

Paper
Add Code

GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts

no code implementations • 7 Dec 2023 • Shirley Wu, Kaidi Cao, Bruno Ribeiro, James Zou, Jure Leskovec

Graph data are inherently complex and heterogeneous, leading to a high natural diversity of distributional shifts.

Paper
Add Code

Holistic Evaluation of Text-To-Image Models

1 code implementation • NeurIPS 2023 • Tony Lee, Michihiro Yasunaga, Chenlin Meng, Yifan Mai, Joon Sung Park, Agrim Gupta, Yunzhi Zhang, Deepak Narayanan, Hannah Benita Teufel, Marco Bellagente, Minguk Kang, Taesung Park, Jure Leskovec, Jun-Yan Zhu, Li Fei-Fei, Jiajun Wu, Stefano Ermon, Percy Liang

The stunning qualitative improvement of recent text-to-image models has led to their widespread attention and adoption.

Fairness

1,640

Paper
Code

Context-Aware Meta-Learning

1 code implementation • 17 Oct 2023 • Christopher Fifty, Dennis Duan, Ronald G. Junkins, Ehsan Amid, Jure Leskovec, Christopher Re, Sebastian Thrun

Large Language Models like ChatGPT demonstrate a remarkable capacity to learn new concepts during inference without any fine-tuning.

Ranked #1 on Few-Shot Image Classification on Mini-Imagenet 5-way (5-shot) (using extra training data)

Few-Shot Image Classification In-Context Learning +1

Paper
Code

In-Context Learning for Few-Shot Molecular Property Prediction

no code implementations • 13 Oct 2023 • Christopher Fifty, Jure Leskovec, Sebastian Thrun

In this paper, we adapt the concepts underpinning in-context learning to develop a new algorithm for few-shot molecular property prediction.

Few-Shot Learning In-Context Learning +2

Paper
Add Code

MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation

1 code implementation • 5 Oct 2023 • Qian Huang, Jian Vora, Percy Liang, Jure Leskovec

A central aspect of machine learning research is experimentation, the process of designing and running experiments, analyzing the results, and iterating towards some positive outcome (e. g., improving accuracy).

Benchmarking Decision Making +1

178

Paper
Code

Large Language Models as Analogical Reasoners

no code implementations • 3 Oct 2023 • Michihiro Yasunaga, Xinyun Chen, Yujia Li, Panupong Pasupat, Jure Leskovec, Percy Liang, Ed H. Chi, Denny Zhou

Chain-of-thought (CoT) prompting for language models demonstrates impressive performance across reasoning tasks, but typically needs labeled exemplars of the reasoning process.

Code Generation GSM8K +1

Paper
Add Code

Communication-Free Distributed GNN Training with Vertex Cut

no code implementations • 6 Aug 2023 • Kaidi Cao, Rui Deng, Shirley Wu, Edward W Huang, Karthik Subbian, Jure Leskovec

Here, we introduce CoFree-GNN, a novel distributed GNN training framework that significantly speeds up the training process by implementing communication-free training.

Ranked #2 on Node Classification on Reddit

Node Classification

Paper
Add Code

VQGraph: Rethinking Graph Representation Space for Bridging GNNs and MLPs

1 code implementation • 4 Aug 2023 • Ling Yang, Ye Tian, Minkai Xu, Zhongyi Liu, Shenda Hong, Wei Qu, Wentao Zhang, Bin Cui, Muhan Zhang, Jure Leskovec

To address this issue, we propose to learn a new powerful graph representation space by directly labeling nodes' diverse local structures for GNN-to-MLP distillation.

Knowledge Distillation Quantization +1

Paper
Code

Med-Flamingo: a Multimodal Medical Few-shot Learner

1 code implementation • 27 Jul 2023 • Michael Moor, Qian Huang, Shirley Wu, Michihiro Yasunaga, Cyril Zakka, Yash Dalmia, Eduardo Pontes Reis, Pranav Rajpurkar, Jure Leskovec

However, existing models typically have to be fine-tuned on sizeable down-stream datasets, which poses a significant limitation as in many medical applications data is scarce, necessitating models that are capable of learning from few examples in real-time.

Medical Visual Question Answering Question Answering +1

352

Paper
Code

Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems

1 code implementation • 17 Jul 2023 • Xuan Zhang, Limei Wang, Jacob Helwig, Youzhi Luo, Cong Fu, Yaochen Xie, Meng Liu, Yuchao Lin, Zhao Xu, Keqiang Yan, Keir Adams, Maurice Weiler, Xiner Li, Tianfan Fu, Yucheng Wang, Haiyang Yu, Yuqing Xie, Xiang Fu, Alex Strasser, Shenglong Xu, Yi Liu, Yuanqi Du, Alexandra Saxton, Hongyi Ling, Hannah Lawrence, Hannes Stärk, Shurui Gui, Carl Edwards, Nicholas Gao, Adriana Ladera, Tailin Wu, Elyssa F. Hofgard, Aria Mansouri Tehrani, Rui Wang, Ameya Daigavane, Montgomery Bohde, Jerry Kurtin, Qian Huang, Tuong Phung, Minkai Xu, Chaitanya K. Joshi, Simon V. Mathis, Kamyar Azizzadenesheli, Ada Fang, Alán Aspuru-Guzik, Erik Bekkers, Michael Bronstein, Marinka Zitnik, Anima Anandkumar, Stefano Ermon, Pietro Liò, Rose Yu, Stephan Günnemann, Jure Leskovec, Heng Ji, Jimeng Sun, Regina Barzilay, Tommi Jaakkola, Connor W. Coley, Xiaoning Qian, Xiaofeng Qian, Tess Smidt, Shuiwang Ji

Advances in artificial intelligence (AI) are fueling a new paradigm of discoveries in natural sciences.

Out-of-Distribution Generalization Transfer Learning +1

402

Paper
Code

Temporal Graph Benchmark for Machine Learning on Temporal Graphs

2 code implementations • NeurIPS 2023 • Shenyang Huang, Farimah Poursafaei, Jacob Danovitch, Matthias Fey, Weihua Hu, Emanuele Rossi, Jure Leskovec, Michael Bronstein, Guillaume Rabusseau, Reihaneh Rabbany

We present the Temporal Graph Benchmark (TGB), a collection of challenging and diverse benchmark datasets for realistic, reproducible, and robust evaluation of machine learning models on temporal graphs.

Node Property Prediction Property Prediction

147

Paper
Code

Enabling tabular deep learning when $d \gg n$ with an auxiliary knowledge graph

no code implementations • 7 Jun 2023 • Camilo Ruiz, Hongyu Ren, Kexin Huang, Jure Leskovec

However, for tabular datasets with extremely high $d$-dimensional features but limited $n$ samples (i. e. $d \gg n$), machine learning models struggle to achieve strong performance due to the risk of overfitting.

Inductive Bias

Paper
Add Code

Uncertainty Quantification over Graph with Conformalized Graph Neural Networks

1 code implementation • NeurIPS 2023 • Kexin Huang, Ying Jin, Emmanuel Candès, Jure Leskovec

We establish a permutation invariance condition that enables the validity of CP on graph data and provide an exact characterization of the test-time coverage.

Conformal Prediction Uncertainty Quantification +1

Paper
Code

PRODIGY: Enabling In-context Learning Over Graphs

no code implementations • NeurIPS 2023 • Qian Huang, Hongyu Ren, Peng Chen, Gregor Kržmanc, Daniel Zeng, Percy Liang, Jure Leskovec

In-context learning is the ability of a pretrained model to adapt to novel and diverse downstream tasks by conditioning on prompt examples, without optimizing any parameters.

In-Context Learning Knowledge Graphs

Paper
Add Code

Learning Large Graph Property Prediction via Graph Segment Training

1 code implementation • NeurIPS 2023 • Kaidi Cao, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Dustin Zelle, Yanqi Zhou, Charith Mendis, Jure Leskovec, Bryan Perozzi

Here we propose Graph Segment Training (GST), a general framework that utilizes a divide-and-conquer approach to allow learning large graph property prediction with a constant memory footprint.

Graph Property Prediction Property Prediction

Paper
Code

Geometric Latent Diffusion Models for 3D Molecule Generation

2 code implementations • 2 May 2023 • Minkai Xu, Alexander Powers, Ron Dror, Stefano Ermon, Jure Leskovec

Generative models, especially diffusion models (DMs), have achieved promising results for generating feature-rich geometries and advancing foundational science problems such as molecule design.

3D Molecule Generation valid

160

Paper
Code

Learning Controllable Adaptive Simulation for Multi-resolution Physics

1 code implementation • 1 May 2023 • Tailin Wu, Takashi Maruyama, Qingqing Zhao, Gordon Wetzstein, Jure Leskovec

In this work, we introduce Learning controllable Adaptive simulation for Multi-resolution Physics (LAMP) as the first full deep learning-based surrogate model that jointly learns the evolution model and optimizes appropriate spatial resolutions that devote more compute to the highly dynamic regions.

Paper
Code

When Do Graph Neural Networks Help with Node Classification? Investigating the Impact of Homophily Principle on Node Distinguishability

1 code implementation • 25 Apr 2023 • Sitao Luan, Chenqing Hua, Minkai Xu, Qincheng Lu, Jiaqi Zhu, Xiao-Wen Chang, Jie Fu, Jure Leskovec, Doina Precup

Homophily principle, i. e., nodes with the same labels are more likely to be connected, has been believed to be the main reason for the performance superiority of Graph Neural Networks (GNNs) over Neural Networks on node classification tasks.

Node Classification Stochastic Block Model

Paper
Code

Neural Graph Reasoning: Complex Logical Query Answering Meets Graph Databases

1 code implementation • 26 Mar 2023 • Hongyu Ren, Mikhail Galkin, Michael Cochez, Zhaocheng Zhu, Jure Leskovec

Extending the idea of graph databases (graph DBs), NGDB consists of a Neural Graph Storage and a Neural Graph Engine.

Link Prediction Logical Reasoning +1

152

Paper
Code

AutoTransfer: AutoML with Knowledge Transfer -- An Application to Graph Neural Networks

1 code implementation • 14 Mar 2023 • Kaidi Cao, Jiaxuan You, Jiaju Liu, Jure Leskovec

Experiments demonstrate that (i) our proposed task embedding can be computed efficiently, and that tasks with similar embeddings have similar best-performing architectures; (ii) AutoTransfer significantly improves search efficiency with the transferred design priors, reducing the number of explored architectures by an order of magnitude.

AutoML Transfer Learning

Paper
Code

Relational Multi-Task Learning: Modeling Relations between Data and Tasks

1 code implementation • ICLR 2022 • Kaidi Cao, Jiaxuan You, Jure Leskovec

Here we introduce a novel relational multi-task learning setting where we leverage data point labels from auxiliary tasks to make more accurate predictions on the new task.

Multi-Task Learning

1,601

Paper
Code

Implicit Geometry and Interaction Embeddings Improve Few-Shot Molecular Property Prediction

1 code implementation • 4 Feb 2023 • Christopher Fifty, Joseph M. Paggi, Ehsan Amid, Jure Leskovec, Ron Dror

However, many important molecular properties depend on complex molecular characteristics -- such as the various 3D geometries a molecule may adopt or the types of chemical interactions it can form -- that are not explicitly encoded in the feature space and must be approximated from low amounts of data.

Few-Shot Learning Molecular Docking +4

Paper
Code

Zero-shot causal learning

1 code implementation • NeurIPS 2023 • Hamed Nilforoshan, Michael Moor, Yusuf Roohani, Yining Chen, Anja Šurina, Michihiro Yasunaga, Sara Oblak, Jure Leskovec

There are a large number of methods to predict the effect of an existing intervention based on historical data from individuals who received it.

Marketing Meta-Learning

Paper
Code

Learning Graph Search Heuristics

no code implementations • Learning on Graphs 2022 • Michal Pándy, Weikang Qiu, Gabriele Corso, Petar Veličković, Rex Ying, Jure Leskovec, Pietro Liò

At training time, we aggregate datasets of search trajectories and ground-truth shortest path distances, which we use to train a specialized graph neural network-based heuristic function using backpropagation through steps of the pathfinding process.

Graph Representation Learning Imitation Learning

Paper
Add Code

Retrieval-Augmented Multimodal Language Modeling

no code implementations • 22 Nov 2022 • Michihiro Yasunaga, Armen Aghajanyan, Weijia Shi, Rich James, Jure Leskovec, Percy Liang, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih

To integrate knowledge in a more scalable and modular way, we propose a retrieval-augmented multimodal model, which enables a base multimodal model (generator) to refer to relevant text and images fetched by a retriever from external memory (e. g., documents on the web).

Ranked #7 on Image Captioning on MS COCO

Caption Generation Image Captioning +5

Paper
Add Code

CausalBench: A Large-scale Benchmark for Network Inference from Single-cell Perturbation Data

2 code implementations • 31 Oct 2022 • Mathieu Chevalley, Yusuf Roohani, Arash Mehrjou, Jure Leskovec, Patrick Schwab

Traditional evaluations conducted on synthetic datasets do not reflect the performance in real-world systems.

Causal Discovery Causal Inference +1

Paper
Code

TuneUp: A Simple Improved Training Strategy for Graph Neural Networks

no code implementations • 26 Oct 2022 • Weihua Hu, Kaidi Cao, Kexin Huang, Edward W Huang, Karthik Subbian, Kenji Kawaguchi, Jure Leskovec

Extensive evaluation of TuneUp on five diverse GNN architectures, three types of prediction tasks, and both transductive and inductive settings shows that TuneUp significantly improves the performance of the base GNN on tail nodes, while often even improving the performance on head nodes.

Data Augmentation

Paper
Add Code

Efficient Automatic Machine Learning via Design Graphs

1 code implementation • 21 Oct 2022 • Shirley Wu, Jiaxuan You, Jure Leskovec, Rex Ying

FALCON features 1) a task-agnostic module, which performs message passing on the design graph via a Graph Neural Network (GNN), and 2) a task-specific module, which conducts label propagation of the known model performance information on the design graph.

AutoML Graph Classification +1

Paper
Code

Deep Bidirectional Language-Knowledge Graph Pretraining

1 code implementation • 17 Oct 2022 • Michihiro Yasunaga, Antoine Bosselut, Hongyu Ren, Xikun Zhang, Christopher D Manning, Percy Liang, Jure Leskovec

Pretraining a language model (LM) on text has been shown to help various downstream NLP tasks.

Ranked #1 on Riddle Sense on RiddleSense

Knowledge Graphs Language Modelling +4

288

Paper
Code

Few-shot Relational Reasoning via Connection Subgraph Pretraining

1 code implementation • 13 Oct 2022 • Qian Huang, Hongyu Ren, Jure Leskovec

Our pretrained model can then be directly applied to target few-shot tasks on without the need for training few-shot tasks.

Ranked #4 on Few-Shot Image Classification on ImageNet-FS (5-shot, all)

Few-Shot Image Classification Meta-Learning +2

Paper
Code

ROLAND: Graph Learning Framework for Dynamic Graphs

2 code implementations • 15 Aug 2022 • Jiaxuan You, Tianyu Du, Jure Leskovec

Finally, we propose a scalable and efficient training approach for dynamic GNNs via incremental training and meta-learning.

Graph Learning Graph Representation Learning +2

1,601

Paper
Code

ViRel: Unsupervised Visual Relations Discovery with Graph-level Analogy

no code implementations • 4 Jul 2022 • Daniel Zeng, Tailin Wu, Jure Leskovec

Here, we introduce ViRel, a method for unsupervised discovery and learning of Visual Relations with graph-level analogy.

Relation Relation Classification

Paper
Add Code

ZeroC: A Neuro-Symbolic Model for Zero-shot Concept Recognition and Acquisition at Inference Time

1 code implementation • 30 Jun 2022 • Tailin Wu, Megan Tjandrasuwita, Zhengxuan Wu, Xuelin Yang, Kevin Liu, Rok Sosič, Jure Leskovec

In this work, we introduce Zero-shot Concept Recognition and Acquisition (ZeroC), a neuro-symbolic architecture that can recognize and acquire novel concepts in a zero-shot way.

Novel Concepts

Paper
Code

Learning to Accelerate Partial Differential Equations via Latent Global Evolution

1 code implementation • 15 Jun 2022 • Tailin Wu, Takashi Maruyama, Jure Leskovec

We test our method in a 1D benchmark of nonlinear PDEs, 2D Navier-Stokes flows into turbulent phase and an inverse optimization of boundary conditions in 2D Navier-Stokes flow.

Weather Forecasting

Paper
Code

Learning Large-scale Subsurface Simulations with a Hybrid Graph Network Simulator

no code implementations • 15 Jun 2022 • Tailin Wu, Qinchen Wang, Yinan Zhang, Rex Ying, Kaidi Cao, Rok Sosič, Ridwan Jalali, Hassan Hamam, Marko Maucec, Jure Leskovec

To model complex reservoir dynamics at both local and global scale, HGNS consists of a subsurface graph neural network (SGNN) to model the evolution of fluid flows, and a 3D-U-Net to model the evolution of pressure.

Decision Making

Paper
Add Code

Learning Backward Compatible Embeddings

1 code implementation • 7 Jun 2022 • Weihua Hu, Rajas Bansal, Kaidi Cao, Nikhil Rao, Karthik Subbian, Jure Leskovec

We formalize the problem where the goal is for the embedding team to keep updating the embedding version, while the consumer teams do not have to retrain their models.

Fraud Detection Product Recommendation +1

Paper
Code

ItemSage: Learning Product Embeddings for Shopping Recommendations at Pinterest

no code implementations • 24 May 2022 • Paul Baltescu, Haoyu Chen, Nikil Pancha, Andrew Zhai, Jure Leskovec, Charles Rosenberg

Learned embeddings for products are an important building block for web-scale e-commerce recommendation systems.

Multi-Task Learning Recommendation Systems

Paper
Add Code

VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering

no code implementations • ICCV 2023 • Yanan Wang, Michihiro Yasunaga, Hongyu Ren, Shinya Wada, Jure Leskovec

Visual question answering (VQA) requires systems to perform concept-level reasoning by unifying unstructured (e. g., the context in question and answer; "QA context") and structured (e. g., knowledge graph for the QA context and scene; "concept graph") multimodal knowledge.

Knowledge Graphs Question Answering +1

Paper
Add Code

MultiBiSage: A Web-Scale Recommendation System Using Multiple Bipartite Graphs at Pinterest

no code implementations • 21 May 2022 • Saket Gurukar, Nikil Pancha, Andrew Zhai, Eric Kim, Samson Hu, Srinivasan Parthasarathy, Charles Rosenberg, Jure Leskovec

MultiBiSage can capture the graph structure of multiple bipartite graphs to learn high-quality pin embeddings.

Paper
Add Code

PinnerFormer: Sequence Modeling for User Representation at Pinterest

no code implementations • 9 May 2022 • Nikil Pancha, Andrew Zhai, Jure Leskovec, Charles Rosenberg

Sequential models have become increasingly popular in powering personalized recommendation systems over the past several years.

Recommendation Systems

Paper
Add Code

AdaGrid: Adaptive Grid Search for Link Prediction Training Objective

1 code implementation • 30 Mar 2022 • Tim Poštuvan, Jiaxuan You, Mohammadreza Banaei, Rémi Lebret, Jure Leskovec

To mitigate these limitations, we propose Adaptive Grid Search (AdaGrid), which dynamically adjusts the edge message ratio during training.

BIG-bench Machine Learning Link Prediction

Paper
Code

LinkBERT: Pretraining Language Models with Document Links

1 code implementation • ACL 2022 • Michihiro Yasunaga, Jure Leskovec, Percy Liang

Language model (LM) pretraining can learn various knowledge from text corpora, helping downstream tasks.

Ranked #1 on Semantic Similarity on BIOSSES

Document Classification Language Modelling +9

394

Paper
Code

GreaseLM: Graph REASoning Enhanced Language Models for Question Answering

1 code implementation • 21 Jan 2022 • Xikun Zhang, Antoine Bosselut, Michihiro Yasunaga, Hongyu Ren, Percy Liang, Christopher D. Manning, Jure Leskovec

Answering complex questions about textual narratives requires reasoning over both stated context and the world knowledge that underlies it.

Knowledge Graphs Negation +2

219

Paper
Code

Extending the WILDS Benchmark for Unsupervised Adaptation

1 code implementation • ICLR 2022 • Shiori Sagawa, Pang Wei Koh, Tony Lee, Irena Gao, Sang Michael Xie, Kendrick Shen, Ananya Kumar, Weihua Hu, Michihiro Yasunaga, Henrik Marklund, Sara Beery, Etienne David, Ian Stavness, Wei Guo, Jure Leskovec, Kate Saenko, Tatsunori Hashimoto, Sergey Levine, Chelsea Finn, Percy Liang

Unlabeled data can be a powerful point of leverage for mitigating these distribution shifts, as it is frequently much more available than labeled data and can often be obtained from distributions beyond the source distribution as well.

Paper
Code

SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs

1 code implementation • 28 Oct 2021 • Hongyu Ren, Hanjun Dai, Bo Dai, Xinyun Chen, Denny Zhou, Jure Leskovec, Dale Schuurmans

There are two important reasoning tasks on KGs: (1) single-hop knowledge graph completion, which involves predicting individual links in the KG; and (2), multi-hop reasoning, where the goal is to predict which KG entities satisfy a given logical query.

Scheduling

158

Paper
Code

Modeling Heterogeneous Hierarchies with Relation-specific Hyperbolic Cones

1 code implementation • NeurIPS 2021 • Yushi Bai, Rex Ying, Hongyu Ren, Jure Leskovec

Here we present ConE (Cone Embedding), a KG embedding model that is able to simultaneously model multiple hierarchical as well as non-hierarchical relations in a knowledge graph.

Ranked #1 on Ancestor-descendant prediction on WN18RR

Ancestor-descendant prediction Knowledge Graph Completion +2

Paper
Code

GreaseLM: Graph REASoning Enhanced Language Models

no code implementations • ICLR 2022 • Xikun Zhang, Antoine Bosselut, Michihiro Yasunaga, Hongyu Ren, Percy Liang, Christopher D Manning, Jure Leskovec

Answering complex questions about textual narratives requires reasoning over both stated context and the world knowledge that underlies it.

Knowledge Graphs Negation +2

Paper
Add Code

Leveraging the Cell Ontology to classify unseen cell types

1 code implementation • Nature Communications 2021 • Sheng Wang, Angela Oliveira Pisco, Aaron McGeever, Maria Brbic, Marinka Zitnik, Spyros Darmanis, Jure Leskovec, Jim Karkanias, Russ B. Altman

Single cell technologies are rapidly generating large amounts of data that enables us to understand biological systems at single-cell resolution.

Paper
Code

Neural Distance Embeddings for Biological Sequences

1 code implementation • NeurIPS 2021 • Gabriele Corso, Rex Ying, Michal Pándy, Petar Veličković, Jure Leskovec, Pietro Liò

The development of data-dependent heuristics and representations for biological sequences that reflect their evolutionary distance is critical for large-scale biological research.

Multiple Sequence Alignment

Paper
Code

LM-Critic: Language Models for Unsupervised Grammatical Error Correction

2 code implementations • EMNLP 2021 • Michihiro Yasunaga, Jure Leskovec, Percy Liang

Training a model for grammatical error correction (GEC) requires a set of labeled ungrammatical / grammatical sentence pairs, but manually annotating such pairs can be expensive.

Ranked #2 on Grammatical Error Correction on Unrestricted

Grammatical Error Correction Language Modelling +2

863

Paper
Code

On the Opportunities and Risks of Foundation Models

2 code implementations • 16 Aug 2021 • Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh, Li Fei-Fei, Chelsea Finn, Trevor Gale, Lauren Gillespie, Karan Goel, Noah Goodman, Shelby Grossman, Neel Guha, Tatsunori Hashimoto, Peter Henderson, John Hewitt, Daniel E. Ho, Jenny Hong, Kyle Hsu, Jing Huang, Thomas Icard, Saahil Jain, Dan Jurafsky, Pratyusha Kalluri, Siddharth Karamcheti, Geoff Keeling, Fereshte Khani, Omar Khattab, Pang Wei Koh, Mark Krass, Ranjay Krishna, Rohith Kuditipudi, Ananya Kumar, Faisal Ladhak, Mina Lee, Tony Lee, Jure Leskovec, Isabelle Levent, Xiang Lisa Li, Xuechen Li, Tengyu Ma, Ali Malik, Christopher D. Manning, Suvir Mirchandani, Eric Mitchell, Zanele Munyikwa, Suraj Nair, Avanika Narayan, Deepak Narayanan, Ben Newman, Allen Nie, Juan Carlos Niebles, Hamed Nilforoshan, Julian Nyarko, Giray Ogut, Laurel Orr, Isabel Papadimitriou, Joon Sung Park, Chris Piech, Eva Portelance, Christopher Potts, aditi raghunathan, Rob Reich, Hongyu Ren, Frieda Rong, Yusuf Roohani, Camilo Ruiz, Jack Ryan, Christopher Ré, Dorsa Sadigh, Shiori Sagawa, Keshav Santhanam, Andy Shih, Krishnan Srinivasan, Alex Tamkin, Rohan Taori, Armin W. Thomas, Florian Tramèr, Rose E. Wang, William Wang, Bohan Wu, Jiajun Wu, Yuhuai Wu, Sang Michael Xie, Michihiro Yasunaga, Jiaxuan You, Matei Zaharia, Michael Zhang, Tianyi Zhang, Xikun Zhang, Yuhui Zhang, Lucia Zheng, Kaitlyn Zhou, Percy Liang

AI is undergoing a paradigm shift with the rise of models (e. g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks.

Transfer Learning

847

Paper
Code

Combiner: Full Attention Transformer with Sparse Computation Cost

2 code implementations • NeurIPS 2021 • Hongyu Ren, Hanjun Dai, Zihang Dai, Mengjiao Yang, Jure Leskovec, Dale Schuurmans, Bo Dai

However, the key limitation of transformers is their quadratic memory and time complexity $\mathcal{O}(L^2)$ with respect to the sequence length in attention layers, which restricts application in extremely long sequences.

Ranked #2 on Language Modelling on Wiki-40B

Image Generation Language Modelling

32,819

Paper
Code

GNNAutoScale: Scalable and Expressive Graph Neural Networks via Historical Embeddings

1 code implementation • 10 Jun 2021 • Matthias Fey, Jan E. Lenssen, Frank Weichert, Jure Leskovec

We present GNNAutoScale (GAS), a framework for scaling arbitrary message-passing GNNs to large graphs.

157

Paper
Code

QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering

4 code implementations • NAACL 2021 • Michihiro Yasunaga, Hongyu Ren, Antoine Bosselut, Percy Liang, Jure Leskovec

The problem of answering questions using knowledge from pre-trained language models (LMs) and knowledge graphs (KGs) presents two challenges: given a QA context (question and answer choice), methods need to (i) identify relevant knowledge from large KGs, and (ii) perform joint reasoning over the QA context and KG.

Ranked #2 on Riddle Sense on RiddleSense

Graph Representation Learning Knowledge Graphs +5

604

Paper
Code

Language-Agnostic Representation Learning of Source Code from Structure and Context

1 code implementation • ICLR 2021 • Daniel Zügner, Tobias Kirschstein, Michele Catasta, Jure Leskovec, Stephan Günnemann

Source code (Context) and its parsed abstract syntax tree (AST; Structure) are two complementary representations of the same computer program.

Code Summarization Representation Learning +1

161

Paper
Code

OGB-LSC: A Large-Scale Challenge for Machine Learning on Graphs

6 code implementations • 17 Mar 2021 • Weihua Hu, Matthias Fey, Hongyu Ren, Maho Nakata, Yuxiao Dong, Jure Leskovec

Enabling effective and efficient machine learning (ML) over large-scale graph data (e. g., graphs with billions of edges) can have a great impact on both industrial and scientific applications.

Ranked #1 on Knowledge Graphs on WikiKG90M-LSC

BIG-bench Machine Learning Graph Learning +4

13,001

Paper
Code

ForceNet: A Graph Neural Network for Large-Scale Quantum Calculations

no code implementations • 2 Mar 2021 • Weihua Hu, Muhammed Shuaibi, Abhishek Das, Siddharth Goyal, Anuroop Sriram, Jure Leskovec, Devi Parikh, C. Lawrence Zitnick

By not imposing explicit physical constraints, we can flexibly design expressive models while maintaining their computational efficiency.

Atomic Forces Computational Efficiency +1

Paper
Add Code

Therapeutics Data Commons: Machine Learning Datasets and Tasks for Drug Discovery and Development

2 code implementations • 18 Feb 2021 • Kexin Huang, Tianfan Fu, Wenhao Gao, Yue Zhao, Yusuf Roohani, Jure Leskovec, Connor W. Coley, Cao Xiao, Jimeng Sun, Marinka Zitnik

Here, we introduce Therapeutics Data Commons (TDC), the first unifying platform to systematically access and evaluate machine learning across the entire range of therapeutics.

BIG-bench Machine Learning Drug Discovery

930

Paper
Code

Model-Agnostic Graph Regularization for Few-Shot Learning

no code implementations • 14 Feb 2021 • Ethan Shen, Maria Brbic, Nicholas Monath, Jiaqi Zhai, Manzil Zaheer, Jure Leskovec

In this paper, we present a comprehensive empirical study on graph embedded few-shot learning.

Ranked #2 on Few-Shot Image Classification on ImageNet-FS (5-shot, novel)

Few-Shot Image Classification Few-Shot Learning

Paper
Add Code

Driver2vec: Driver Identification from Automotive Data

no code implementations • 10 Feb 2021 • Jingbo Yang, Ruge Zhao, Meixian Zhu, David Hallac, Jaka Sodnik, Jure Leskovec

In this paper, we develop a deep learning architecture (Driver2vec) to map a short interval of driving data into an embedding space that represents the driver's behavior to assist in driver identification.

Autonomous Driving Driver Identification

Paper
Add Code

Open-World Semi-Supervised Learning

1 code implementation • ICLR 2022 • Kaidi Cao, Maria Brbic, Jure Leskovec

Here, we introduce a novel open-world semi-supervised learning setting that formalizes the notion that novel classes may appear in the unlabeled test data.

Ranked #2 on Open-World Semi-Supervised Learning on ImageNet-100

Image Classification Novel Object Detection +1

Paper
Code

Identity-aware Graph Neural Networks

1 code implementation • 25 Jan 2021 • Jiaxuan You, Jonathan Gomes-Selman, Rex Ying, Jure Leskovec

However, the expressive power of existing GNNs is upper-bounded by the 1-Weisfeiler-Lehman (1-WL) graph isomorphism test, which means GNNs that are not able to predict node clustering coefficients and shortest path distances, and cannot differentiate between different d-regular graphs.

Graph Classification Graph Property Prediction +3

1,601

Paper
Code

Inductive Representation Learning in Temporal Networks via Causal Anonymous Walks

no code implementations • ICLR 2021 • Yanbang Wang, Yen-Yu Chang, Yunyu Liu, Jure Leskovec, Pan Li

Temporal networks serve as abstractions of many real-world dynamic systems.

Representation Learning Retrieval

Paper
Add Code

ForceNet: A Graph Neural Network for Large-Scale Quantum Chemistry Simulation

no code implementations • 1 Jan 2021 • Weihua Hu, Muhammed Shuaibi, Abhishek Das, Siddharth Goyal, Anuroop Sriram, Jure Leskovec, Devi Parikh, Larry Zitnick

We use ForceNet to perform quantum chemistry simulations, where ForceNet is able to achieve 4x higher success rate than existing ML models.

Atomic Forces

Paper
Add Code

WILDS: A Benchmark of in-the-Wild Distribution Shifts

6 code implementations • 14 Dec 2020 • Pang Wei Koh, Shiori Sagawa, Henrik Marklund, Sang Michael Xie, Marvin Zhang, Akshay Balsubramani, Weihua Hu, Michihiro Yasunaga, Richard Lanas Phillips, Irena Gao, Tony Lee, Etienne David, Ian Stavness, Wei Guo, Berton A. Earnshaw, Imran S. Haque, Sara Beery, Jure Leskovec, Anshul Kundaje, Emma Pierson, Sergey Levine, Chelsea Finn, Percy Liang

Distribution shifts -- where the training distribution differs from the test distribution -- can substantially degrade the accuracy of machine learning (ML) systems deployed in the wild.

1,332

Paper
Code

Coresets for Robust Training of Deep Neural Networks against Noisy Labels

no code implementations • NeurIPS 2020 • Baharan Mirzasoleiman, Kaidi Cao, Jure Leskovec

Modern neural networks have the capacity to overfit noisy labels frequently found in real-world datasets.

Paper
Add Code

Design Space for Graph Neural Networks

2 code implementations • NeurIPS 2020 • Jiaxuan You, Rex Ying, Jure Leskovec

However, current research focuses on proposing and evaluating specific architectural designs of GNNs, as opposed to studying the more general design space of GNNs that consists of a Cartesian product of different design dimensions, such as the number of layers or the type of the aggregation function.

Management

1,601

Paper
Code

Coresets for Robust Training of Neural Networks against Noisy Labels

no code implementations • 15 Nov 2020 • Baharan Mirzasoleiman, Kaidi Cao, Jure Leskovec

Modern neural networks have the capacity to overfit noisy labels frequently found in real-world datasets.

Ranked #36 on Image Classification on mini WebVision 1.0

Image Classification

Paper
Add Code

F-FADE: Frequency Factorization for Anomaly Detection in Edge Streams

1 code implementation • 9 Nov 2020 • Yen-Yu Chang, Pan Li, Rok Sosic, M. H. Afifi, Marco Schweighauser, Jure Leskovec

Edge streams are commonly used to capture interactions in dynamic networks, such as email, social, or computer networks.

Anomaly Detection Anomaly Detection in Edge Streams

Paper
Code

Handling Missing Data with Graph Representation Learning

1 code implementation • NeurIPS 2020 • Jiaxuan You, Xiaobai Ma, Daisy Yi Ding, Mykel Kochenderfer, Jure Leskovec

GRAPE tackles the missing data problem using a graph representation, where the observations and features are viewed as two types of nodes in a bipartite graph, and the observed feature values as edges.

Graph Representation Learning Imputation

127

Paper
Code

Graph Information Bottleneck

1 code implementation • NeurIPS 2020 • Tailin Wu, Hongyu Ren, Pan Li, Jure Leskovec

We design two sampling algorithms for structural regularization and instantiate the GIB principle with two new models: GIB-Cat and GIB-Bern, and demonstrate the benefits by evaluating the resilience to adversarial attacks.

Representation Learning

120

Paper
Code

Beta Embeddings for Multi-Hop Logical Reasoning in Knowledge Graphs

7 code implementations • NeurIPS 2020 • Hongyu Ren, Jure Leskovec

Logical operations are performed in the embedding space by neural operators over the probabilistic embeddings.

Ranked #5 on Complex Query Answering on NELL-995

Complex Query Answering Logical Reasoning +1

256

Paper
Code

Multi-hop Attention Graph Neural Network

1 code implementation • 29 Sep 2020 • Guangtao Wang, Rex Ying, Jing Huang, Jure Leskovec

Currently, at every layer, attention is computed between connected pairs of nodes and depends solely on the representation of the two nodes.

Graph Representation Learning Knowledge Graph Completion +1

Paper
Code

Inductive Learning on Commonsense Knowledge Graph Completion

1 code implementation • 19 Sep 2020 • Bin Wang, Guangtao Wang, Jing Huang, Jiaxuan You, Jure Leskovec, C. -C. Jay Kuo

Here, we propose to study the inductive learning setting for CKG completion where unseen entities may present at test time.

Entity Embeddings Knowledge Graph Completion +2

Paper
Code

Distance Encoding: Design Provably More Powerful Neural Networks for Graph Representation Learning

2 code implementations • NeurIPS 2020 • Pan Li, Yanbang Wang, Hongwei Wang, Jure Leskovec

DE captures the distance between the node set whose representation is to be learned and each node in the graph.

Computational Efficiency Graph Representation Learning +1

179

Paper
Code

OCEAN: Online Task Inference for Compositional Tasks with Context Adaptation

1 code implementation • 17 Aug 2020 • Hongyu Ren, Yuke Zhu, Jure Leskovec, Anima Anandkumar, Animesh Garg

We propose a variational inference framework OCEAN to perform online task inference for compositional tasks.

Variational Inference

Paper
Code

Concept Learners for Few-Shot Learning

2 code implementations • ICLR 2021 • Kaidi Cao, Maria Brbic, Jure Leskovec

Developing algorithms that are able to generalize to a novel task given only a few labeled examples represents a fundamental challenge in closing the gap between machine- and human-level performance.

Few-Shot Learning Fine-Grained Image Classification

111

Paper
Code

Graph Structure of Neural Networks

3 code implementations • ICML 2020 • Jiaxuan You, Jure Leskovec, Kaiming He, Saining Xie

Neural networks are often represented as graphs of connections between neurons.

Clustering

152

Paper
Code

PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest

no code implementations • 7 Jul 2020 • Aditya Pal, Chantat Eksombatchai, Yitong Zhou, Bo Zhao, Charles Rosenberg, Jure Leskovec

Latent user representations are widely adopted in the tech industry for powering personalized recommender systems.

Clustering Recommendation Systems

Paper
Add Code

Neural Subgraph Matching

no code implementations • 6 Jul 2020 • Rex, Ying, Zhaoyu Lou, Jiaxuan You, Chengtao Wen, Arquimedes Canedo, Jure Leskovec

Subgraph matching is the problem of determining the presence and location(s) of a given query graph in a large target graph.

Paper
Add Code

Improving Query Safety at Pinterest

no code implementations • 20 Jun 2020 • Abhijit Mahabal, Yinrui Li, Rajat Raina, Daniel Sun, Revati Mahajan, Jure Leskovec

Identifying unsafe queries is necessary to protect users from inappropriate query suggestions.

Paper
Add Code

M2P2: Multimodal Persuasion Prediction using Adaptive Fusion

no code implementations • 3 Jun 2020 • Chongyang Bai, Haipeng Chen, Srijan Kumar, Jure Leskovec, V. S. Subrahmanian

Our M2P2 (Multimodal Persuasion Prediction) framework is the first to use multimodal (acoustic, visual, language) data to solve the IPP problem.

Paper
Add Code

Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations

1 code implementation • ‎‎‏‏‎ ‎ 2020 • Jure Leskovec, Jon Kleinberg, Christos Faloutsos

We provide a new graph generator, based on a "forest fire" spreading process, that has a simple, intuitive justification, requires very few parameters (like the "flammability" of nodes), and produces graphs exhibiting the full range of properties observed both in prior work and in the present study.

Graph Generation

691

Paper
Code

Open Graph Benchmark: Datasets for Machine Learning on Graphs

20 code implementations • NeurIPS 2020 • Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, Jure Leskovec

We present the Open Graph Benchmark (OGB), a diverse set of challenging and realistic benchmark datasets to facilitate scalable, robust, and reproducible graph machine learning (ML) research.

Ranked #1 on Link Property Prediction on ogbl-citation2

Knowledge Graphs Node Property Prediction

1,867

Paper
Code

Learning to Simulate Complex Physics with Graph Networks

12 code implementations • ICML 2020 • Alvaro Sanchez-Gonzalez, Jonathan Godwin, Tobias Pfaff, Rex Ying, Jure Leskovec, Peter W. Battaglia

Here we present a machine learning framework and model implementation that can learn to simulate a wide variety of challenging physical domains, involving fluids, rigid solids, and deformable materials interacting with one another.

12,799

Paper
Code

Unifying Graph Convolutional Neural Networks and Label Propagation

2 code implementations • 17 Feb 2020 • Hongwei Wang, Jure Leskovec

Both solve the task of node classification but LPA propagates node label information across the edges of the graph, while GCN propagates and transforms node feature information.

Ranked #1 on Node Classification on Coauthor Phy

Classification General Classification +1

Paper
Code

Relational Message Passing for Knowledge Graph Completion

4 code implementations • 17 Feb 2020 • Hongwei Wang, Hongyu Ren, Jure Leskovec

Specifically, two kinds of neighborhood topology are modeled for a given entity pair under the relational message passing framework: (1) Relational context, which captures the relation types of edges adjacent to the given entity pair; (2) Relational paths, which characterize the relative position between the given two entities in the knowledge graph.

Knowledge Graph Completion Relation

346

Paper
Code

Query2box: Reasoning over Knowledge Graphs in Vector Space using Box Embeddings

6 code implementations • ICLR 2020 • Hongyu Ren, Weihua Hu, Jure Leskovec

Our main insight is that queries can be embedded as boxes (i. e., hyper-rectangles), where a set of points inside the box corresponds to a set of answer entities of the query.

Ranked #4 on Complex Query Answering on FB15k-237

Complex Query Answering

256

Paper
Code

G2SAT: Learning to Generate SAT Formulas

1 code implementation • NeurIPS 2019 • Jiaxuan You, Haoze Wu, Clark Barrett, Raghuram Ramanujan, Jure Leskovec

The Boolean Satisfiability (SAT) problem is the canonical NP-complete problem and is fundamental to computer science, with a wide array of applications in planning, verification, and theorem proving.

Automated Theorem Proving

Paper
Code

Hyperbolic Graph Convolutional Neural Networks

3 code implementations • NeurIPS 2019 • Ines Chami, Rex Ying, Christopher Ré, Jure Leskovec

Here we propose Hyperbolic Graph Convolutional Neural Network (HGCN), the first inductive hyperbolic GCN that leverages both the expressiveness of GCNs and hyperbolic geometry to learn inductive node representations for hierarchical and scale-free graphs.

Ranked #1 on Link Prediction on PPI (Accuracy metric)

Link Prediction Node Classification

553

Paper
Code

Improving Graph Attention Networks with Large Margin-based Constraints

no code implementations • 25 Oct 2019 • Guangtao Wang, Rex Ying, Jing Huang, Jure Leskovec

Graph Attention Networks (GATs) are the state-of-the-art neural architecture for representation learning with graphs.

Graph Attention Representation Learning

Paper
Add Code

Coresets for Accelerating Incremental Gradient Methods

no code implementations • 25 Sep 2019 • Baharan Mirzasoleiman, Jeff Bilmes, Jure Leskovec

But because at each epoch the gradients are computed only on the subset S, we obtain a speedup that is inversely proportional to the size of S. Our subset selection algorithm is fully general and can be applied to most IG methods.

Open-Ended Question Answering

Paper
Add Code

Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks

1 code implementation • 3 Aug 2019 • Srijan Kumar, Xikun Zhang, Jure Leskovec

However, existing dynamic embedding methods generate embeddings only when users take actions and do not explicitly model the future trajectory of the user/item in the embedding space.

Representation Learning

333

Paper
Code

Selection via Proxy: Efficient Data Selection for Deep Learning

1 code implementation • ICLR 2020 • Cody Coleman, Christopher Yeh, Stephen Mussmann, Baharan Mirzasoleiman, Peter Bailis, Percy Liang, Jure Leskovec, Matei Zaharia

By removing hidden layers from the target model, using smaller architectures, and training for fewer epochs, we create proxies that are an order of magnitude faster to train.

Active Learning Computational Efficiency

Paper
Code

Position-aware Graph Neural Networks

2 code implementations • 11 Jun 2019 • Jiaxuan You, Rex Ying, Jure Leskovec

Learning node embeddings that capture a node's position within the broader graph structure is crucial for many prediction tasks on graphs.

Community Detection Link Prediction +1

13,001

Paper
Code

Redundancy-Free Computation Graphs for Graph Neural Networks

no code implementations • 9 Jun 2019 • Zhihao Jia, Sina Lin, Rex Ying, Jiaxuan You, Jure Leskovec, Alex Aiken

Graph Neural Networks (GNNs) are based on repeated aggregations of information across nodes' neighbors in a graph.

Paper
Add Code

Coresets for Data-efficient Training of Machine Learning Models

3 code implementations • ICML 2020 • Baharan Mirzasoleiman, Jeff Bilmes, Jure Leskovec

Here we develop CRAIG, a method to select a weighted subset (or coreset) of training data that closely estimates the full gradient by maximizing a submodular function.

BIG-bench Machine Learning Open-Ended Question Answering

307

Paper
Code

Strategies for Pre-training Graph Neural Networks

10 code implementations • ICLR 2020 • Weihua Hu, Bowen Liu, Joseph Gomes, Marinka Zitnik, Percy Liang, Vijay Pande, Jure Leskovec

Many applications of machine learning require a model to make accurate pre-dictions on test examples that are distributionally different from training ones, while task-specific labels are scarce during training.

Ranked #3 on Molecular Property Prediction on ToxCast

Graph Classification Molecular Property Prediction +4

918

Paper
Code

Knowledge-aware Graph Neural Networks with Label Smoothness Regularization for Recommender Systems

5 code implementations • 11 May 2019 • Hongwei Wang, Fuzheng Zhang, Mengdi Zhang, Jure Leskovec, Miao Zhao, Wenjie Li, Zhongyuan Wang

Here we propose Knowledge-aware Graph Neural Networks with Label Smoothness regularization (KGNN-LS) to provide better recommendations.

Ranked #1 on Recommendation Systems on Dianping-Food

Feature Engineering Inductive Bias +2

157

Paper
Code

Select Via Proxy: Efficient Data Selection For Training Deep Networks

no code implementations • ICLR 2019 • Cody Coleman, Stephen Mussmann, Baharan Mirzasoleiman, Peter Bailis, Percy Liang, Jure Leskovec, Matei Zaharia

In our approach, we first train a small proxy model quickly, which we then use to estimate the utility of individual training data points, and then select the most informative ones for training the large target model.

BIG-bench Machine Learning Image Classification +1

Paper
Add Code

Hierarchical Temporal Convolutional Networks for Dynamic Recommender Systems

no code implementations • 8 Apr 2019 • Jiaxuan You, Yichen Wang, Aditya Pal, Pong Eksombatchai, Chuck Rosenberg, Jure Leskovec

Recommender systems that can learn from cross-session data to dynamically predict the next item a user will choose are crucial for online platforms.

Session-Based Recommendations

Paper
Add Code

GNNExplainer: Generating Explanations for Graph Neural Networks

10 code implementations • NeurIPS 2019 • Rex Ying, Dylan Bourgeois, Jiaxuan You, Marinka Zitnik, Jure Leskovec

We formulate GNNExplainer as an optimization task that maximizes the mutual information between a GNN's prediction and distribution of possible subgraph structures.

BIG-bench Machine Learning Explainable artificial intelligence +2

13,001

Paper
Code

Learning Dynamic Embeddings from Temporal Interactions

no code implementations • 6 Dec 2018 • Srijan Kumar, Xikun Zhang, Jure Leskovec

Here we present JODIE, a coupled recurrent model to jointly learn the dynamic embeddings of users and items from a sequence of user-item interactions.

Representation Learning

Paper
Add Code

Predicting pregnancy using large-scale data from a women's health tracking mobile application

1 code implementation • 5 Dec 2018 • Bo Liu, Shuyang Shi, Yongshang Wu, Daniel Thomas, Laura Symul, Emma Pierson, Jure Leskovec

Predicting pregnancy has been a fundamental problem in women's health for more than 50 years.

Paper
Code

Complete the Look: Scene-based Complementary Product Recommendation

1 code implementation • CVPR 2019 • Wang-Cheng Kang, Eric Kim, Jure Leskovec, Charles Rosenberg, Julian McAuley

We design an approach to extract training data for this task, and propose a novel way to learn the scene-product compatibility from fashion or interior design images.

Product Recommendation

Paper
Code

How Powerful are Graph Neural Networks?

18 code implementations • ICLR 2019 • Keyulu Xu, Weihua Hu, Jure Leskovec, Stefanie Jegelka

Here, we present a theoretical framework for analyzing the expressive power of GNNs to capture different graph structures.

Ranked #1 on Graph Classification on COX2

General Classification Graph Classification +3

13,001

Paper
Code

MASA: Motif-Aware State Assignment in Noisy Time Series Data

1 code implementation • 6 Sep 2018 • Saachi Jain, David Hallac, Rok Sosic, Jure Leskovec

Such data can be interpreted as a sequence of states, where each state represents a prototype of system behavior.

Clustering Time Series +1

Paper
Code

Inferring Multidimensional Rates of Aging from Cross-Sectional Data

1 code implementation • 12 Jul 2018 • Emma Pierson, Pang Wei Koh, Tatsunori Hashimoto, Daphne Koller, Jure Leskovec, Nicholas Eriksson, Percy Liang

Motivated by the study of human aging, we present an interpretable latent-variable model that learns temporal dynamics from cross-sectional data.

Human Aging Time Series +1

Paper
Code

Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities

no code implementations • 30 Jun 2018 • Marinka Zitnik, Francis Nguyen, Bo wang, Jure Leskovec, Anna Goldenberg, Michael M. Hoffman

In this Review, we describe the principles of data integration and discuss current methods and available implementations.

BIG-bench Machine Learning Data Integration

Paper
Add Code

Hierarchical Graph Representation Learning with Differentiable Pooling

14 code implementations • NeurIPS 2018 • Rex Ying, Jiaxuan You, Christopher Morris, Xiang Ren, William L. Hamilton, Jure Leskovec

Recently, graph neural networks (GNNs) have revolutionized the field of graph representation learning through effectively learned node embeddings, and achieved state-of-the-art results in tasks such as node classification and link prediction.

Ranked #1 on Graph Classification on REDDIT-MULTI-12K

General Classification Graph Classification +3

13,001

Paper
Code

Drive2Vec: Multiscale State-Space Embedding of Vehicular Sensor Data

no code implementations • 12 Jun 2018 • David Hallac, Suvrat Bhooshan, Michael Chen, Kacem Abida, Rok Sosic, Jure Leskovec

With automobiles becoming increasingly reliant on sensors to perform various driving tasks, it is important to encode the relevant CAN bus sensor data in a way that captures the general state of the vehicle in a compact form.

Paper
Add Code

Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation

2 code implementations • NeurIPS 2018 • Jiaxuan You, Bowen Liu, Rex Ying, Vijay Pande, Jure Leskovec

Generating novel graph structures that optimize given objectives while obeying some given underlying rules is fundamental for chemistry, biology and social science research.

Graph Generation Molecular Graph Generation

338

Paper
Code

Graph Convolutional Neural Networks for Web-Scale Recommender Systems

5 code implementations • 6 Jun 2018 • Rex Ying, Ruining He, Kai-Feng Chen, Pong Eksombatchai, William L. Hamilton, Jure Leskovec

We develop a data-efficient Graph Convolutional Network (GCN) algorithm PinSage, which combines efficient random walks and graph convolutions to generate embeddings of nodes (i. e., items) that incorporate both graph structure as well as node feature information.

Recommendation Systems

13,001

Paper
Code

Embedding Logical Queries on Knowledge Graphs

5 code implementations • NeurIPS 2018 • William L. Hamilton, Payal Bajaj, Marinka Zitnik, Dan Jurafsky, Jure Leskovec

Learning low-dimensional embeddings of knowledge graphs is a powerful approach used to predict unobserved or missing edges between entities.

Ranked #6 on Complex Query Answering on FB15k-237

Complex Query Answering

256

Paper
Code

Dynamic Network Model from Partial Observations

no code implementations • NeurIPS 2018 • Elahe Ghalebi, Baharan Mirzasoleiman, Radu Grosu, Jure Leskovec

We propose a novel framework for providing a non-parametric dynamic network model--based on a mixture of coupled hierarchical Dirichlet processes-- based on data capturing cascade node infection times.

Open-Ended Question Answering

Paper
Add Code

Network Enhancement: a general method to denoise weighted biological networks

no code implementations • 9 May 2018 • Bo Wang, Armin Pourshafeie, Marinka Zitnik, Junjie Zhu, Carlos D. Bustamante, Serafim Batzoglou, Jure Leskovec

Networks are ubiquitous in biology where they encode connectivity patterns at all scales of organization, from molecular to the biome.

Denoising

Paper
Add Code

Prioritizing network communities

no code implementations • 7 May 2018 • Marinka Zitnik, Rok Sosic, Jure Leskovec

Community detection identifies candidate modules as hypotheses, which then need to be validated through experiments, such as mutagenesis in a biological laboratory.

Community Detection

Paper
Add Code

Community Interaction and Conflict on the Web

no code implementations • 9 Mar 2018 • Srijan Kumar, William L. Hamilton, Jure Leskovec, Dan Jurafsky

Here we study intercommunity interactions across 36, 000 communities on Reddit, examining cases where users of one community are mobilized by negative sentiment to comment in another community.

Paper
Add Code

GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models

3 code implementations • ICML 2018 • Jiaxuan You, Rex Ying, Xiang Ren, William L. Hamilton, Jure Leskovec

Modeling and generating graphs is fundamental for studying networks in biology, engineering, and social sciences.

Graph Generation

672

Paper
Code

Modeling polypharmacy side effects with graph convolutional networks

1 code implementation • 2 Feb 2018 • Marinka Zitnik, Monica Agrawal, Jure Leskovec

The approach constructs a multimodal graph of protein-protein interactions, drug-protein target interactions, and the polypharmacy side effects, which are represented as drug-drug interactions, where each side effect is an edge of a different type.

Ranked #1 on Link Prediction on Decagon

Link Prediction

436

Paper
Code

Spectral Graph Wavelets for Structural Role Similarity in Networks

no code implementations • ICLR 2018 • Claire Donnat, Marinka Zitnik, David Hallac, Jure Leskovec

Nodes residing in different parts of a graph can have similar structural roles within their local network topology.

Paper
Add Code

Large-scale analysis of disease pathways in the human interactome

no code implementations • 3 Dec 2017 • Monica Agrawal, Marinka Zitnik, Jure Leskovec

However, we show that higher-order network structures, such as small subgraphs of the pathway, provide a promising direction for the development of new methods.

Paper
Add Code

Pixie: A System for Recommending 3+ Billion Items to 200+ Million Users in Real-Time

1 code implementation • 21 Nov 2017 • Chantat Eksombatchai, Pranav Jindal, Jerry Zitao Liu, Yuchen Liu, Rahul Sharma, Charles Sugnet, Mark Ulrich, Jure Leskovec

Furthermore, we develop a graph pruning strategy at that leads to an additional 58% improvement in recommendations.

Recommendation Systems

Paper
Code

Learning Structural Node Embeddings Via Diffusion Wavelets

1 code implementation • KDD 2018 • Claire Donnat, Marinka Zitnik, David Hallac, Jure Leskovec

Nodes residing in different parts of a graph can have similar structural roles within their local network topology.

182

Paper
Code

Data-Driven Model Predictive Control of Autonomous Mobility-on-Demand Systems

no code implementations • 20 Sep 2017 • Ramon Iglesias, Federico Rossi, Kevin Wang, David Hallac, Jure Leskovec, Marco Pavone

The goal of this paper is to present an end-to-end, data-driven framework to control Autonomous Mobility-on-Demand systems (AMoD, i. e. fleets of self-driving vehicles).

Robotics Multiagent Systems Systems and Control Applications

Paper
Add Code

Representation Learning on Graphs: Methods and Applications

no code implementations • 17 Sep 2017 • William L. Hamilton, Rex Ying, Jure Leskovec

Machine learning on graphs is an important and ubiquitous task with applications ranging from drug design to friendship recommendation in social networks.

BIG-bench Machine Learning Dimensionality Reduction +1

Paper
Add Code

Predicting multicellular function through multi-layer tissue networks

1 code implementation • 14 Jul 2017 • Marinka Zitnik, Jure Leskovec

We use OhmNet to study multicellular function in a multi-layer protein interaction network of 107 human tissues.

372

Paper
Code

Interpretable & Explorable Approximations of Black Box Models

no code implementations • 4 Jul 2017 • Himabindu Lakkaraju, Ece Kamar, Rich Caruana, Jure Leskovec

To the best of our knowledge, this is the first approach which can produce global explanations of the behavior of any given black box model through joint optimization of unambiguity, fidelity, and interpretability, while also allowing users to explore model behavior based on their preferences.

Paper
Add Code

Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data

no code implementations • 10 Jun 2017 • David Hallac, Sagar Vare, Stephen Boyd, Jure Leskovec

We derive closed-form solutions to efficiently solve the two resulting subproblems in a scalable way, through dynamic programming and the alternating direction method of multipliers (ADMM), respectively.

Clustering Time Series +1

Paper
Add Code

Driver Identification Using Automobile Sensor Data from a Single Turn

no code implementations • 9 Jun 2017 • David Hallac, Abhijit Sharang, Rainer Stahlmann, Andreas Lamprecht, Markus Huber, Martin Roehder, Rok Sosic, Jure Leskovec

In this paper, we propose a method to predict, from sensor data collected at a single turn, the identity of a driver out of a given set of individuals.

Driver Identification Navigate +2

Paper
Add Code

Inductive Representation Learning on Large Graphs

18 code implementations • NeurIPS 2017 • William L. Hamilton, Rex Ying, Jure Leskovec

Low-dimensional embeddings of nodes in large graphs have proved extremely useful in a variety of prediction tasks, from content recommendation to identifying protein functions.

Ranked #1 on Link Property Prediction on ogbl-ddi

Graph Classification Graph Regression +5

20,120

Paper
Code

Community Identity and User Engagement in a Multi-Community Landscape

no code implementations • 26 May 2017 • Justine Zhang, William L. Hamilton, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky, Jure Leskovec

To this end we introduce a quantitative, language-based typology reflecting two key aspects of a community's identity: how distinctive, and how temporally dynamic it is.

Paper
Add Code

Higher-order clustering in networks

no code implementations • 12 Apr 2017 • Hao Yin, Austin R. Benson, Jure Leskovec

Here we introduce higher-order clustering coefficients that measure the closure probability of higher-order network cliques and provide a more comprehensive view of how the edges of complex networks cluster.

Clustering

Paper
Add Code

An Army of Me: Sockpuppets in Online Discussion Communities

no code implementations • 21 Mar 2017 • Srijan Kumar, Justin Cheng, Jure Leskovec, V. S. Subrahmanian

Further, pairs of sockpuppets controlled by the same individual are more likely to interact on the same discussion at the same time than pairs of ordinary users.

Paper
Add Code

Loyalty in Online Communities

1 code implementation • 9 Mar 2017 • William L. Hamilton, Justine Zhang, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky, Jure Leskovec

In this paper we operationalize loyalty as a user-community relation: users loyal to a community consistently prefer it over all others; loyal communities retain their loyal users over time.

Paper
Code

Network Inference via the Time-Varying Graphical Lasso

1 code implementation • 6 Mar 2017 • David Hallac, Youngsuk Park, Stephen Boyd, Jure Leskovec

Many important problems can be modeled as a system of interconnected entities, where each entity is recording time-dependent observations or measurements.

Time Series Time Series Analysis

Paper
Code

Motifs in Temporal Networks

no code implementations • 29 Dec 2016 • Ashwin Paranjape, Austin R. Benson, Jure Leskovec

Networks are a fundamental tool for modeling complex systems in a variety of domains including social and communication networks as well as biology and neuroscience.

Paper
Add Code

Higher-order organization of complex networks

no code implementations • 26 Dec 2016 • Austin R. Benson, David F. Gleich, Jure Leskovec

Many networks are known to exhibit rich, lower-order connectivity patterns that can be captured at the level of individual nodes and edges.

Social and Information Networks Discrete Mathematics Physics and Society

Paper
Add Code

Confusions over Time: An Interpretable Bayesian Model to Characterize Trends in Decision Making

no code implementations • NeurIPS 2016 • Himabindu Lakkaraju, Jure Leskovec

We propose Confusions over Time (CoT), a novel generative framework which facilitates a multi-granular analysis of the decision making process.

Decision Making

Paper
Add Code

Learning Linguistic Descriptors of User Roles in Online Communities

no code implementations • WS 2016 • Alex Wang, William L. Hamilton, Jure Leskovec

Paper
Add Code

node2vec: Scalable Feature Learning for Networks

18 code implementations • 3 Jul 2016 • Aditya Grover, Jure Leskovec

Taken together, our work represents a new way for efficiently learning state-of-the-art task-independent representations in complex networks.

Ranked #1 on Node Property Prediction on ogbn-proteins

Link Prediction Multi-Label Classification +2

13,001

Paper
Code

Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora

1 code implementation • EMNLP 2016 • William L. Hamilton, Kevin Clark, Jure Leskovec, Dan Jurafsky

A word's sentiment depends on the domain in which it is used.

Word Embeddings

195

Paper
Code

Cultural Shift or Linguistic Drift? Comparing Two Computational Measures of Semantic Change

no code implementations • EMNLP 2016 • William L. Hamilton, Jure Leskovec, Dan Jurafsky

Words shift in meaning for many reasons, including cultural factors like new technologies and regular linguistic processes like subjectification.

Cultural Vocal Bursts Intensity Prediction

Paper
Add Code

Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change

4 code implementations • ACL 2016 • William L. Hamilton, Jure Leskovec, Dan Jurafsky

Understanding how words change their meanings over time is key to models of language and cultural evolution, but historical data on meaning is scarce, making theories hard to develop and test.

Diachronic Word Embeddings Word Embeddings

404

Paper
Code

Large-scale Analysis of Counseling Conversations: An Application of Natural Language Processing to Mental Health

no code implementations • TACL 2016 • Tim Althoff, Kevin Clark, Jure Leskovec

Mental illness is one of the most pressing public health issues of our time.

Clustering Language Modelling

Paper
Add Code

Growing Wikipedia Across Languages via Recommendation

2 code implementations • 12 Apr 2016 • Ellery Wulczyn, Robert West, Leila Zia, Jure Leskovec

The system involves identifying missing articles, ranking the missing articles according to their importance, and recommending important missing articles to editors based on their interests.

Social and Information Networks Digital Libraries

Paper
Code

Do Cascades Recur?

no code implementations • 2 Feb 2016 • Justin Cheng, Lada A. Adamic, Jon Kleinberg, Jure Leskovec

In this paper, we perform a large-scale analysis of cascades on Facebook over significantly longer time scales, and find that a more complex picture emerges, in which many large cascades recur, exhibiting multiple bursts of popularity with periods of quiescence in between.

Paper
Add Code

SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity

1 code implementation • 8 Jun 2015 • Qingyuan Zhao, Murat A. Erdogdu, Hera Y. He, Anand Rajaraman, Jure Leskovec

Social networking websites allow users to create and share content.

Social and Information Networks Physics and Society Applications 60G55, 62P25 H.2.8

Paper
Code

QUOTUS: The Structure of Political Media Coverage as Revealed by Quoting Patterns

no code implementations • 6 Apr 2015 • Vlad Niculae, Caroline Suen, Justine Zhang, Cristian Danescu-Niculescu-Mizil, Jure Leskovec

By encoding bias patterns in a low-rank space we provide an analysis of the structure of political media coverage.

Paper
Add Code

Antisocial Behavior in Online Discussion Communities

no code implementations • 2 Apr 2015 • Justin Cheng, Cristian Danescu-Niculescu-Mizil, Jure Leskovec

User contributions in the form of posts, comments, and votes are essential to the success of online communities.

Paper
Add Code

Exploiting Social Network Structure for Person-to-Person Sentiment Analysis

no code implementations • TACL 2014 • Robert West, Hristo S. Paskov, Jure Leskovec, Christopher Potts

Person-to-person evaluations are prevalent in all kinds of discourse and important for establishing reputations, building social bonds, and shaping public opinion.

Decision Making Sentiment Analysis

Paper
Add Code

How Community Feedback Shapes User Behavior

no code implementations • 6 May 2014 • Justin Cheng, Cristian Danescu-Niculescu-Mizil, Jure Leskovec

Interestingly, the authors that receive no feedback are most likely to leave a community.

Paper
Add Code

Can Cascades be Predicted?

no code implementations • 18 Mar 2014 • Justin Cheng, Lada A. Adamic, P. Alex Dow, Jon Kleinberg, Jure Leskovec

On a large sample of photo reshare cascades on Facebook, we find strong performance in predicting whether a cascade will continue to grow in the future.

Paper
Add Code

Engaging with Massive Online Courses

no code implementations • 12 Mar 2014 • Ashton Anderson, Daniel Huttenlocher, Jon Kleinberg, Jure Leskovec

We also report on a large-scale deployment of badges as incentives for engagement in a MOOC, including randomized experiments in which the presentation of badges was varied across sub-populations.

Paper
Add Code

The Bursty Dynamics of the Twitter Information Network

no code implementations • 11 Mar 2014 • Seth A. Myers, Jure Leskovec

Information diffusion in the form of cascades of post re-sharing often creates such sudden bursts of new connections, which significantly change users' local network structure.

Paper
Add Code

Nonparametric Multi-group Membership Model for Dynamic Networks

no code implementations • NeurIPS 2013 • Myunghwan Kim, Jure Leskovec

Relational data-like graphs, networks, and matrices-is often dynamic, where the relational structure evolves over time.

Link Prediction

Paper
Add Code

A Computational Approach to Politeness with Application to Social Factors

no code implementations • ACL 2013 • Cristian Danescu-Niculescu-Mizil, Moritz Sudhof, Dan Jurafsky, Jure Leskovec, Christopher Potts

We propose a computational framework for identifying linguistic aspects of politeness.

Paper
Add Code

Modeling Information Propagation with Survival Theory

no code implementations • 15 May 2013 • Manuel Gomez Rodriguez, Jure Leskovec, Bernhard Schoelkopf

Networks provide a skeleton for the spread of contagions, like, information, ideas, behaviors and diseases.

Paper
Add Code

From Amateurs to Connoisseurs: Modeling the Evolution of User Expertise through Online Reviews

no code implementations • 18 Mar 2013 • Julian McAuley, Jure Leskovec

Recommending products to consumers means not only understanding their tastes, but also understanding their level of experience.

Paper
Add Code

Structure and Dynamics of Information Pathways in Online Media

no code implementations • 6 Dec 2012 • Manuel Gomez Rodriguez, Jure Leskovec, Bernhard Schölkopf

We assume there is an unobserved dynamic network that changes over time, while we observe the results of a dynamic process spreading over the edges of the network.

Paper
Add Code

Learning to Discover Social Circles in Ego Networks

no code implementations • NeurIPS 2012 • Jure Leskovec, Julian J. McAuley

We develop a model for detecting circles that combines network structure as well as user profile information.

Clustering Node Clustering

Paper
Add Code

Image Labeling on a Network: Using Social-Network Metadata for Image Classification

no code implementations • 16 Jul 2012 • Julian McAuley, Jure Leskovec

Many of these benchmarks are derived from online photo sharing networks, like Flickr, which in addition to hosting images also provide a highly interactive social community.

General Classification Image Classification +2

Paper
Add Code

Defining and Evaluating Network Communities based on Ground-truth

no code implementations • 28 May 2012 • Jaewon Yang, Jure Leskovec

Identifying such communities of nodes has proven to be a challenging task mainly due to a plethora of definitions of a community, intractability of algorithms, issues with evaluation and the lack of a reliable gold-standard ground-truth.

Social and Information Networks Physics and Society

Paper
Add Code

On the Convexity of Latent Social Network Inference

no code implementations • NeurIPS 2010 • Seth Myers, Jure Leskovec

Moreover, our approach scales well as it can infer optimal networks on thousands of nodes in a matter of minutes.

Paper
Add Code

Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters

no code implementations • 8 Oct 2008 • Jure Leskovec, Kevin J. Lang, Anirban Dasgupta, Michael W. Mahoney

A large body of work has been devoted to defining and identifying clusters or communities in social and information networks.

Data Structures and Algorithms Data Analysis, Statistics and Probability Physics and Society

Paper
Add Code

Cost-effective Outbreak Detection in Networks

1 code implementation • SIGKDD 2007 • Jure Leskovec, Andreas Krause, Carlos Guestrin, Christos Faloutsos, Jeanne VanBriesen, Natalie Glance

We show that the approach scales, achieving speedups and savings in storage of several orders of magnitude.

Paper
Code

Sampling From Large Graphs

1 code implementation • KDD 2006 • Jure Leskovec, Christos Faloutsos

Thus graph sampling is essential. The natural questions to ask are (a) which sampling method to use, (b) how small can the sample size be, and (c) how to scale up the measurements of the sample (e. g., the diameter), to get estimates for the large graph.

Graph Sampling Natural Questions

691

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.