Search Results for author: Jure Leskovec

Found 177 papers, 99 papers with code

STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases

1 code implementation19 Apr 2024 Shirley Wu, Shiyu Zhao, Michihiro Yasunaga, Kexin Huang, Kaidi Cao, Qian Huang, Vassilis N. Ioannidis, Karthik Subbian, James Zou, Jure Leskovec

Answering real-world user queries, such as product search, often requires accurate retrieval of information from semi-structured knowledge bases or databases that involve blend of unstructured (e. g., textual descriptions of products) and structured (e. g., entity relations of products) information.

Benchmarking Retrieval

From Similarity to Superiority: Channel Clustering for Time Series Forecasting

no code implementations31 Mar 2024 Jialin Chen, Jan Eric Lenssen, Aosong Feng, Weihua Hu, Matthias Fey, Leandros Tassiulas, Jure Leskovec, Rex Ying

Motivated by our observation of a correlation between the time series model's performance boost against channel mixing and the intrinsic similarity on a pair of channels, we developed a novel and adaptable Channel Clustering Module (CCM).

Clustering Time Series +1

Inferring Dynamic Networks from Marginals with Iterative Proportional Fitting

1 code implementation28 Feb 2024 Serina Chang, Frederic Koehler, Zhaonan Qu, Jure Leskovec, Johan Ugander

A common network inference problem, arising from real-world data constraints, is how to infer a dynamic network from its time-aggregated adjacency matrix and time-varying marginals (i. e., row and column sums).

Representation Learning for Frequent Subgraph Mining

no code implementations22 Feb 2024 Rex Ying, Tianyu Fu, Andrew Wang, Jiaxuan You, Yu Wang, Jure Leskovec

SPMiner combines graph neural networks, order embedding space, and an efficient search strategy to identify network subgraph patterns that appear most frequently in the target graph.

Representation Learning Subgraph Counting

Uncertainty Quantification for Forward and Inverse Problems of PDEs via Latent Global Evolution

2 code implementations13 Feb 2024 Tailin Wu, Willie Neiswanger, Hongtao Zheng, Stefano Ermon, Jure Leskovec

Deep learning-based surrogate models have demonstrated remarkable advantages over classical solvers in terms of speed, often achieving speedups of 10 to 1000 times over traditional partial differential equation (PDE) solvers.

Decision Making Uncertainty Quantification

Compositional Generative Inverse Design

1 code implementation24 Jan 2024 Tailin Wu, Takashi Maruyama, Long Wei, Tao Zhang, Yilun Du, Gianluca Iaccarino, Jure Leskovec

In an N-body interaction task and a challenging 2D multi-airfoil design task, we demonstrate that by composing the learned diffusion model at test time, our method allows us to design initial states and boundary shapes that are more complex than those in the training data.

Equivariant Graph Neural Operator for Modeling 3D Dynamics

no code implementations19 Jan 2024 Minkai Xu, Jiaqi Han, Aaron Lou, Jean Kossaifi, Arvind Ramanathan, Kamyar Azizzadenesheli, Jure Leskovec, Stefano Ermon, Anima Anandkumar

Modeling the complex three-dimensional (3D) dynamics of relational systems is an important problem in the natural sciences, with applications ranging from molecular simulations to particle mechanics.

Operator learning

TimeGraphs: Graph-based Temporal Reasoning

no code implementations6 Jan 2024 Paridhi Maheshwari, Hongyu Ren, Yanan Wang, Rok Sosic, Jure Leskovec

The results demonstrate both robustness and efficiency of TimeGraphs on a range of temporal reasoning tasks.

Zero-shot Generalization

Relational Deep Learning: Graph Representation Learning on Relational Databases

no code implementations7 Dec 2023 Matthias Fey, Weihua Hu, Kexin Huang, Jan Eric Lenssen, Rishabh Ranjan, Joshua Robinson, Rex Ying, Jiaxuan You, Jure Leskovec

The core idea is to view relational databases as a temporal, heterogeneous graph, with a node for each row in each table, and edges specified by primary-foreign key links.

Feature Engineering Graph Representation Learning

GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts

no code implementations7 Dec 2023 Shirley Wu, Kaidi Cao, Bruno Ribeiro, James Zou, Jure Leskovec

Graph data are inherently complex and heterogeneous, leading to a high natural diversity of distributional shifts.

Context-Aware Meta-Learning

1 code implementation17 Oct 2023 Christopher Fifty, Dennis Duan, Ronald G. Junkins, Ehsan Amid, Jure Leskovec, Christopher Re, Sebastian Thrun

Large Language Models like ChatGPT demonstrate a remarkable capacity to learn new concepts during inference without any fine-tuning.

Few-Shot Image Classification In-Context Learning +1

In-Context Learning for Few-Shot Molecular Property Prediction

no code implementations13 Oct 2023 Christopher Fifty, Jure Leskovec, Sebastian Thrun

In this paper, we adapt the concepts underpinning in-context learning to develop a new algorithm for few-shot molecular property prediction.

Few-Shot Learning In-Context Learning +2

MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation

1 code implementation5 Oct 2023 Qian Huang, Jian Vora, Percy Liang, Jure Leskovec

A central aspect of machine learning research is experimentation, the process of designing and running experiments, analyzing the results, and iterating towards some positive outcome (e. g., improving accuracy).

Benchmarking Decision Making +1

Large Language Models as Analogical Reasoners

no code implementations3 Oct 2023 Michihiro Yasunaga, Xinyun Chen, Yujia Li, Panupong Pasupat, Jure Leskovec, Percy Liang, Ed H. Chi, Denny Zhou

Chain-of-thought (CoT) prompting for language models demonstrates impressive performance across reasoning tasks, but typically needs labeled exemplars of the reasoning process.

Code Generation GSM8K +1

Communication-Free Distributed GNN Training with Vertex Cut

no code implementations6 Aug 2023 Kaidi Cao, Rui Deng, Shirley Wu, Edward W Huang, Karthik Subbian, Jure Leskovec

Here, we introduce CoFree-GNN, a novel distributed GNN training framework that significantly speeds up the training process by implementing communication-free training.

Node Classification

VQGraph: Rethinking Graph Representation Space for Bridging GNNs and MLPs

1 code implementation4 Aug 2023 Ling Yang, Ye Tian, Minkai Xu, Zhongyi Liu, Shenda Hong, Wei Qu, Wentao Zhang, Bin Cui, Muhan Zhang, Jure Leskovec

To address this issue, we propose to learn a new powerful graph representation space by directly labeling nodes' diverse local structures for GNN-to-MLP distillation.

Knowledge Distillation Quantization +1

Med-Flamingo: a Multimodal Medical Few-shot Learner

1 code implementation27 Jul 2023 Michael Moor, Qian Huang, Shirley Wu, Michihiro Yasunaga, Cyril Zakka, Yash Dalmia, Eduardo Pontes Reis, Pranav Rajpurkar, Jure Leskovec

However, existing models typically have to be fine-tuned on sizeable down-stream datasets, which poses a significant limitation as in many medical applications data is scarce, necessitating models that are capable of learning from few examples in real-time.

Medical Visual Question Answering Question Answering +1

Temporal Graph Benchmark for Machine Learning on Temporal Graphs

2 code implementations NeurIPS 2023 Shenyang Huang, Farimah Poursafaei, Jacob Danovitch, Matthias Fey, Weihua Hu, Emanuele Rossi, Jure Leskovec, Michael Bronstein, Guillaume Rabusseau, Reihaneh Rabbany

We present the Temporal Graph Benchmark (TGB), a collection of challenging and diverse benchmark datasets for realistic, reproducible, and robust evaluation of machine learning models on temporal graphs.

Node Property Prediction Property Prediction

Enabling tabular deep learning when $d \gg n$ with an auxiliary knowledge graph

no code implementations7 Jun 2023 Camilo Ruiz, Hongyu Ren, Kexin Huang, Jure Leskovec

However, for tabular datasets with extremely high $d$-dimensional features but limited $n$ samples (i. e. $d \gg n$), machine learning models struggle to achieve strong performance due to the risk of overfitting.

Inductive Bias

Uncertainty Quantification over Graph with Conformalized Graph Neural Networks

1 code implementation NeurIPS 2023 Kexin Huang, Ying Jin, Emmanuel Candès, Jure Leskovec

We establish a permutation invariance condition that enables the validity of CP on graph data and provide an exact characterization of the test-time coverage.

Conformal Prediction Uncertainty Quantification +1

PRODIGY: Enabling In-context Learning Over Graphs

no code implementations NeurIPS 2023 Qian Huang, Hongyu Ren, Peng Chen, Gregor Kržmanc, Daniel Zeng, Percy Liang, Jure Leskovec

In-context learning is the ability of a pretrained model to adapt to novel and diverse downstream tasks by conditioning on prompt examples, without optimizing any parameters.

In-Context Learning Knowledge Graphs

Learning Large Graph Property Prediction via Graph Segment Training

1 code implementation NeurIPS 2023 Kaidi Cao, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Dustin Zelle, Yanqi Zhou, Charith Mendis, Jure Leskovec, Bryan Perozzi

Here we propose Graph Segment Training (GST), a general framework that utilizes a divide-and-conquer approach to allow learning large graph property prediction with a constant memory footprint.

Graph Property Prediction Property Prediction

Geometric Latent Diffusion Models for 3D Molecule Generation

2 code implementations2 May 2023 Minkai Xu, Alexander Powers, Ron Dror, Stefano Ermon, Jure Leskovec

Generative models, especially diffusion models (DMs), have achieved promising results for generating feature-rich geometries and advancing foundational science problems such as molecule design.

3D Molecule Generation valid

Learning Controllable Adaptive Simulation for Multi-resolution Physics

1 code implementation1 May 2023 Tailin Wu, Takashi Maruyama, Qingqing Zhao, Gordon Wetzstein, Jure Leskovec

In this work, we introduce Learning controllable Adaptive simulation for Multi-resolution Physics (LAMP) as the first full deep learning-based surrogate model that jointly learns the evolution model and optimizes appropriate spatial resolutions that devote more compute to the highly dynamic regions.

When Do Graph Neural Networks Help with Node Classification? Investigating the Impact of Homophily Principle on Node Distinguishability

1 code implementation25 Apr 2023 Sitao Luan, Chenqing Hua, Minkai Xu, Qincheng Lu, Jiaqi Zhu, Xiao-Wen Chang, Jie Fu, Jure Leskovec, Doina Precup

Homophily principle, i. e., nodes with the same labels are more likely to be connected, has been believed to be the main reason for the performance superiority of Graph Neural Networks (GNNs) over Neural Networks on node classification tasks.

Node Classification Stochastic Block Model

Neural Graph Reasoning: Complex Logical Query Answering Meets Graph Databases

1 code implementation26 Mar 2023 Hongyu Ren, Mikhail Galkin, Michael Cochez, Zhaocheng Zhu, Jure Leskovec

Extending the idea of graph databases (graph DBs), NGDB consists of a Neural Graph Storage and a Neural Graph Engine.

Link Prediction Logical Reasoning +1

AutoTransfer: AutoML with Knowledge Transfer -- An Application to Graph Neural Networks

1 code implementation14 Mar 2023 Kaidi Cao, Jiaxuan You, Jiaju Liu, Jure Leskovec

Experiments demonstrate that (i) our proposed task embedding can be computed efficiently, and that tasks with similar embeddings have similar best-performing architectures; (ii) AutoTransfer significantly improves search efficiency with the transferred design priors, reducing the number of explored architectures by an order of magnitude.

AutoML Transfer Learning

Relational Multi-Task Learning: Modeling Relations between Data and Tasks

1 code implementation ICLR 2022 Kaidi Cao, Jiaxuan You, Jure Leskovec

Here we introduce a novel relational multi-task learning setting where we leverage data point labels from auxiliary tasks to make more accurate predictions on the new task.

Multi-Task Learning

Implicit Geometry and Interaction Embeddings Improve Few-Shot Molecular Property Prediction

1 code implementation4 Feb 2023 Christopher Fifty, Joseph M. Paggi, Ehsan Amid, Jure Leskovec, Ron Dror

However, many important molecular properties depend on complex molecular characteristics -- such as the various 3D geometries a molecule may adopt or the types of chemical interactions it can form -- that are not explicitly encoded in the feature space and must be approximated from low amounts of data.

Few-Shot Learning Molecular Docking +4

Zero-shot causal learning

1 code implementation NeurIPS 2023 Hamed Nilforoshan, Michael Moor, Yusuf Roohani, Yining Chen, Anja Šurina, Michihiro Yasunaga, Sara Oblak, Jure Leskovec

There are a large number of methods to predict the effect of an existing intervention based on historical data from individuals who received it.

Marketing Meta-Learning

Learning Graph Search Heuristics

no code implementations Learning on Graphs 2022 Michal Pándy, Weikang Qiu, Gabriele Corso, Petar Veličković, Rex Ying, Jure Leskovec, Pietro Liò

At training time, we aggregate datasets of search trajectories and ground-truth shortest path distances, which we use to train a specialized graph neural network-based heuristic function using backpropagation through steps of the pathfinding process.

Graph Representation Learning Imitation Learning

Retrieval-Augmented Multimodal Language Modeling

no code implementations22 Nov 2022 Michihiro Yasunaga, Armen Aghajanyan, Weijia Shi, Rich James, Jure Leskovec, Percy Liang, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih

To integrate knowledge in a more scalable and modular way, we propose a retrieval-augmented multimodal model, which enables a base multimodal model (generator) to refer to relevant text and images fetched by a retriever from external memory (e. g., documents on the web).

Caption Generation Image Captioning +5

TuneUp: A Simple Improved Training Strategy for Graph Neural Networks

no code implementations26 Oct 2022 Weihua Hu, Kaidi Cao, Kexin Huang, Edward W Huang, Karthik Subbian, Kenji Kawaguchi, Jure Leskovec

Extensive evaluation of TuneUp on five diverse GNN architectures, three types of prediction tasks, and both transductive and inductive settings shows that TuneUp significantly improves the performance of the base GNN on tail nodes, while often even improving the performance on head nodes.

Data Augmentation

Efficient Automatic Machine Learning via Design Graphs

1 code implementation21 Oct 2022 Shirley Wu, Jiaxuan You, Jure Leskovec, Rex Ying

FALCON features 1) a task-agnostic module, which performs message passing on the design graph via a Graph Neural Network (GNN), and 2) a task-specific module, which conducts label propagation of the known model performance information on the design graph.

AutoML Graph Classification +1

ROLAND: Graph Learning Framework for Dynamic Graphs

2 code implementations15 Aug 2022 Jiaxuan You, Tianyu Du, Jure Leskovec

Finally, we propose a scalable and efficient training approach for dynamic GNNs via incremental training and meta-learning.

Graph Learning Graph Representation Learning +2

ViRel: Unsupervised Visual Relations Discovery with Graph-level Analogy

no code implementations4 Jul 2022 Daniel Zeng, Tailin Wu, Jure Leskovec

Here, we introduce ViRel, a method for unsupervised discovery and learning of Visual Relations with graph-level analogy.

Relation Relation Classification

ZeroC: A Neuro-Symbolic Model for Zero-shot Concept Recognition and Acquisition at Inference Time

1 code implementation30 Jun 2022 Tailin Wu, Megan Tjandrasuwita, Zhengxuan Wu, Xuelin Yang, Kevin Liu, Rok Sosič, Jure Leskovec

In this work, we introduce Zero-shot Concept Recognition and Acquisition (ZeroC), a neuro-symbolic architecture that can recognize and acquire novel concepts in a zero-shot way.

Novel Concepts

Learning to Accelerate Partial Differential Equations via Latent Global Evolution

1 code implementation15 Jun 2022 Tailin Wu, Takashi Maruyama, Jure Leskovec

We test our method in a 1D benchmark of nonlinear PDEs, 2D Navier-Stokes flows into turbulent phase and an inverse optimization of boundary conditions in 2D Navier-Stokes flow.

Weather Forecasting

Learning Large-scale Subsurface Simulations with a Hybrid Graph Network Simulator

no code implementations15 Jun 2022 Tailin Wu, Qinchen Wang, Yinan Zhang, Rex Ying, Kaidi Cao, Rok Sosič, Ridwan Jalali, Hassan Hamam, Marko Maucec, Jure Leskovec

To model complex reservoir dynamics at both local and global scale, HGNS consists of a subsurface graph neural network (SGNN) to model the evolution of fluid flows, and a 3D-U-Net to model the evolution of pressure.

Decision Making

Learning Backward Compatible Embeddings

1 code implementation7 Jun 2022 Weihua Hu, Rajas Bansal, Kaidi Cao, Nikhil Rao, Karthik Subbian, Jure Leskovec

We formalize the problem where the goal is for the embedding team to keep updating the embedding version, while the consumer teams do not have to retrain their models.

Fraud Detection Product Recommendation +1

VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering

no code implementations ICCV 2023 Yanan Wang, Michihiro Yasunaga, Hongyu Ren, Shinya Wada, Jure Leskovec

Visual question answering (VQA) requires systems to perform concept-level reasoning by unifying unstructured (e. g., the context in question and answer; "QA context") and structured (e. g., knowledge graph for the QA context and scene; "concept graph") multimodal knowledge.

Knowledge Graphs Question Answering +1

PinnerFormer: Sequence Modeling for User Representation at Pinterest

no code implementations9 May 2022 Nikil Pancha, Andrew Zhai, Jure Leskovec, Charles Rosenberg

Sequential models have become increasingly popular in powering personalized recommendation systems over the past several years.

Recommendation Systems

AdaGrid: Adaptive Grid Search for Link Prediction Training Objective

1 code implementation30 Mar 2022 Tim Poštuvan, Jiaxuan You, Mohammadreza Banaei, Rémi Lebret, Jure Leskovec

To mitigate these limitations, we propose Adaptive Grid Search (AdaGrid), which dynamically adjusts the edge message ratio during training.

BIG-bench Machine Learning Link Prediction

GreaseLM: Graph REASoning Enhanced Language Models for Question Answering

1 code implementation21 Jan 2022 Xikun Zhang, Antoine Bosselut, Michihiro Yasunaga, Hongyu Ren, Percy Liang, Christopher D. Manning, Jure Leskovec

Answering complex questions about textual narratives requires reasoning over both stated context and the world knowledge that underlies it.

Knowledge Graphs Negation +2

Extending the WILDS Benchmark for Unsupervised Adaptation

1 code implementation ICLR 2022 Shiori Sagawa, Pang Wei Koh, Tony Lee, Irena Gao, Sang Michael Xie, Kendrick Shen, Ananya Kumar, Weihua Hu, Michihiro Yasunaga, Henrik Marklund, Sara Beery, Etienne David, Ian Stavness, Wei Guo, Jure Leskovec, Kate Saenko, Tatsunori Hashimoto, Sergey Levine, Chelsea Finn, Percy Liang

Unlabeled data can be a powerful point of leverage for mitigating these distribution shifts, as it is frequently much more available than labeled data and can often be obtained from distributions beyond the source distribution as well.

SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs

1 code implementation28 Oct 2021 Hongyu Ren, Hanjun Dai, Bo Dai, Xinyun Chen, Denny Zhou, Jure Leskovec, Dale Schuurmans

There are two important reasoning tasks on KGs: (1) single-hop knowledge graph completion, which involves predicting individual links in the KG; and (2), multi-hop reasoning, where the goal is to predict which KG entities satisfy a given logical query.

Scheduling

Modeling Heterogeneous Hierarchies with Relation-specific Hyperbolic Cones

1 code implementation NeurIPS 2021 Yushi Bai, Rex Ying, Hongyu Ren, Jure Leskovec

Here we present ConE (Cone Embedding), a KG embedding model that is able to simultaneously model multiple hierarchical as well as non-hierarchical relations in a knowledge graph.

Ancestor-descendant prediction Knowledge Graph Completion +2

GreaseLM: Graph REASoning Enhanced Language Models

no code implementations ICLR 2022 Xikun Zhang, Antoine Bosselut, Michihiro Yasunaga, Hongyu Ren, Percy Liang, Christopher D Manning, Jure Leskovec

Answering complex questions about textual narratives requires reasoning over both stated context and the world knowledge that underlies it.

Knowledge Graphs Negation +2

Leveraging the Cell Ontology to classify unseen cell types

1 code implementation Nature Communications 2021 Sheng Wang, Angela Oliveira Pisco, Aaron McGeever, Maria Brbic, Marinka Zitnik, Spyros Darmanis, Jure Leskovec, Jim Karkanias, Russ B. Altman

Single cell technologies are rapidly generating large amounts of data that enables us to understand biological systems at single-cell resolution.

Neural Distance Embeddings for Biological Sequences

1 code implementation NeurIPS 2021 Gabriele Corso, Rex Ying, Michal Pándy, Petar Veličković, Jure Leskovec, Pietro Liò

The development of data-dependent heuristics and representations for biological sequences that reflect their evolutionary distance is critical for large-scale biological research.

Multiple Sequence Alignment

LM-Critic: Language Models for Unsupervised Grammatical Error Correction

2 code implementations EMNLP 2021 Michihiro Yasunaga, Jure Leskovec, Percy Liang

Training a model for grammatical error correction (GEC) requires a set of labeled ungrammatical / grammatical sentence pairs, but manually annotating such pairs can be expensive.

Grammatical Error Correction Language Modelling +2

On the Opportunities and Risks of Foundation Models

2 code implementations16 Aug 2021 Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh, Li Fei-Fei, Chelsea Finn, Trevor Gale, Lauren Gillespie, Karan Goel, Noah Goodman, Shelby Grossman, Neel Guha, Tatsunori Hashimoto, Peter Henderson, John Hewitt, Daniel E. Ho, Jenny Hong, Kyle Hsu, Jing Huang, Thomas Icard, Saahil Jain, Dan Jurafsky, Pratyusha Kalluri, Siddharth Karamcheti, Geoff Keeling, Fereshte Khani, Omar Khattab, Pang Wei Koh, Mark Krass, Ranjay Krishna, Rohith Kuditipudi, Ananya Kumar, Faisal Ladhak, Mina Lee, Tony Lee, Jure Leskovec, Isabelle Levent, Xiang Lisa Li, Xuechen Li, Tengyu Ma, Ali Malik, Christopher D. Manning, Suvir Mirchandani, Eric Mitchell, Zanele Munyikwa, Suraj Nair, Avanika Narayan, Deepak Narayanan, Ben Newman, Allen Nie, Juan Carlos Niebles, Hamed Nilforoshan, Julian Nyarko, Giray Ogut, Laurel Orr, Isabel Papadimitriou, Joon Sung Park, Chris Piech, Eva Portelance, Christopher Potts, aditi raghunathan, Rob Reich, Hongyu Ren, Frieda Rong, Yusuf Roohani, Camilo Ruiz, Jack Ryan, Christopher Ré, Dorsa Sadigh, Shiori Sagawa, Keshav Santhanam, Andy Shih, Krishnan Srinivasan, Alex Tamkin, Rohan Taori, Armin W. Thomas, Florian Tramèr, Rose E. Wang, William Wang, Bohan Wu, Jiajun Wu, Yuhuai Wu, Sang Michael Xie, Michihiro Yasunaga, Jiaxuan You, Matei Zaharia, Michael Zhang, Tianyi Zhang, Xikun Zhang, Yuhui Zhang, Lucia Zheng, Kaitlyn Zhou, Percy Liang

AI is undergoing a paradigm shift with the rise of models (e. g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks.

Transfer Learning

Combiner: Full Attention Transformer with Sparse Computation Cost

2 code implementations NeurIPS 2021 Hongyu Ren, Hanjun Dai, Zihang Dai, Mengjiao Yang, Jure Leskovec, Dale Schuurmans, Bo Dai

However, the key limitation of transformers is their quadratic memory and time complexity $\mathcal{O}(L^2)$ with respect to the sequence length in attention layers, which restricts application in extremely long sequences.

Image Generation Language Modelling

GNNAutoScale: Scalable and Expressive Graph Neural Networks via Historical Embeddings

1 code implementation10 Jun 2021 Matthias Fey, Jan E. Lenssen, Frank Weichert, Jure Leskovec

We present GNNAutoScale (GAS), a framework for scaling arbitrary message-passing GNNs to large graphs.

QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering

4 code implementations NAACL 2021 Michihiro Yasunaga, Hongyu Ren, Antoine Bosselut, Percy Liang, Jure Leskovec

The problem of answering questions using knowledge from pre-trained language models (LMs) and knowledge graphs (KGs) presents two challenges: given a QA context (question and answer choice), methods need to (i) identify relevant knowledge from large KGs, and (ii) perform joint reasoning over the QA context and KG.

Graph Representation Learning Knowledge Graphs +5

OGB-LSC: A Large-Scale Challenge for Machine Learning on Graphs

6 code implementations17 Mar 2021 Weihua Hu, Matthias Fey, Hongyu Ren, Maho Nakata, Yuxiao Dong, Jure Leskovec

Enabling effective and efficient machine learning (ML) over large-scale graph data (e. g., graphs with billions of edges) can have a great impact on both industrial and scientific applications.

BIG-bench Machine Learning Graph Learning +4

Therapeutics Data Commons: Machine Learning Datasets and Tasks for Drug Discovery and Development

2 code implementations18 Feb 2021 Kexin Huang, Tianfan Fu, Wenhao Gao, Yue Zhao, Yusuf Roohani, Jure Leskovec, Connor W. Coley, Cao Xiao, Jimeng Sun, Marinka Zitnik

Here, we introduce Therapeutics Data Commons (TDC), the first unifying platform to systematically access and evaluate machine learning across the entire range of therapeutics.

BIG-bench Machine Learning Drug Discovery

Driver2vec: Driver Identification from Automotive Data

no code implementations10 Feb 2021 Jingbo Yang, Ruge Zhao, Meixian Zhu, David Hallac, Jaka Sodnik, Jure Leskovec

In this paper, we develop a deep learning architecture (Driver2vec) to map a short interval of driving data into an embedding space that represents the driver's behavior to assist in driver identification.

Autonomous Driving Driver Identification

Open-World Semi-Supervised Learning

1 code implementation ICLR 2022 Kaidi Cao, Maria Brbic, Jure Leskovec

Here, we introduce a novel open-world semi-supervised learning setting that formalizes the notion that novel classes may appear in the unlabeled test data.

Image Classification Novel Object Detection +1

Identity-aware Graph Neural Networks

1 code implementation25 Jan 2021 Jiaxuan You, Jonathan Gomes-Selman, Rex Ying, Jure Leskovec

However, the expressive power of existing GNNs is upper-bounded by the 1-Weisfeiler-Lehman (1-WL) graph isomorphism test, which means GNNs that are not able to predict node clustering coefficients and shortest path distances, and cannot differentiate between different d-regular graphs.

Graph Classification Graph Property Prediction +3

ForceNet: A Graph Neural Network for Large-Scale Quantum Chemistry Simulation

no code implementations1 Jan 2021 Weihua Hu, Muhammed Shuaibi, Abhishek Das, Siddharth Goyal, Anuroop Sriram, Jure Leskovec, Devi Parikh, Larry Zitnick

We use ForceNet to perform quantum chemistry simulations, where ForceNet is able to achieve 4x higher success rate than existing ML models.

Atomic Forces

Coresets for Robust Training of Deep Neural Networks against Noisy Labels

no code implementations NeurIPS 2020 Baharan Mirzasoleiman, Kaidi Cao, Jure Leskovec

Modern neural networks have the capacity to overfit noisy labels frequently found in real-world datasets.

Design Space for Graph Neural Networks

2 code implementations NeurIPS 2020 Jiaxuan You, Rex Ying, Jure Leskovec

However, current research focuses on proposing and evaluating specific architectural designs of GNNs, as opposed to studying the more general design space of GNNs that consists of a Cartesian product of different design dimensions, such as the number of layers or the type of the aggregation function.

Management

F-FADE: Frequency Factorization for Anomaly Detection in Edge Streams

1 code implementation9 Nov 2020 Yen-Yu Chang, Pan Li, Rok Sosic, M. H. Afifi, Marco Schweighauser, Jure Leskovec

Edge streams are commonly used to capture interactions in dynamic networks, such as email, social, or computer networks.

Anomaly Detection Anomaly Detection in Edge Streams

Handling Missing Data with Graph Representation Learning

1 code implementation NeurIPS 2020 Jiaxuan You, Xiaobai Ma, Daisy Yi Ding, Mykel Kochenderfer, Jure Leskovec

GRAPE tackles the missing data problem using a graph representation, where the observations and features are viewed as two types of nodes in a bipartite graph, and the observed feature values as edges.

Graph Representation Learning Imputation

Graph Information Bottleneck

1 code implementation NeurIPS 2020 Tailin Wu, Hongyu Ren, Pan Li, Jure Leskovec

We design two sampling algorithms for structural regularization and instantiate the GIB principle with two new models: GIB-Cat and GIB-Bern, and demonstrate the benefits by evaluating the resilience to adversarial attacks.

Representation Learning

Multi-hop Attention Graph Neural Network

1 code implementation29 Sep 2020 Guangtao Wang, Rex Ying, Jing Huang, Jure Leskovec

Currently, at every layer, attention is computed between connected pairs of nodes and depends solely on the representation of the two nodes.

Graph Representation Learning Knowledge Graph Completion +1

Inductive Learning on Commonsense Knowledge Graph Completion

1 code implementation19 Sep 2020 Bin Wang, Guangtao Wang, Jing Huang, Jiaxuan You, Jure Leskovec, C. -C. Jay Kuo

Here, we propose to study the inductive learning setting for CKG completion where unseen entities may present at test time.

Entity Embeddings Knowledge Graph Completion +2

OCEAN: Online Task Inference for Compositional Tasks with Context Adaptation

1 code implementation17 Aug 2020 Hongyu Ren, Yuke Zhu, Jure Leskovec, Anima Anandkumar, Animesh Garg

We propose a variational inference framework OCEAN to perform online task inference for compositional tasks.

Variational Inference

Concept Learners for Few-Shot Learning

2 code implementations ICLR 2021 Kaidi Cao, Maria Brbic, Jure Leskovec

Developing algorithms that are able to generalize to a novel task given only a few labeled examples represents a fundamental challenge in closing the gap between machine- and human-level performance.

Few-Shot Learning Fine-Grained Image Classification

Graph Structure of Neural Networks

3 code implementations ICML 2020 Jiaxuan You, Jure Leskovec, Kaiming He, Saining Xie

Neural networks are often represented as graphs of connections between neurons.

Clustering

Neural Subgraph Matching

no code implementations6 Jul 2020 Rex, Ying, Zhaoyu Lou, Jiaxuan You, Chengtao Wen, Arquimedes Canedo, Jure Leskovec

Subgraph matching is the problem of determining the presence and location(s) of a given query graph in a large target graph.

Improving Query Safety at Pinterest

no code implementations20 Jun 2020 Abhijit Mahabal, Yinrui Li, Rajat Raina, Daniel Sun, Revati Mahajan, Jure Leskovec

Identifying unsafe queries is necessary to protect users from inappropriate query suggestions.

M2P2: Multimodal Persuasion Prediction using Adaptive Fusion

no code implementations3 Jun 2020 Chongyang Bai, Haipeng Chen, Srijan Kumar, Jure Leskovec, V. S. Subrahmanian

Our M2P2 (Multimodal Persuasion Prediction) framework is the first to use multimodal (acoustic, visual, language) data to solve the IPP problem.

Graphs over Time: Densification Laws, Shrinking Diameters and Possible Explanations

1 code implementation ‎‎‏‏‎ ‎ 2020 Jure Leskovec, Jon Kleinberg, Christos Faloutsos

We provide a new graph generator, based on a "forest fire" spreading process, that has a simple, intuitive justification, requires very few parameters (like the "flammability" of nodes), and produces graphs exhibiting the full range of properties observed both in prior work and in the present study.

Graph Generation

Open Graph Benchmark: Datasets for Machine Learning on Graphs

20 code implementations NeurIPS 2020 Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, Jure Leskovec

We present the Open Graph Benchmark (OGB), a diverse set of challenging and realistic benchmark datasets to facilitate scalable, robust, and reproducible graph machine learning (ML) research.

Knowledge Graphs Node Property Prediction

Learning to Simulate Complex Physics with Graph Networks

12 code implementations ICML 2020 Alvaro Sanchez-Gonzalez, Jonathan Godwin, Tobias Pfaff, Rex Ying, Jure Leskovec, Peter W. Battaglia

Here we present a machine learning framework and model implementation that can learn to simulate a wide variety of challenging physical domains, involving fluids, rigid solids, and deformable materials interacting with one another.

Unifying Graph Convolutional Neural Networks and Label Propagation

2 code implementations17 Feb 2020 Hongwei Wang, Jure Leskovec

Both solve the task of node classification but LPA propagates node label information across the edges of the graph, while GCN propagates and transforms node feature information.

Classification General Classification +1

Relational Message Passing for Knowledge Graph Completion

4 code implementations17 Feb 2020 Hongwei Wang, Hongyu Ren, Jure Leskovec

Specifically, two kinds of neighborhood topology are modeled for a given entity pair under the relational message passing framework: (1) Relational context, which captures the relation types of edges adjacent to the given entity pair; (2) Relational paths, which characterize the relative position between the given two entities in the knowledge graph.

Knowledge Graph Completion Relation

Query2box: Reasoning over Knowledge Graphs in Vector Space using Box Embeddings

6 code implementations ICLR 2020 Hongyu Ren, Weihua Hu, Jure Leskovec

Our main insight is that queries can be embedded as boxes (i. e., hyper-rectangles), where a set of points inside the box corresponds to a set of answer entities of the query.

Complex Query Answering

G2SAT: Learning to Generate SAT Formulas

1 code implementation NeurIPS 2019 Jiaxuan You, Haoze Wu, Clark Barrett, Raghuram Ramanujan, Jure Leskovec

The Boolean Satisfiability (SAT) problem is the canonical NP-complete problem and is fundamental to computer science, with a wide array of applications in planning, verification, and theorem proving.

Automated Theorem Proving

Hyperbolic Graph Convolutional Neural Networks

3 code implementations NeurIPS 2019 Ines Chami, Rex Ying, Christopher Ré, Jure Leskovec

Here we propose Hyperbolic Graph Convolutional Neural Network (HGCN), the first inductive hyperbolic GCN that leverages both the expressiveness of GCNs and hyperbolic geometry to learn inductive node representations for hierarchical and scale-free graphs.

 Ranked #1 on Link Prediction on PPI (Accuracy metric)

Link Prediction Node Classification

Improving Graph Attention Networks with Large Margin-based Constraints

no code implementations25 Oct 2019 Guangtao Wang, Rex Ying, Jing Huang, Jure Leskovec

Graph Attention Networks (GATs) are the state-of-the-art neural architecture for representation learning with graphs.

Graph Attention Representation Learning

Coresets for Accelerating Incremental Gradient Methods

no code implementations25 Sep 2019 Baharan Mirzasoleiman, Jeff Bilmes, Jure Leskovec

But because at each epoch the gradients are computed only on the subset S, we obtain a speedup that is inversely proportional to the size of S. Our subset selection algorithm is fully general and can be applied to most IG methods.

Open-Ended Question Answering

Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks

1 code implementation3 Aug 2019 Srijan Kumar, Xikun Zhang, Jure Leskovec

However, existing dynamic embedding methods generate embeddings only when users take actions and do not explicitly model the future trajectory of the user/item in the embedding space.

Representation Learning

Selection via Proxy: Efficient Data Selection for Deep Learning

1 code implementation ICLR 2020 Cody Coleman, Christopher Yeh, Stephen Mussmann, Baharan Mirzasoleiman, Peter Bailis, Percy Liang, Jure Leskovec, Matei Zaharia

By removing hidden layers from the target model, using smaller architectures, and training for fewer epochs, we create proxies that are an order of magnitude faster to train.

Active Learning Computational Efficiency

Position-aware Graph Neural Networks

2 code implementations11 Jun 2019 Jiaxuan You, Rex Ying, Jure Leskovec

Learning node embeddings that capture a node's position within the broader graph structure is crucial for many prediction tasks on graphs.

Community Detection Link Prediction +1

Redundancy-Free Computation Graphs for Graph Neural Networks

no code implementations9 Jun 2019 Zhihao Jia, Sina Lin, Rex Ying, Jiaxuan You, Jure Leskovec, Alex Aiken

Graph Neural Networks (GNNs) are based on repeated aggregations of information across nodes' neighbors in a graph.

Coresets for Data-efficient Training of Machine Learning Models

3 code implementations ICML 2020 Baharan Mirzasoleiman, Jeff Bilmes, Jure Leskovec

Here we develop CRAIG, a method to select a weighted subset (or coreset) of training data that closely estimates the full gradient by maximizing a submodular function.

BIG-bench Machine Learning Open-Ended Question Answering

Strategies for Pre-training Graph Neural Networks

10 code implementations ICLR 2020 Weihua Hu, Bowen Liu, Joseph Gomes, Marinka Zitnik, Percy Liang, Vijay Pande, Jure Leskovec

Many applications of machine learning require a model to make accurate pre-dictions on test examples that are distributionally different from training ones, while task-specific labels are scarce during training.

Graph Classification Molecular Property Prediction +4

Select Via Proxy: Efficient Data Selection For Training Deep Networks

no code implementations ICLR 2019 Cody Coleman, Stephen Mussmann, Baharan Mirzasoleiman, Peter Bailis, Percy Liang, Jure Leskovec, Matei Zaharia

In our approach, we first train a small proxy model quickly, which we then use to estimate the utility of individual training data points, and then select the most informative ones for training the large target model.

BIG-bench Machine Learning Image Classification +1

Hierarchical Temporal Convolutional Networks for Dynamic Recommender Systems

no code implementations8 Apr 2019 Jiaxuan You, Yichen Wang, Aditya Pal, Pong Eksombatchai, Chuck Rosenberg, Jure Leskovec

Recommender systems that can learn from cross-session data to dynamically predict the next item a user will choose are crucial for online platforms.

Session-Based Recommendations

GNNExplainer: Generating Explanations for Graph Neural Networks

10 code implementations NeurIPS 2019 Rex Ying, Dylan Bourgeois, Jiaxuan You, Marinka Zitnik, Jure Leskovec

We formulate GNNExplainer as an optimization task that maximizes the mutual information between a GNN's prediction and distribution of possible subgraph structures.

BIG-bench Machine Learning Explainable artificial intelligence +2

Learning Dynamic Embeddings from Temporal Interactions

no code implementations6 Dec 2018 Srijan Kumar, Xikun Zhang, Jure Leskovec

Here we present JODIE, a coupled recurrent model to jointly learn the dynamic embeddings of users and items from a sequence of user-item interactions.

Representation Learning

Complete the Look: Scene-based Complementary Product Recommendation

1 code implementation CVPR 2019 Wang-Cheng Kang, Eric Kim, Jure Leskovec, Charles Rosenberg, Julian McAuley

We design an approach to extract training data for this task, and propose a novel way to learn the scene-product compatibility from fashion or interior design images.

Product Recommendation

How Powerful are Graph Neural Networks?

18 code implementations ICLR 2019 Keyulu Xu, Weihua Hu, Jure Leskovec, Stefanie Jegelka

Here, we present a theoretical framework for analyzing the expressive power of GNNs to capture different graph structures.

General Classification Graph Classification +3

MASA: Motif-Aware State Assignment in Noisy Time Series Data

1 code implementation6 Sep 2018 Saachi Jain, David Hallac, Rok Sosic, Jure Leskovec

Such data can be interpreted as a sequence of states, where each state represents a prototype of system behavior.

Clustering Time Series +1

Inferring Multidimensional Rates of Aging from Cross-Sectional Data

1 code implementation12 Jul 2018 Emma Pierson, Pang Wei Koh, Tatsunori Hashimoto, Daphne Koller, Jure Leskovec, Nicholas Eriksson, Percy Liang

Motivated by the study of human aging, we present an interpretable latent-variable model that learns temporal dynamics from cross-sectional data.

Human Aging Time Series +1

Hierarchical Graph Representation Learning with Differentiable Pooling

14 code implementations NeurIPS 2018 Rex Ying, Jiaxuan You, Christopher Morris, Xiang Ren, William L. Hamilton, Jure Leskovec

Recently, graph neural networks (GNNs) have revolutionized the field of graph representation learning through effectively learned node embeddings, and achieved state-of-the-art results in tasks such as node classification and link prediction.

General Classification Graph Classification +3

Drive2Vec: Multiscale State-Space Embedding of Vehicular Sensor Data

no code implementations12 Jun 2018 David Hallac, Suvrat Bhooshan, Michael Chen, Kacem Abida, Rok Sosic, Jure Leskovec

With automobiles becoming increasingly reliant on sensors to perform various driving tasks, it is important to encode the relevant CAN bus sensor data in a way that captures the general state of the vehicle in a compact form.

Graph Convolutional Policy Network for Goal-Directed Molecular Graph Generation

2 code implementations NeurIPS 2018 Jiaxuan You, Bowen Liu, Rex Ying, Vijay Pande, Jure Leskovec

Generating novel graph structures that optimize given objectives while obeying some given underlying rules is fundamental for chemistry, biology and social science research.

Graph Generation Molecular Graph Generation

Graph Convolutional Neural Networks for Web-Scale Recommender Systems

5 code implementations6 Jun 2018 Rex Ying, Ruining He, Kai-Feng Chen, Pong Eksombatchai, William L. Hamilton, Jure Leskovec

We develop a data-efficient Graph Convolutional Network (GCN) algorithm PinSage, which combines efficient random walks and graph convolutions to generate embeddings of nodes (i. e., items) that incorporate both graph structure as well as node feature information.

Recommendation Systems

Embedding Logical Queries on Knowledge Graphs

5 code implementations NeurIPS 2018 William L. Hamilton, Payal Bajaj, Marinka Zitnik, Dan Jurafsky, Jure Leskovec

Learning low-dimensional embeddings of knowledge graphs is a powerful approach used to predict unobserved or missing edges between entities.

Complex Query Answering

Dynamic Network Model from Partial Observations

no code implementations NeurIPS 2018 Elahe Ghalebi, Baharan Mirzasoleiman, Radu Grosu, Jure Leskovec

We propose a novel framework for providing a non-parametric dynamic network model--based on a mixture of coupled hierarchical Dirichlet processes-- based on data capturing cascade node infection times.

Open-Ended Question Answering

Network Enhancement: a general method to denoise weighted biological networks

no code implementations9 May 2018 Bo Wang, Armin Pourshafeie, Marinka Zitnik, Junjie Zhu, Carlos D. Bustamante, Serafim Batzoglou, Jure Leskovec

Networks are ubiquitous in biology where they encode connectivity patterns at all scales of organization, from molecular to the biome.

Denoising

Prioritizing network communities

no code implementations7 May 2018 Marinka Zitnik, Rok Sosic, Jure Leskovec

Community detection identifies candidate modules as hypotheses, which then need to be validated through experiments, such as mutagenesis in a biological laboratory.

Community Detection

Community Interaction and Conflict on the Web

no code implementations9 Mar 2018 Srijan Kumar, William L. Hamilton, Jure Leskovec, Dan Jurafsky

Here we study intercommunity interactions across 36, 000 communities on Reddit, examining cases where users of one community are mobilized by negative sentiment to comment in another community.

GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models

3 code implementations ICML 2018 Jiaxuan You, Rex Ying, Xiang Ren, William L. Hamilton, Jure Leskovec

Modeling and generating graphs is fundamental for studying networks in biology, engineering, and social sciences.

Graph Generation

Modeling polypharmacy side effects with graph convolutional networks

1 code implementation2 Feb 2018 Marinka Zitnik, Monica Agrawal, Jure Leskovec

The approach constructs a multimodal graph of protein-protein interactions, drug-protein target interactions, and the polypharmacy side effects, which are represented as drug-drug interactions, where each side effect is an edge of a different type.

Link Prediction

Spectral Graph Wavelets for Structural Role Similarity in Networks

no code implementations ICLR 2018 Claire Donnat, Marinka Zitnik, David Hallac, Jure Leskovec

Nodes residing in different parts of a graph can have similar structural roles within their local network topology.

Large-scale analysis of disease pathways in the human interactome

no code implementations3 Dec 2017 Monica Agrawal, Marinka Zitnik, Jure Leskovec

However, we show that higher-order network structures, such as small subgraphs of the pathway, provide a promising direction for the development of new methods.

Learning Structural Node Embeddings Via Diffusion Wavelets

1 code implementation KDD 2018 Claire Donnat, Marinka Zitnik, David Hallac, Jure Leskovec

Nodes residing in different parts of a graph can have similar structural roles within their local network topology.

Data-Driven Model Predictive Control of Autonomous Mobility-on-Demand Systems

no code implementations20 Sep 2017 Ramon Iglesias, Federico Rossi, Kevin Wang, David Hallac, Jure Leskovec, Marco Pavone

The goal of this paper is to present an end-to-end, data-driven framework to control Autonomous Mobility-on-Demand systems (AMoD, i. e. fleets of self-driving vehicles).

Robotics Multiagent Systems Systems and Control Applications

Representation Learning on Graphs: Methods and Applications

no code implementations17 Sep 2017 William L. Hamilton, Rex Ying, Jure Leskovec

Machine learning on graphs is an important and ubiquitous task with applications ranging from drug design to friendship recommendation in social networks.

BIG-bench Machine Learning Dimensionality Reduction +1

Predicting multicellular function through multi-layer tissue networks

1 code implementation14 Jul 2017 Marinka Zitnik, Jure Leskovec

We use OhmNet to study multicellular function in a multi-layer protein interaction network of 107 human tissues.

Interpretable & Explorable Approximations of Black Box Models

no code implementations4 Jul 2017 Himabindu Lakkaraju, Ece Kamar, Rich Caruana, Jure Leskovec

To the best of our knowledge, this is the first approach which can produce global explanations of the behavior of any given black box model through joint optimization of unambiguity, fidelity, and interpretability, while also allowing users to explore model behavior based on their preferences.

Toeplitz Inverse Covariance-Based Clustering of Multivariate Time Series Data

no code implementations10 Jun 2017 David Hallac, Sagar Vare, Stephen Boyd, Jure Leskovec

We derive closed-form solutions to efficiently solve the two resulting subproblems in a scalable way, through dynamic programming and the alternating direction method of multipliers (ADMM), respectively.

Clustering Time Series +1

Driver Identification Using Automobile Sensor Data from a Single Turn

no code implementations9 Jun 2017 David Hallac, Abhijit Sharang, Rainer Stahlmann, Andreas Lamprecht, Markus Huber, Martin Roehder, Rok Sosic, Jure Leskovec

In this paper, we propose a method to predict, from sensor data collected at a single turn, the identity of a driver out of a given set of individuals.

Driver Identification Navigate +2

Inductive Representation Learning on Large Graphs

18 code implementations NeurIPS 2017 William L. Hamilton, Rex Ying, Jure Leskovec

Low-dimensional embeddings of nodes in large graphs have proved extremely useful in a variety of prediction tasks, from content recommendation to identifying protein functions.

Graph Classification Graph Regression +5

Community Identity and User Engagement in a Multi-Community Landscape

no code implementations26 May 2017 Justine Zhang, William L. Hamilton, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky, Jure Leskovec

To this end we introduce a quantitative, language-based typology reflecting two key aspects of a community's identity: how distinctive, and how temporally dynamic it is.

Higher-order clustering in networks

no code implementations12 Apr 2017 Hao Yin, Austin R. Benson, Jure Leskovec

Here we introduce higher-order clustering coefficients that measure the closure probability of higher-order network cliques and provide a more comprehensive view of how the edges of complex networks cluster.

Clustering

An Army of Me: Sockpuppets in Online Discussion Communities

no code implementations21 Mar 2017 Srijan Kumar, Justin Cheng, Jure Leskovec, V. S. Subrahmanian

Further, pairs of sockpuppets controlled by the same individual are more likely to interact on the same discussion at the same time than pairs of ordinary users.

Loyalty in Online Communities

1 code implementation9 Mar 2017 William L. Hamilton, Justine Zhang, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky, Jure Leskovec

In this paper we operationalize loyalty as a user-community relation: users loyal to a community consistently prefer it over all others; loyal communities retain their loyal users over time.

Network Inference via the Time-Varying Graphical Lasso

1 code implementation6 Mar 2017 David Hallac, Youngsuk Park, Stephen Boyd, Jure Leskovec

Many important problems can be modeled as a system of interconnected entities, where each entity is recording time-dependent observations or measurements.

Time Series Time Series Analysis

Motifs in Temporal Networks

no code implementations29 Dec 2016 Ashwin Paranjape, Austin R. Benson, Jure Leskovec

Networks are a fundamental tool for modeling complex systems in a variety of domains including social and communication networks as well as biology and neuroscience.

Higher-order organization of complex networks

no code implementations26 Dec 2016 Austin R. Benson, David F. Gleich, Jure Leskovec

Many networks are known to exhibit rich, lower-order connectivity patterns that can be captured at the level of individual nodes and edges.

Social and Information Networks Discrete Mathematics Physics and Society

Confusions over Time: An Interpretable Bayesian Model to Characterize Trends in Decision Making

no code implementations NeurIPS 2016 Himabindu Lakkaraju, Jure Leskovec

We propose Confusions over Time (CoT), a novel generative framework which facilitates a multi-granular analysis of the decision making process.

Decision Making

node2vec: Scalable Feature Learning for Networks

18 code implementations3 Jul 2016 Aditya Grover, Jure Leskovec

Taken together, our work represents a new way for efficiently learning state-of-the-art task-independent representations in complex networks.

Link Prediction Multi-Label Classification +2

Cultural Shift or Linguistic Drift? Comparing Two Computational Measures of Semantic Change

no code implementations EMNLP 2016 William L. Hamilton, Jure Leskovec, Dan Jurafsky

Words shift in meaning for many reasons, including cultural factors like new technologies and regular linguistic processes like subjectification.

Cultural Vocal Bursts Intensity Prediction

Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change

4 code implementations ACL 2016 William L. Hamilton, Jure Leskovec, Dan Jurafsky

Understanding how words change their meanings over time is key to models of language and cultural evolution, but historical data on meaning is scarce, making theories hard to develop and test.

Diachronic Word Embeddings Word Embeddings

Growing Wikipedia Across Languages via Recommendation

2 code implementations12 Apr 2016 Ellery Wulczyn, Robert West, Leila Zia, Jure Leskovec

The system involves identifying missing articles, ranking the missing articles according to their importance, and recommending important missing articles to editors based on their interests.

Social and Information Networks Digital Libraries

Do Cascades Recur?

no code implementations2 Feb 2016 Justin Cheng, Lada A. Adamic, Jon Kleinberg, Jure Leskovec

In this paper, we perform a large-scale analysis of cascades on Facebook over significantly longer time scales, and find that a more complex picture emerges, in which many large cascades recur, exhibiting multiple bursts of popularity with periods of quiescence in between.

SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity

1 code implementation8 Jun 2015 Qingyuan Zhao, Murat A. Erdogdu, Hera Y. He, Anand Rajaraman, Jure Leskovec

Social networking websites allow users to create and share content.

Social and Information Networks Physics and Society Applications 60G55, 62P25 H.2.8

QUOTUS: The Structure of Political Media Coverage as Revealed by Quoting Patterns

no code implementations6 Apr 2015 Vlad Niculae, Caroline Suen, Justine Zhang, Cristian Danescu-Niculescu-Mizil, Jure Leskovec

By encoding bias patterns in a low-rank space we provide an analysis of the structure of political media coverage.

Antisocial Behavior in Online Discussion Communities

no code implementations2 Apr 2015 Justin Cheng, Cristian Danescu-Niculescu-Mizil, Jure Leskovec

User contributions in the form of posts, comments, and votes are essential to the success of online communities.

Exploiting Social Network Structure for Person-to-Person Sentiment Analysis

no code implementations TACL 2014 Robert West, Hristo S. Paskov, Jure Leskovec, Christopher Potts

Person-to-person evaluations are prevalent in all kinds of discourse and important for establishing reputations, building social bonds, and shaping public opinion.

Decision Making Sentiment Analysis

How Community Feedback Shapes User Behavior

no code implementations6 May 2014 Justin Cheng, Cristian Danescu-Niculescu-Mizil, Jure Leskovec

Interestingly, the authors that receive no feedback are most likely to leave a community.

Can Cascades be Predicted?

no code implementations18 Mar 2014 Justin Cheng, Lada A. Adamic, P. Alex Dow, Jon Kleinberg, Jure Leskovec

On a large sample of photo reshare cascades on Facebook, we find strong performance in predicting whether a cascade will continue to grow in the future.

Engaging with Massive Online Courses

no code implementations12 Mar 2014 Ashton Anderson, Daniel Huttenlocher, Jon Kleinberg, Jure Leskovec

We also report on a large-scale deployment of badges as incentives for engagement in a MOOC, including randomized experiments in which the presentation of badges was varied across sub-populations.

The Bursty Dynamics of the Twitter Information Network

no code implementations11 Mar 2014 Seth A. Myers, Jure Leskovec

Information diffusion in the form of cascades of post re-sharing often creates such sudden bursts of new connections, which significantly change users' local network structure.

Nonparametric Multi-group Membership Model for Dynamic Networks

no code implementations NeurIPS 2013 Myunghwan Kim, Jure Leskovec

Relational data-like graphs, networks, and matrices-is often dynamic, where the relational structure evolves over time.

Link Prediction

Modeling Information Propagation with Survival Theory

no code implementations15 May 2013 Manuel Gomez Rodriguez, Jure Leskovec, Bernhard Schoelkopf

Networks provide a skeleton for the spread of contagions, like, information, ideas, behaviors and diseases.

From Amateurs to Connoisseurs: Modeling the Evolution of User Expertise through Online Reviews

no code implementations18 Mar 2013 Julian McAuley, Jure Leskovec

Recommending products to consumers means not only understanding their tastes, but also understanding their level of experience.

Structure and Dynamics of Information Pathways in Online Media

no code implementations6 Dec 2012 Manuel Gomez Rodriguez, Jure Leskovec, Bernhard Schölkopf

We assume there is an unobserved dynamic network that changes over time, while we observe the results of a dynamic process spreading over the edges of the network.

Learning to Discover Social Circles in Ego Networks

no code implementations NeurIPS 2012 Jure Leskovec, Julian J. McAuley

We develop a model for detecting circles that combines network structure as well as user profile information.

Clustering Node Clustering

Image Labeling on a Network: Using Social-Network Metadata for Image Classification

no code implementations16 Jul 2012 Julian McAuley, Jure Leskovec

Many of these benchmarks are derived from online photo sharing networks, like Flickr, which in addition to hosting images also provide a highly interactive social community.

General Classification Image Classification +2

Defining and Evaluating Network Communities based on Ground-truth

no code implementations28 May 2012 Jaewon Yang, Jure Leskovec

Identifying such communities of nodes has proven to be a challenging task mainly due to a plethora of definitions of a community, intractability of algorithms, issues with evaluation and the lack of a reliable gold-standard ground-truth.

Social and Information Networks Physics and Society

On the Convexity of Latent Social Network Inference

no code implementations NeurIPS 2010 Seth Myers, Jure Leskovec

Moreover, our approach scales well as it can infer optimal networks on thousands of nodes in a matter of minutes.

Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters

no code implementations8 Oct 2008 Jure Leskovec, Kevin J. Lang, Anirban Dasgupta, Michael W. Mahoney

A large body of work has been devoted to defining and identifying clusters or communities in social and information networks.

Data Structures and Algorithms Data Analysis, Statistics and Probability Physics and Society

Cost-effective Outbreak Detection in Networks

1 code implementation SIGKDD 2007 Jure Leskovec, Andreas Krause, Carlos Guestrin, Christos Faloutsos, Jeanne VanBriesen, Natalie Glance

We show that the approach scales, achieving speedups and savings in storage of several orders of magnitude.

Sampling From Large Graphs

1 code implementation KDD 2006 Jure Leskovec, Christos Faloutsos

Thus graph sampling is essential. The natural questions to ask are (a) which sampling method to use, (b) how small can the sample size be, and (c) how to scale up the measurements of the sample (e. g., the diameter), to get estimates for the large graph.

Graph Sampling Natural Questions

Cannot find the paper you are looking for? You can Submit a new open access paper.