1 code implementation • 17 Jun 2024 • Shirley Wu, Shiyu Zhao, Qian Huang, Kexin Huang, Michihiro Yasunaga, Kaidi Cao, Vassilis N. Ioannidis, Karthik Subbian, Jure Leskovec, James Zou
Large language model (LLM) agents have demonstrated impressive capabilities in utilizing external tools and knowledge to boost accuracy and reduce hallucinations.
1 code implementation • 19 Apr 2024 • Shirley Wu, Shiyu Zhao, Michihiro Yasunaga, Kexin Huang, Kaidi Cao, Qian Huang, Vassilis N. Ioannidis, Karthik Subbian, James Zou, Jure Leskovec
To address the gap, we develop STARK, a large-scale Semi-structure retrieval benchmark on Textual and Relational Knowledge Bases.
1 code implementation • 31 Mar 2024 • Weihua Hu, Yiwen Yuan, Zecheng Zhang, Akihiro Nitta, Kaidi Cao, Vid Kocijan, Jinu Sunil, Jure Leskovec, Matthias Fey
We present PyTorch Frame, a PyTorch-based framework for deep learning over multi-modal tabular data.
Ranked #1 on Binary Classification on kickstarter
1 code implementation • 7 Dec 2023 • Shirley Wu, Kaidi Cao, Bruno Ribeiro, James Zou, Jure Leskovec
GraphMETRO employs a Mixture-of-Experts (MoE) architecture with a gating model and multiple expert models, where each expert model targets a specific distributional shift to produce a referential representation w. r. t.
1 code implementation • NeurIPS 2023 • Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Kaidi Cao, Bahare Fatemi, Mike Burrows, Charith Mendis, Bryan Perozzi
TpuGraphs provides 25x more graphs than the largest graph property prediction dataset (with comparable graph sizes), and 770x larger graphs on average compared to existing performance prediction datasets on machine learning programs.
Ranked #2 on Runtime ranking on TpuGraphs Layout mean
no code implementations • 6 Aug 2023 • Kaidi Cao, Rui Deng, Shirley Wu, Edward W Huang, Karthik Subbian, Jure Leskovec
Here, we introduce CoFree-GNN, a novel distributed GNN training framework that significantly speeds up the training process by implementing communication-free training.
Ranked #2 on Node Classification on Reddit
1 code implementation • NeurIPS 2023 • Kaidi Cao, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Dustin Zelle, Yanqi Zhou, Charith Mendis, Jure Leskovec, Bryan Perozzi
Here we propose Graph Segment Training (GST), a general framework that utilizes a divide-and-conquer approach to allow learning large graph property prediction with a constant memory footprint.
1 code implementation • 14 Mar 2023 • Kaidi Cao, Jiaxuan You, Jiaju Liu, Jure Leskovec
Experiments demonstrate that (i) our proposed task embedding can be computed efficiently, and that tasks with similar embeddings have similar best-performing architectures; (ii) AutoTransfer significantly improves search efficiency with the transferred design priors, reducing the number of explored architectures by an order of magnitude.
1 code implementation • ICLR 2022 • Kaidi Cao, Jiaxuan You, Jure Leskovec
Here we introduce a novel relational multi-task learning setting where we leverage data point labels from auxiliary tasks to make more accurate predictions on the new task.
no code implementations • 26 Oct 2022 • Weihua Hu, Kaidi Cao, Kexin Huang, Edward W Huang, Karthik Subbian, Kenji Kawaguchi, Jure Leskovec
Extensive evaluation of TuneUp on five diverse GNN architectures, three types of prediction tasks, and both transductive and inductive settings shows that TuneUp significantly improves the performance of the base GNN on tail nodes, while often even improving the performance on head nodes.
no code implementations • 15 Jun 2022 • Tailin Wu, Qinchen Wang, Yinan Zhang, Rex Ying, Kaidi Cao, Rok Sosič, Ridwan Jalali, Hassan Hamam, Marko Maucec, Jure Leskovec
To model complex reservoir dynamics at both local and global scale, HGNS consists of a subsurface graph neural network (SGNN) to model the evolution of fluid flows, and a 3D-U-Net to model the evolution of pressure.
1 code implementation • 7 Jun 2022 • Weihua Hu, Rajas Bansal, Kaidi Cao, Nikhil Rao, Karthik Subbian, Jure Leskovec
We formalize the problem where the goal is for the embedding team to keep updating the embedding version, while the consumer teams do not have to retrain their models.
1 code implementation • ICLR 2022 • Kaidi Cao, Maria Brbic, Jure Leskovec
Here, we introduce a novel open-world semi-supervised learning setting that formalizes the notion that novel classes may appear in the unlabeled test data.
Ranked #2 on Open-World Semi-Supervised Learning on ImageNet-100
no code implementations • NeurIPS 2020 • Baharan Mirzasoleiman, Kaidi Cao, Jure Leskovec
Modern neural networks have the capacity to overfit noisy labels frequently found in real-world datasets.
no code implementations • 15 Nov 2020 • Baharan Mirzasoleiman, Kaidi Cao, Jure Leskovec
Modern neural networks have the capacity to overfit noisy labels frequently found in real-world datasets.
Ranked #38 on Image Classification on mini WebVision 1.0
2 code implementations • ICLR 2021 • Kaidi Cao, Maria Brbic, Jure Leskovec
Developing algorithms that are able to generalize to a novel task given only a few labeled examples represents a fundamental challenge in closing the gap between machine- and human-level performance.
1 code implementation • ICLR 2021 • Kaidi Cao, Yining Chen, Junwei Lu, Nikos Arechiga, Adrien Gaidon, Tengyu Ma
Real-world large-scale datasets are heteroskedastic and imbalanced -- labels have varying levels of uncertainty and label distributions are long-tailed.
Ranked #11 on Image Classification on WebVision-1000
no code implementations • ICCV 2019 • Jingwei Ji, Kaidi Cao, Juan Carlos Niebles
Most current methods for training action proposal modules rely on fully supervised approaches that require large amounts of annotated temporal action intervals in long video sequences.
Ranked #3 on Semi-Supervised Action Detection on ActivityNet-1.3
1 code implementation • ICCV 2019 • Yu Rong, Ziwei Liu, Cheng Li, Kaidi Cao, Chen Change Loy
Specifically, we focus on the challenging task of in-the-wild 3D human recovery from single images when paired 3D annotations are not fully available.
no code implementations • CVPR 2020 • Kaidi Cao, Jingwei Ji, Zhangjie Cao, Chien-Yi Chang, Juan Carlos Niebles
In this paper, we propose Temporal Alignment Module (TAM), a novel few-shot learning framework that can learn to classify a previous unseen video.
7 code implementations • NeurIPS 2019 • Kaidi Cao, Colin Wei, Adrien Gaidon, Nikos Arechiga, Tengyu Ma
Deep learning algorithms can fare poorly when the training dataset suffers from heavy class-imbalance but the testing criterion requires good generalization on less frequent classes.
Ranked #4 on Long-tail learning with class descriptors on CUB-LT
1 code implementation • ICLR Workshop DeepGenStruct 2019 • Wayne Wu, Kaidi Cao, Cheng Li, Chen Qian, Chen Change Loy
It is challenging to disentangle an object into two orthogonal spaces of content and style since each can influence the visual observation differently and unpredictably.
no code implementations • CVPR 2019 • Wayne Wu, Kaidi Cao, Cheng Li, Chen Qian, Chen Change Loy
Extensive experiments demonstrate the superior performance of our method to other state-of-the-art approaches, especially in the challenging near-rigid and non-rigid objects translation tasks.
no code implementations • 1 Nov 2018 • Kaidi Cao, Jing Liao, Lu Yuan
Facial caricature is an art form of drawing faces in an exaggerated way to convey humor or sarcasm.
no code implementations • 27 Sep 2018 • Wayne Wu, Kaidi Cao, Cheng Li, Chen Qian, Chen Change Loy
It is challenging to disentangle an object into two orthogonal spaces of structure and appearance since each can influence the visual observation in a different and unpredictable way.
no code implementations • 10 Sep 2018 • Xuan Yang, Mingyu Gao, Jing Pu, Ankita Nayak, Qiaoyi Liu, Steven Emberton Bell, Jeff Ou Setter, Kaidi Cao, Heonjae Ha, Christos Kozyrakis, Mark Horowitz
Many DNN accelerators have been proposed and built using different microarchitectures and program mappings.
Distributed, Parallel, and Cluster Computing
1 code implementation • CVPR 2018 • Kaidi Cao, Yu Rong, Cheng Li, Xiaoou Tang, Chen Change Loy
However, many contemporary face recognition models still perform relatively poor in processing profile faces compared to frontal faces.
Ranked #1 on Face Identification on IJB-A
1 code implementation • 13 Jul 2017 • Yue He, Kaidi Cao, Cheng Li, Chen Change Loy
Given a large number of unlabeled face images, face grouping aims at clustering the images into individual identities present in the data.