Search Results for author: Yiding Jiang

Found 12 papers, 5 papers with code

Predicting the Generalization Gap in Deep Networks with Margin Distributions

2 code implementations • ICLR 2019 • Yiding Jiang, Dilip Krishnan, Hossein Mobahi, Samy Bengio

In this paper, we propose such a measure, and conduct extensive empirical studies on how well it can predict the generalization gap.

32,808

Paper
Code

Language as an Abstraction for Hierarchical Deep Reinforcement Learning

2 code implementations • NeurIPS 2019 • Yiding Jiang, Shixiang Gu, Kevin Murphy, Chelsea Finn

We find that, using our approach, agents can learn to solve to diverse, temporally-extended tasks such as object sorting and multi-object rearrangement, including from raw pixel observations.

Instruction Following Object +2

124

Paper
Code

Fantastic Generalization Measures and Where to Find Them

3 code implementations • ICLR 2020 • Yiding Jiang, Behnam Neyshabur, Hossein Mobahi, Dilip Krishnan, Samy Bengio

We present the first large scale study of generalization in deep networks.

valid

Paper
Code

Observational Overfitting in Reinforcement Learning

no code implementations • ICLR 2020 • Xingyou Song, Yiding Jiang, Stephen Tu, Yilun Du, Behnam Neyshabur

A major component of overfitting in model-free reinforcement learning (RL) involves the case where the agent may mistakenly correlate reward with certain spurious features from the observations generated by the Markov Decision Process (MDP).

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

NeurIPS 2020 Competition: Predicting Generalization in Deep Learning

no code implementations • 14 Dec 2020 • Yiding Jiang, Pierre Foret, Scott Yak, Daniel M. Roy, Hossein Mobahi, Gintare Karolina Dziugaite, Samy Bengio, Suriya Gunasekar, Isabelle Guyon, Behnam Neyshabur

Understanding generalization in deep learning is arguably one of the most important questions in deep learning.

Decision Making Generalization Bounds

Paper
Add Code

Ask & Explore: Grounded Question Answering for Curiosity-Driven Exploration

no code implementations • 24 Apr 2021 • Jivat Neet Kaur, Yiding Jiang, Paul Pu Liang

In many real-world scenarios where extrinsic rewards to the agent are extremely sparse, curiosity has emerged as a useful concept providing intrinsic rewards that enable the agent to explore its environment and acquire information to achieve its goals.

Question Answering

Paper
Add Code

Assessing Generalization of SGD via Disagreement

no code implementations • ICLR 2022 • Yiding Jiang, Vaishnavh Nagarajan, Christina Baek, J. Zico Kolter

We empirically show that the test error of deep networks can be estimated by simply training the same architecture on the same training set but with a different run of Stochastic Gradient Descent (SGD), and measuring the disagreement rate between the two networks on unlabeled test data.

Paper
Add Code

Agreement-on-the-Line: Predicting the Performance of Neural Networks under Distribution Shift

1 code implementation • 27 Jun 2022 • Christina Baek, Yiding Jiang, aditi raghunathan, Zico Kolter

In this paper, we show a similar but surprising phenomenon also holds for the agreement between pairs of neural network classifiers: whenever accuracy-on-the-line holds, we observe that the OOD agreement between the predictions of any two pairs of neural networks (with potentially different architectures) also observes a strong linear correlation with their ID agreement.

Model Selection

Paper
Code

Learning Options via Compression

1 code implementation • 8 Dec 2022 • Yiding Jiang, Evan Zheran Liu, Benjamin Eysenbach, Zico Kolter, Chelsea Finn

Identifying statistical regularities in solutions to some tasks in multi-task reinforcement learning can accelerate the learning of new tasks.

Paper
Code

On the Joint Interaction of Models, Data, and Features

no code implementations • 7 Jun 2023 • Yiding Jiang, Christina Baek, J. Zico Kolter

Thus, we believe this work provides valuable new insight into our understanding of feature learning.

Paper
Add Code

Understanding prompt engineering may not require rethinking generalization

no code implementations • 6 Oct 2023 • Victor Akinwande, Yiding Jiang, Dylan Sam, J. Zico Kolter

Zero-shot learning in prompted vision-language models, the practice of crafting prompts to build classifiers without an explicit training process, has achieved impressive performance in many settings.

Generalization Bounds Language Modelling +3

Paper
Add Code

Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation

no code implementations • 28 Mar 2024 • Yutong He, Alexander Robey, Naoki Murata, Yiding Jiang, Joshua Williams, George J. Pappas, Hamed Hassani, Yuki Mitsufuji, Ruslan Salakhutdinov, J. Zico Kolter

Prompt engineering is effective for controlling the output of text-to-image (T2I) generative models, but it is also laborious due to the need for manually crafted prompts.

In-Context Learning Language Modelling +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.