Search Results for author: Chris Cundy

Found 10 papers, 4 papers with code

SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking

no code implementations • 8 Jun 2023 • Chris Cundy, Stefano Ermon

This allows us to minimize a variety of divergences between the distribution of sequences generated by an autoregressive model and sequences from a dataset, including divergences with weight on OOD generated sequences.

Imitation Learning Text Generation

Paper
Add Code

On the Opportunities and Challenges of Foundation Models for Geospatial Artificial Intelligence

no code implementations • 13 Apr 2023 • Gengchen Mai, Weiming Huang, Jin Sun, Suhang Song, Deepak Mishra, Ninghao Liu, Song Gao, Tianming Liu, Gao Cong, Yingjie Hu, Chris Cundy, Ziyuan Li, Rui Zhu, Ni Lao

In this work, we explore the promises and challenges of developing multimodal foundation models for GeoAI.

Few-Shot Learning Scene Classification +3

Paper
Add Code

LMPriors: Pre-Trained Language Models as Task-Specific Priors

no code implementations • 22 Oct 2022 • Kristy Choi, Chris Cundy, Sanjari Srivastava, Stefano Ermon

Particularly in low-data regimes, an outstanding challenge in machine learning is developing principled techniques for augmenting our models with suitable priors.

Causal Inference Common Sense Reasoning +3

Paper
Add Code

Beyond Bayes-optimality: meta-learning what you know you don't know

no code implementations • 30 Sep 2022 • Jordi Grau-Moya, Grégoire Delétang, Markus Kunesch, Tim Genewein, Elliot Catt, Kevin Li, Anian Ruoss, Chris Cundy, Joel Veness, Jane Wang, Marcus Hutter, Christopher Summerfield, Shane Legg, Pedro Ortega

This is in contrast to risk-sensitive agents, which additionally exploit the higher-order moments of the return, and ambiguity-sensitive agents, which act differently when recognizing situations in which they lack knowledge.

Decision Making Meta-Learning

Paper
Add Code

Neural Networks and the Chomsky Hierarchy

2 code implementations • 5 Jul 2022 • Grégoire Delétang, Anian Ruoss, Jordi Grau-Moya, Tim Genewein, Li Kevin Wenliang, Elliot Catt, Chris Cundy, Marcus Hutter, Shane Legg, Joel Veness, Pedro A. Ortega

Reliable generalization lies at the heart of safe ML and AI.

158

Paper
Code

BCD Nets: Scalable Variational Approaches for Bayesian Causal Discovery

1 code implementation • NeurIPS 2021 • Chris Cundy, Aditya Grover, Stefano Ermon

We propose Bayesian Causal Discovery Nets (BCD Nets), a variational inference framework for estimating a distribution over DAGs characterizing a linear-Gaussian SEM.

Causal Discovery Stochastic Optimization +1

Paper
Code

IQ-Learn: Inverse soft-Q Learning for Imitation

5 code implementations • NeurIPS 2021 • Divyansh Garg, Shuvam Chakraborty, Chris Cundy, Jiaming Song, Matthieu Geist, Stefano Ermon

In many sequential decision-making problems (e. g., robotics control, game playing, sequential prediction), human or expert data is available containing useful information about the task.

Ranked #1 on MuJoCo Games on Walker2d

Atari Games Continuous Control +3

185

Paper
Code

Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients

no code implementations • 30 Dec 2020 • Chris Cundy, Rishi Desai, Stefano Ermon

We consider the task of training a policy that maximizes reward while minimizing disclosure of certain sensitive state variables through the actions.

Decision Making

Paper
Add Code

Exploring Hierarchy-Aware Inverse Reinforcement Learning

no code implementations • 13 Jul 2018 • Chris Cundy, Daniel Filan

We introduce a new generative model for human planning under the Bayesian Inverse Reinforcement Learning (BIRL) framework which takes into account the fact that humans often plan using hierarchical strategies.

BIRL reinforcement-learning +1

Paper
Add Code

Parallelizing Linear Recurrent Neural Nets Over Sequence Length

1 code implementation • ICLR 2018 • Eric Martin, Chris Cundy

Recurrent neural networks (RNNs) are widely used to model sequential data but their non-linear dependencies between sequence elements prevent parallelizing training over sequence length.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.