Search Results for author: Stephan Zheng

Found 33 papers, 18 papers with code

Social Environment Design

1 code implementation21 Feb 2024 Edwin Zhang, Sadie Zhao, Tonghan Wang, Safwan Hossain, Henry Gasztowtt, Stephan Zheng, David C. Parkes, Milind Tambe, YiLing Chen

Artificial Intelligence (AI) holds promise as a technology that can be used to improve government and economic policy-making.

Decision Making

AI For Global Climate Cooperation 2023 Competition Proceedings

no code implementations10 Jul 2023 Yoshua Bengio, Prateek Gupta, Lu Li, Soham Phade, Sunil Srinivasa, Andrew Williams, Tianyu Zhang, Yang Zhang, Stephan Zheng

On the other hand, an interdisciplinary panel of human experts in law, policy, sociology, economics and environmental science, evaluated the solutions qualitatively.

Decision Making Ethics +1

MERMAIDE: Learning to Align Learners using Model-Based Meta-Learning

no code implementations10 Apr 2023 Arundhati Banerjee, Soham Phade, Stefano Ermon, Stephan Zheng

We then show that our model-based meta-learning approach is cost-effective in intervening on bandit agents with unseen explore-exploit strategies.

Meta-Learning

AI for Global Climate Cooperation: Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-N

2 code implementations15 Aug 2022 Tianyu Zhang, Andrew Williams, Soham Phade, Sunil Srinivasa, Yang Zhang, Prateek Gupta, Yoshua Bengio, Stephan Zheng

To facilitate this research, here we introduce RICE-N, a multi-region integrated assessment model that simulates the global climate and economy, and which can be used to design and evaluate the strategic outcomes for different negotiation and agreement frameworks.

Ethics Multi-agent Reinforcement Learning

Solving Dynamic Principal-Agent Problems with a Rationally Inattentive Principal

1 code implementation18 Jan 2022 Tong Mu, Stephan Zheng, Alexander Trott

In a sequential setting with multiple Agents, RIRL shows opposing consequences of the Principal's inattention to different information channels: 1) inattention to Agents' outputs closes wage gaps based on ability differences; and 2) inattention to Agents' efforts induces a social dilemma dynamic in which Agents work harder, but essentially for free.

Multi-agent Reinforcement Learning

Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning

no code implementations3 Jan 2022 Michael Curry, Alexander Trott, Soham Phade, Yu Bai, Stephan Zheng

We validate the learned solutions are $\epsilon$-meta-equilibria through best-response analyses, show that they align with economic intuitions, and show our approach can learn a spectrum of qualitatively distinct $\epsilon$-meta-equilibria in open RBC models.

Multi-agent Reinforcement Learning reinforcement-learning +1

WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU

3 code implementations31 Aug 2021 Tian Lan, Sunil Srinivasa, Huan Wang, Stephan Zheng

We present WarpDrive, a flexible, lightweight, and easy-to-use open-source RL framework that implements end-to-end deep multi-agent RL on a single GPU (Graphics Processing Unit), built on PyCUDA and PyTorch.

Decision Making Multi-agent Reinforcement Learning +2

Building a Foundation for Data-Driven, Interpretable, and Robust Policy Design using the AI Economist

1 code implementation6 Aug 2021 Alexander Trott, Sunil Srinivasa, Douwe van der Wal, Sebastien Haneuse, Stephan Zheng

Here we show that the AI Economist framework enables effective, flexible, and interpretable policy design using two-level reinforcement learning (RL) and data-driven simulations.

Reinforcement Learning (RL)

The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning

1 code implementation5 Aug 2021 Stephan Zheng, Alexander Trott, Sunil Srinivasa, David C. Parkes, Richard Socher

Here we show that machine-learning-based economic simulation is a powerful policy and mechanism design framework to overcome these limitations.

counterfactual reinforcement-learning +1

Robustness Gym: Unifying the NLP Evaluation Landscape

2 code implementations NAACL 2021 Karan Goel, Nazneen Rajani, Jesse Vig, Samson Tan, Jason Wu, Stephan Zheng, Caiming Xiong, Mohit Bansal, Christopher Ré

Despite impressive performance on standard benchmarks, deep neural networks are often brittle when deployed in real-world systems.

Entity Linking

ERMAS: Learning Policies Robust to Reality Gaps in Multi-Agent Simulations

no code implementations1 Jan 2021 Eric Zhao, Alexander R Trott, Caiming Xiong, Stephan Zheng

Policies for real-world multi-agent problems, such as optimal taxation, can be learned in multi-agent simulations with AI agents that emulate humans.

Meta-Learning

The Rise of AI-Driven Simulators: Building a New Crystal Ball

no code implementations11 Dec 2020 Ian Foster, David Parkes, Stephan Zheng

These advances may lead to a new era in computational simulation, in which sensors of many kinds are used to produce vast quantities of data, AI methods identify patterns in those data, and new AI-driven simulators combine machine-learned and mathematical rules to make accurate and actionable predictions.

ESPRIT: Explaining Solutions to Physical Reasoning Tasks

2 code implementations ACL 2020 Nazneen Fatema Rajani, Rui Zhang, Yi Chern Tan, Stephan Zheng, Jeremy Weiss, Aadit Vyas, Abhijit Gupta, Caiming Xiong, Richard Socher, Dragomir Radev

Our framework learns to generate explanations of how the physical simulation will causally evolve so that an agent or a human can easily reason about a solution using those interpretable descriptions.

The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies

2 code implementations28 Apr 2020 Stephan Zheng, Alexander Trott, Sunil Srinivasa, Nikhil Naik, Melvin Gruesbeck, David C. Parkes, Richard Socher

In experiments conducted on MTurk, an AI tax policy provides an equality-productivity trade-off that is similar to that provided by the Saez framework along with higher inverse-income weighted social welfare.

Multiresolution Tensor Learning for Efficient and Interpretable Spatial Analysis

1 code implementation ICML 2020 Jung Yeon Park, Kenneth Theo Carr, Stephan Zheng, Yisong Yue, Rose Yu

Efficient and interpretable spatial analysis is crucial in many fields such as geology, sports, and climate science.

Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards

1 code implementation NeurIPS 2019 Alexander Trott, Stephan Zheng, Caiming Xiong, Richard Socher

For instance, in tasks where the agent must achieve some goal state, simple distance-to-goal reward shaping often fails, as it renders learning vulnerable to local optima.

Learning World Graphs to Accelerate Hierarchical Reinforcement Learning

no code implementations1 Jul 2019 Wenling Shang, Alex Trott, Stephan Zheng, Caiming Xiong, Richard Socher

We perform a thorough ablation study to evaluate our approach on a suite of challenging maze tasks, demonstrating significant advantages from the proposed framework over baselines that lack world graph knowledge in terms of performance and efficiency.

Hierarchical Reinforcement Learning reinforcement-learning +1

On the Generalization Gap in Reparameterizable Reinforcement Learning

no code implementations29 May 2019 Huan Wang, Stephan Zheng, Caiming Xiong, Richard Socher

For this problem class, estimating the expected return is efficient and the trajectory can be computed deterministically given peripheral random variables, which enables us to study reparametrizable RL using supervised learning and transfer learning theory.

Learning Theory reinforcement-learning +2

Novel deep learning methods for track reconstruction

3 code implementations14 Oct 2018 Steven Farrell, Paolo Calafiura, Mayur Mudigonda, Prabhat, Dustin Anderson, Jean-Roch Vlimant, Stephan Zheng, Josh Bendavid, Maria Spiropulu, Giuseppe Cerati, Lindsey Gray, Jim Kowalkowski, Panagiotis Spentzouris, Aristeidis Tsaris

The second set of models use Graph Neural Networks (GNNs) for the tasks of hit classification and segment classification.

High Energy Physics - Experiment Data Analysis, Statistics and Probability

Generating Multi-Agent Trajectories using Programmatic Weak Supervision

2 code implementations ICLR 2019 Eric Zhan, Stephan Zheng, Yisong Yue, Long Sha, Patrick Lucey

We study the problem of training sequential generative models for capturing coordinated multi-agent trajectory behavior, such as offensive basketball gameplay.

Imitation Learning

Detecting Adversarial Examples via Neural Fingerprinting

1 code implementation11 Mar 2018 Sumanth Dathathri, Stephan Zheng, Tianwei Yin, Richard M. Murray, Yisong Yue

Deep neural networks are vulnerable to adversarial examples, which dramatically alter model output using small input changes.

Multi-resolution Tensor Learning for Large-Scale Spatial Data

no code implementations19 Feb 2018 Stephan Zheng, Rose Yu, Yisong Yue

High-dimensional tensor models are notoriously computationally expensive to train.

Meta-Learning

Structured Exploration via Hierarchical Variational Policy Networks

no code implementations ICLR 2018 Stephan Zheng, Yisong Yue

Reinforcement learning in environments with large state-action spaces is challenging, as exploration can be highly inefficient.

Variational Inference

Long-term Forecasting using Tensor-Train RNNs

no code implementations ICLR 2018 Rose Yu, Stephan Zheng, Anima Anandkumar, Yisong Yue

We present Tensor-Train RNN (TT-RNN), a novel family of neural sequence architectures for multivariate forecasting in environments with nonlinear dynamics.

Long-term Forecasting using Higher Order Tensor RNNs

1 code implementation ICLR 2018 Rose Yu, Stephan Zheng, Anima Anandkumar, Yisong Yue

We present Higher-Order Tensor RNN (HOT-RNN), a novel family of neural sequence architectures for multivariate forecasting in environments with nonlinear dynamics.

Time Series Time Series Analysis

Generating Long-term Trajectories Using Deep Hierarchical Networks

no code implementations NeurIPS 2016 Stephan Zheng, Yisong Yue, Patrick Lucey

We study the problem of modeling spatiotemporal trajectories over long time horizons using expert demonstrations.

Improving the Robustness of Deep Neural Networks via Stability Training

no code implementations CVPR 2016 Stephan Zheng, Yang song, Thomas Leung, Ian Goodfellow

In this paper we address the issue of output instability of deep neural networks: small perturbations in the visual input can significantly distort the feature embeddings and output of a neural network.

General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.