Browse > Reasoning > Decision Making

Decision Making

112 papers with code · Reasoning

State-of-the-art leaderboards

No evaluation results yet. Help compare methods by submit evaluation metrics.

Greatest papers with code

Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling

ICLR 2018 tensorflow/models

At the same time, advances in approximate Bayesian methods have made posterior approximation for flexible neural network models practical. To understand the impact of using an approximate posterior on Thompson Sampling, we benchmark well-established and recently developed methods for approximate posterior sampling combined with Thompson Sampling over a series of contextual bandit problems.

DECISION MAKING MULTI-ARMED BANDITS

Relational inductive biases, deep learning, and graph networks

4 Jun 2018deepmind/graph_nets

This has been due, in part, to cheap data and cheap compute resources, which have fit the natural strengths of deep learning. As a companion to this paper, we have released an open-source software library for building graph networks, with demonstrations of how to use them in practice.

DECISION MAKING RELATIONAL REASONING

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

ICML 2018 facebookresearch/Horizon

Model-free deep reinforcement learning (RL) algorithms have been demonstrated on a range of challenging decision making and control tasks. In this paper, we propose soft actor-critic, an off-policy actor-critic deep RL algorithm based on the maximum entropy reinforcement learning framework.

CONTINUOUS CONTROL DECISION MAKING Q-LEARNING

Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding

9 Nov 2015alexgkendall/caffe-segnet

Semantic segmentation is an important tool for visual scene understanding and a meaningful measure of uncertainty is essential for decision making. Our contribution is a practical system which is able to predict pixel-wise class labels with a measure of model uncertainty.

DECISION MAKING SCENE UNDERSTANDING SEMANTIC SEGMENTATION

AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias

3 Oct 2018IBM/AIF360

Fairness is an increasingly important concern as machine learning models are used to support decision making in high-stakes applications such as mortgage lending, hiring, and prison sentencing. Such architectural design and abstractions enable researchers and developers to extend the toolkit with their new algorithms and improvements, and to use it for performance benchmarking.

DECISION MAKING

Joint Learning of Sentence Embeddings for Relevance and Entailment

WS 2016 brmson/dataset-sts

We consider the problem of Recognizing Textual Entailment within an Information Retrieval context, where we must simultaneously determine the relevancy as well as degree of entailment for individual pieces of evidence to determine a yes/no answer to a binary natural language question. We compare several variants of neural networks for sentence embeddings in a setting of decision-making based on evidence of varying relevance.

DECISION MAKING NATURAL LANGUAGE INFERENCE READING COMPREHENSION SENTENCE EMBEDDINGS

Deep Reinforcement Learning For Sequence to Sequence Models

24 May 2018yaserkl/RLSeq2Seq

In this survey, we consider seq2seq problems from the RL point of view and provide a formulation combining the power of RL methods in decision-making with sequence-to-sequence models that enable remembering long-term memories. We also provide the source code for implementing most of the RL models discussed in this paper to support the complex task of abstractive text summarization.

ABSTRACTIVE TEXT SUMMARIZATION DECISION MAKING MACHINE TRANSLATION

Visual Interaction Networks

5 Jun 2017gitlimlab/Relation-Network-Tensorflow

We introduce the Visual Interaction Network, a general-purpose model for learning the dynamics of a physical system from raw visual observations. We found that from just six input video frames the Visual Interaction Network can generate accurate future trajectories of hundreds of time steps on a wide range of physical systems.

DECISION MAKING

A Probabilistic U-Net for Segmentation of Ambiguous Images

NeurIPS 2018 SimonKohl/probabilistic_unet

To this end we propose a generative segmentation model based on a combination of a U-Net with a conditional variational autoencoder that is capable of efficiently producing an unlimited number of plausible hypotheses. We show on a lung abnormalities segmentation task and on a Cityscapes segmentation task that our model reproduces the possible segmentation variants as well as the frequencies with which they occur, doing so significantly better than published approaches.

DECISION MAKING SEMANTIC SEGMENTATION

Soft Actor-Critic Algorithms and Applications

13 Dec 2018rail-berkeley/softlearning

However, these methods typically suffer from two major challenges: high sample complexity and brittleness to hyperparameters. We systematically evaluate SAC on a range of benchmark tasks, as well as real-world challenging tasks such as locomotion for a quadrupedal robot and robotic manipulation with a dexterous hand.

DECISION MAKING