Browse > Computer Code > Program Synthesis > Value prediction

Value prediction

6 papers with code · Computer Code
Subtask of Program Synthesis

Leaderboards

TREND DATASET BEST METHOD PAPER TITLE PAPER CODE COMPARE

Greatest papers with code

ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search

6 Nov 2018ShangtongZhang/DeepRL

In this paper, we propose an actor ensemble algorithm, named ACE, for continuous control with a deterministic policy in reinforcement learning.

CONTINUOUS CONTROL VALUE PREDICTION

Value Prediction Network

NeurIPS 2017 junhyukoh/value-prediction-network

This paper proposes a novel deep reinforcement learning (RL) architecture, called Value Prediction Network (VPN), which integrates model-free and model-based RL methods into a single neural network.

ATARI GAMES VALUE PREDICTION

TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning

ICLR 2018 oxwhirl/treeqn

To address these challenges, we propose TreeQN, a differentiable, recursive, tree-structured model that serves as a drop-in replacement for any value function network in deep RL with discrete actions.

ATARI GAMES VALUE PREDICTION

Multi-task Neural Network for Non-discrete Attribute Prediction in Knowledge Graphs

16 Aug 2017BaeSeulki/WhySoMuch

Unfortunately, many state-of-the-art relational learning models ignore this information due to the challenging nature of dealing with non-discrete data types in the inherently binary-natured knowledge graphs.

KNOWLEDGE GRAPHS MULTI-TASK LEARNING RELATIONAL REASONING VALUE PREDICTION

Code Prediction by Feeding Trees to Transformers

30 Mar 2020facebookresearch/code-prediction-transformer

We provide comprehensive experimental evaluation of our proposal, along with alternative design choices, on a standard Python dataset, as well as on a Python corpus internal to Facebook.

TYPE PREDICTION VALUE PREDICTION

A Closer Look at Deep Policy Gradients

ICLR 2020 BPDanek/learning_resources

We study how the behavior of deep policy gradient algorithms reflects the conceptual framework motivating their development.

VALUE PREDICTION