Efficient exploration is essential for reinforcement learning in large state spaces.
We propose and address a novel few-shot RL problem in which each task is characterized by a subtask graph describing a set of subtasks and their dependencies, both of which are unknown to the agent.
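To make the structure concrete, here is a minimal Python sketch of a subtask graph as a dependency structure over subtasks; the `SubtaskGraph` class, its precondition encoding, and the crafting-style example are illustrative assumptions, not the paper's actual representation.

```python
from dataclasses import dataclass, field

@dataclass
class SubtaskGraph:
    """Illustrative subtask graph: nodes are subtasks, edges are preconditions.

    Hypothetical representation; the actual encoding is paper-specific.
    """
    subtasks: list[str]
    # preconditions[s] lists the subtasks that must be completed before s
    preconditions: dict[str, list[str]] = field(default_factory=dict)

    def eligible(self, completed: set[str]) -> set[str]:
        """Subtasks not yet done whose preconditions are all satisfied."""
        return {
            s for s in self.subtasks
            if s not in completed
            and all(p in completed for p in self.preconditions.get(s, []))
        }

# Hypothetical example: 'pickaxe' depends on 'wood' and 'stone'; in the
# few-shot setting the agent must infer this graph from interaction.
graph = SubtaskGraph(
    subtasks=["wood", "stone", "pickaxe"],
    preconditions={"pickaxe": ["wood", "stone"]},
)
print(graph.eligible({"wood"}))           # {'stone'}
print(graph.eligible({"wood", "stone"}))  # {'pickaxe'}
```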
Each random draw from our generative model is a neural network that instantiates the dynamics function; multiple draws therefore approximate the posterior, and the variance of future predictions under this posterior serves as an intrinsic reward for exploration.
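A minimal sketch of this ensemble-style intrinsic reward, assuming each posterior draw is an independently initialized forward model; the two-layer numpy networks and the `intrinsic_reward` helper are illustrative stand-ins for the actual generative model and architecture.

```python
import numpy as np

rng = np.random.default_rng(0)
STATE_DIM, ACTION_DIM, HIDDEN = 4, 2, 32

def draw_dynamics_model():
    """One 'draw' from the generative model: a randomly initialized
    two-layer network mapping (state, action) -> predicted next state."""
    w1 = rng.normal(0, 0.5, (STATE_DIM + ACTION_DIM, HIDDEN))
    w2 = rng.normal(0, 0.5, (HIDDEN, STATE_DIM))
    def predict(state, action):
        x = np.concatenate([state, action])
        return np.tanh(x @ w1) @ w2
    return predict

# Multiple draws jointly approximate the posterior over dynamics functions.
ensemble = [draw_dynamics_model() for _ in range(8)]

def intrinsic_reward(state, action):
    """Variance of the ensemble's next-state predictions, averaged over
    state dimensions; large where the posterior is uncertain."""
    preds = np.stack([f(state, action) for f in ensemble])  # (n_models, STATE_DIM)
    return preds.var(axis=0).mean()

s = rng.normal(size=STATE_DIM)
a = rng.normal(size=ACTION_DIM)
print(f"intrinsic reward: {intrinsic_reward(s, a):.4f}")
```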
Configuration spaces for computer systems can be challenging for both traditional and automatic tuning strategies.
Our learning algorithm, Adaptive Value-function Elimination (AVE), is inspired by OLIVE, the policy elimination algorithm proposed by Jiang et al. (2017).
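To illustrate the elimination idea that OLIVE-style algorithms share, here is a schematic Python sketch under strong simplifying assumptions (a finite candidate class of tabular Q-functions and a fixed batch of transitions); the `elimination_round` function and its mean-absolute-residual test are illustrative, not the AVE or OLIVE specification.

```python
import numpy as np

def elimination_round(candidates, transitions, n_actions, gamma=0.99, tol=0.1):
    """One schematic round of value-function elimination.

    candidates:  candidate Q-functions, each a dict (state, action) -> value
    transitions: (state, action, reward, next_state) tuples from roll-outs
    Keeps candidates whose Bellman residual on the data is small. For
    simplicity this uses the mean absolute residual; OLIVE's actual test
    uses the signed average Bellman error under specific roll-in distributions.
    """
    survivors = []
    for q in candidates:
        residuals = [
            abs(q.get((s, a), 0.0)
                - (r + gamma * max(q.get((s2, b), 0.0) for b in range(n_actions))))
            for (s, a, r, s2) in transitions
        ]
        if np.mean(residuals) <= tol:
            survivors.append(q)
    return survivors

# Toy usage: one state, two actions, reward 1 for action 0 (gamma=0 for brevity).
candidates = [
    {(0, 0): 1.0, (0, 1): 0.0},  # consistent with the data: survives
    {(0, 0): 0.0, (0, 1): 1.0},  # inconsistent: eliminated
]
transitions = [(0, 0, 1.0, 0), (0, 1, 0.0, 0)]
print(len(elimination_round(candidates, transitions, n_actions=2, gamma=0.0)))  # 1
```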
Recently, hybrid metaheuristics have become a trend in operations research.