A Stochastic Dueling Network, or SDN, is an architecture for learning a value function $V$. The SDN learns both $V$ and $Q$ off-policy while maintaining consistency between the two estimates. At each time step it outputs a stochastic estimate of $Q$ and a deterministic estimate of $V$.
Source: Sample Efficient Actor-Critic with Experience ReplayPaper | Code | Results | Date | Stars |
---|
Task | Papers | Share |
---|---|---|
Reinforcement Learning (RL) | 5 | 26.32% |
Face Anti-Spoofing | 3 | 15.79% |
Face Recognition | 3 | 15.79% |
Problem Decomposition | 2 | 10.53% |
Face Presentation Attack Detection | 1 | 5.26% |
Automatic Speech Recognition (ASR) | 1 | 5.26% |
Benchmarking | 1 | 5.26% |
Speech Recognition | 1 | 5.26% |
Spoken Dialogue Systems | 1 | 5.26% |
Component | Type |
|
---|---|---|
🤖 No Components Found | You can add them if they exist; e.g. Mask R-CNN uses RoIAlign |