A Dutch Eligibility Trace is a type of eligibility trace where the trace increments grow less quickly than the accumulative eligibility trace (helping avoid large variance updates). For the memory vector $\textbf{e}_{t} \in \mathbb{R}^{b} \geq \textbf{0}$:
$$\mathbf{e_{0}} = \textbf{0}$$
$$\textbf{e}_{t} = \gamma\lambda\textbf{e}_{t-1} + \left(1-\alpha\gamma\lambda\textbf{e}_{t-1}^{T}\phi_{t}\right)\phi_{t}$$
Paper | Code | Results | Date | Stars |
---|
Component | Type |
|
---|---|---|
🤖 No Components Found | You can add them if they exist; e.g. Mask R-CNN uses RoIAlign |