Dropout is a regularization technique for neural networks that drops a unit (along with connections) at training time with a specified probability $p$ (a common value is $p=0.5$). At test time, all units are present, but with weights scaled by $p$ (i.e. $w$ becomes $pw$).
The idea is to prevent co-adaptation, where the neural network becomes too reliant on particular connections, as this could be symptomatic of overfitting. Intuitively, dropout can be thought of as creating an implicit ensemble of neural networks.
Source: Dropout: A Simple Way to Prevent Neural Networks from OverfittingPaper | Code | Results | Date | Stars |
---|
Task | Papers | Share |
---|---|---|
Language Modelling | 54 | 6.26% |
RAG | 48 | 5.57% |
Retrieval | 42 | 4.87% |
Large Language Model | 31 | 3.60% |
Question Answering | 22 | 2.55% |
Decoder | 21 | 2.44% |
Semantic Segmentation | 18 | 2.09% |
Text Generation | 16 | 1.86% |
Image Classification | 13 | 1.51% |
Component | Type |
|
---|---|---|
🤖 No Components Found | You can add them if they exist; e.g. Mask R-CNN uses RoIAlign |