The Softmax output function transforms a previous layer's output into a vector of probabilities. It is commonly used for multiclass classification. Given an input vector $x$ and a weight vector $w_{j}$ for each of the $K$ classes, the probability of class $j$ is:
$$ P(y=j \mid x) = \frac{e^{x^{T}w_{j}}}{\sum_{k=1}^{K} e^{x^{T}w_{k}}} $$
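The formula can be sketched in plain Python as follows. This is a minimal illustration, not a reference implementation: the function name `softmax_probs`, the list-of-lists layout for the weights, and the sample values are all assumptions made for the example. Subtracting the largest logit before exponentiating is a common numerical-stability trick that leaves the probabilities unchanged.

```python
import math


def softmax_probs(x, weights):
    """Return [P(y=j | x) for each class j], given one weight vector per class.

    `x` is a list of floats; `weights` is a list of K lists, each the same
    length as `x` (a hypothetical layout chosen for this sketch).
    """
    # The logit for class j is the dot product x^T w_j.
    logits = [sum(xi * wi for xi, wi in zip(x, w_j)) for w_j in weights]
    # Shifting by the max logit cancels in the ratio but keeps exp() from overflowing.
    shift = max(logits)
    exps = [math.exp(z - shift) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Illustrative values only: 2 input features, K = 3 classes.
x = [1.0, 2.0]
weights = [[0.5, -0.25], [0.1, 0.3], [-0.2, 0.0]]
probs = softmax_probs(x, weights)
```

The entries of `probs` are positive and sum to 1, and the class with the largest logit $x^{T}w_{j}$ receives the highest probability.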
Task | Papers | Share |
---|---|---|
Language Modelling | 43 | 5.64% |
Retrieval | 40 | 5.24% |
Question Answering | 27 | 3.54% |
Large Language Model | 23 | 3.01% |
Image Classification | 18 | 2.36% |
Object Detection | 17 | 2.23% |
Decoder | 17 | 2.23% |
Semantic Segmentation | 16 | 2.10% |
Text Generation | 13 | 1.70% |