A Connectionist Temporal Classification Loss, or CTC Loss, is designed for tasks where we need alignment between sequences, but where that alignment is difficult - e.g. aligning each character to its location in an audio file. It calculates a loss between a continuous (unsegmented) time series and a target sequence. It does this by summing over the probability of possible alignments of input to target, producing a loss value which is differentiable with respect to each input node. The alignment of input to target is assumed to be “many-to-one”, which limits the length of the target sequence such that it must be $\leq$ the input length.
Paper | Code | Results | Date | Stars |
---|
Task | Papers | Share |
---|---|---|
Speech Recognition | 23 | 25.27% |
Automatic Speech Recognition (ASR) | 18 | 19.78% |
Language Modelling | 10 | 10.99% |
Decoder | 6 | 6.59% |
Handwritten Text Recognition | 3 | 3.30% |
Translation | 3 | 3.30% |
Sign Language Recognition | 3 | 3.30% |
Lipreading | 3 | 3.30% |
Multi-Task Learning | 2 | 2.20% |
Component | Type |
|
---|---|---|
🤖 No Components Found | You can add them if they exist; e.g. Mask R-CNN uses RoIAlign |