Connectionist Temporal Classification Loss

Connectionist Temporal Classification loss, or CTC loss, is designed for tasks that need an alignment between two sequences where that alignment is hard to obtain, e.g. aligning each character of a transcript to its location in an audio file. It computes a loss between a continuous (unsegmented) time series and a target sequence by summing the probabilities of all possible alignments of input to target, which produces a loss value that is differentiable with respect to each input node. The alignment of input to target is assumed to be “many-to-one”, so the target sequence can be no longer than the input: its length must be $\leq$ the input length.
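
The summation over alignments can be sketched in plain Python. This is a minimal illustration, not a reference implementation: it assumes the per-frame class probabilities are already normalised, that class index 0 is the blank symbol, and that an alignment maps to a target by merging repeated labels and then dropping blanks. The names `ctc_loss`, `collapse` and `ctc_loss_brute_force` are chosen here for illustration. It computes only the loss value, in probability space, so it will underflow on long inputs; practical implementations usually work in log space.

```python
import math
from itertools import product


def ctc_loss(probs, target, blank=0):
    """Negative log-probability of `target`, summed over all alignments.

    probs  : list of T frames, each a list of per-class probabilities
    target : list of class indices (none equal to `blank`)
    """
    # Extended target with blanks around every label: b, t1, b, t2, ..., b
    ext = [blank]
    for label in target:
        ext += [label, blank]
    S = len(ext)

    # alpha[s] = total probability of all partial alignments that end
    # at position s of the extended target after the current frame.
    alpha = [0.0] * S
    alpha[0] = probs[0][ext[0]]
    if S > 1:
        alpha[1] = probs[0][ext[1]]

    for t in range(1, len(probs)):
        new = [0.0] * S
        for s in range(S):
            total = alpha[s]                      # stay on the same symbol
            if s >= 1:
                total += alpha[s - 1]             # advance by one position
            # Skip over a blank, allowed only between two different labels.
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                total += alpha[s - 2]
            new[s] = total * probs[t][ext[s]]
        alpha = new

    # Valid alignments end on the last label or on the trailing blank.
    p = alpha[S - 1] + (alpha[S - 2] if S > 1 else 0.0)
    return -math.log(p) if p > 0 else math.inf


def collapse(path, blank=0):
    """Merge repeated labels, then drop blanks (the many-to-one mapping)."""
    out, prev = [], None
    for c in path:
        if c != prev and c != blank:
            out.append(c)
        prev = c
    return out


def ctc_loss_brute_force(probs, target, blank=0):
    """Same quantity by enumerating every alignment; tiny inputs only."""
    T, C = len(probs), len(probs[0])
    p = 0.0
    for path in product(range(C), repeat=T):
        if collapse(path, blank) == list(target):
            p += math.prod(probs[t][c] for t, c in enumerate(path))
    return -math.log(p) if p > 0 else math.inf
```

For two frames that are uniform over {blank, 1} and the target `[1]`, three of the four possible alignments collapse to the target, so the summed probability is 0.75 and the loss is $-\ln 0.75$. A target longer than the input has no valid alignment, and under these assumptions the sketch returns infinity in that case.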

Tasks

Task	Papers	Share
Speech Recognition	21	28.38%
Automatic Speech Recognition (ASR)	16	21.62%
Language Modelling	8	10.81%
Translation	3	4.05%
Sign Language Recognition	3	4.05%
Lipreading	3	4.05%
Handwritten Text Recognition	2	2.70%
Multi-Task Learning	2	2.70%
General Classification	2	2.70%

Categories

Loss Functions