LCANet: End-to-End Lipreading with Cascaded Attention-CTC

13 Mar 2018Kai XuDawei LiNick CassimatisXiaolong Wang

Machine lipreading is a special type of automatic speech recognition (ASR) which transcribes human speech by visually interpreting the movement of related face regions including lips, face, and tongue. Recently, deep neural network based lipreading methods show great potential and have exceeded the accuracy of experienced human lipreaders in some benchmark datasets... (read more)

PDF Abstract

Code


No code implementations yet. Submit your code now
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Lipreading GRID corpus (mixed-speech) LCANet Word Error Rate (WER) 2.9 # 1

Methods used in the Paper