Mutual Information Maximization for Effective Lip Reading

13 Mar 2020 Xing Zhao Shuang Yang Shiguang Shan Xilin Chen

Lip reading has received an increasing research interest in recent years due to the rapid development of deep learning and its widespread potential applications. One key point to obtain good performance for the lip reading task depends heavily on how effective the representation can be to capture the lip movement information and meanwhile to resist the noises resulted from the change of pose, lighting conditions, speaker's appearance and so on... (read more)

PDF Abstract
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Lipreading Lip Reading in the Wild 3D Conv + ResNet-18 + BGRU Top-1 Accuracy 84.41 # 6
Lipreading LRW-1000 GLMIM Top-1 Accuracy 38.79% # 6

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet