视觉语音识别 - 视觉语音识别技术,学习,经验文章

提娜米苏

8 个月前

[论文笔记] 基于 LSTM 的端到端视觉语音识别 (End-to-End Visual Speech Recognition with LSTMs)原文标题：End-to-End Visual Speech Recognition with LSTMs 发表年份：2017 核心思想：如何显式地让网络同时关注唇部的“形状”和“运动”,实现从像素到语义的端到端识别。