发明名称 |
Method and Apparatus for Speech Recognition Using Neural Networks with Speaker Adaptation |
摘要 |
In a speech recognition system, deep neural networks (DNNs) are employed in phoneme recognition. While DNNs typically provide better phoneme recognition performance than other techniques, such as Gaussian mixture models (GMM), adapting a DNN to a particular speaker is a real challenge. According to at least one example embodiment, speech data and corresponding speaker data are both applied as input to a DNN. In response, the DNN generates a prediction of a phoneme based on the input speech data and the corresponding speaker data. The speaker data may be generated from the corresponding speech data. |
申请公布号 |
US2015161994(A1) |
申请公布日期 |
2015.06.11 |
申请号 |
US201314098259 |
申请日期 |
2013.12.05 |
申请人 |
Nuance Communications, Inc. |
发明人 |
Tang Yun;Nagesha Venkatesh;Fan Xing |
分类号 |
G10L15/16;G10L15/02 |
主分类号 |
G10L15/16 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method for speech recognition, the comprising:
receiving, by a deep neural network, input speech data and corresponding speaker data; and generating, by the deep neural network, a prediction of a phoneme corresponding to the input speech data based on the corresponding speaker data. |
地址 |
Burlington MA US |