Title: Method and Apparatus for Speech Recognition Using Neural Networks with Speaker Adaptation
Abstract: In a speech recognition system, deep neural networks (DNNs) are employed in phoneme recognition. While DNNs typically provide better phoneme recognition performance than other techniques, such as Gaussian mixture models (GMMs), adapting a DNN to a particular speaker is challenging. According to at least one example embodiment, speech data and corresponding speaker data are both applied as input to a DNN. In response, the DNN generates a prediction of a phoneme based on the input speech data and the corresponding speaker data. The speaker data may be generated from the corresponding speech data.
Publication No.: US2015161994(A1) Publication date: 2015.06.11
Application No.: US201314098259 Filing date: 2013.12.05
Applicant: Nuance Communications, Inc. Inventors: Tang Yun; Nagesha Venkatesh; Fan Xing
Classification: G10L15/16; G10L15/02 Main classification: G10L15/16
Agency: Agent:
Main claim: 1. A method for speech recognition, the method comprising: receiving, by a deep neural network, input speech data and corresponding speaker data; and generating, by the deep neural network, a prediction of a phoneme corresponding to the input speech data based on the corresponding speaker data.
Address: Burlington MA US
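
The idea in the abstract, feeding speech data and speaker data into the same network, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the speaker data is taken to be the per-dimension mean of the utterance's feature frames, the two inputs are combined by simple concatenation, the weights are random and untrained, and the names `speaker_vector`, `predict_phoneme`, and `PHONEMES` are hypothetical.

```python
import math
import random


def speaker_vector(frames):
    """Hypothetical speaker data: the per-dimension mean of the utterance's frames.

    This is one simple way to derive speaker data from the speech data itself;
    the record does not say which representation is actually used.
    """
    n = len(frames)
    return [sum(column) / n for column in zip(*frames)]


def dense(x, weights, bias):
    """Affine layer: one output per weight row."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


def init_layer(rng, n_out, n_in):
    """Small random weights and zero biases (untrained, for illustration only)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def predict_phoneme(frame, spk, layers):
    """Return phoneme posteriors for one speech frame plus its speaker vector."""
    # Assumed combination scheme: concatenate speech features and speaker data.
    x = list(frame) + list(spk)
    for weights, bias in layers[:-1]:
        x = [math.tanh(v) for v in dense(x, weights, bias)]
    weights, bias = layers[-1]
    return softmax(dense(x, weights, bias))


PHONEMES = ["aa", "iy", "s", "t"]  # toy phoneme inventory
rng = random.Random(0)

# A synthetic utterance: 20 frames of 5-dimensional features.
frames = [[rng.gauss(0, 1) for _ in range(5)] for _ in range(20)]
spk = speaker_vector(frames)

# Input size is 5 speech dimensions + 5 speaker dimensions = 10.
layers = [
    init_layer(rng, 8, 10),
    init_layer(rng, 8, 8),
    init_layer(rng, len(PHONEMES), 8),
]

posteriors = predict_phoneme(frames[0], spk, layers)
best = PHONEMES[posteriors.index(max(posteriors))]
print(best, posteriors)
```

Because the same speaker vector accompanies every frame of the utterance, the network can condition its phoneme prediction on the speaker without any per-speaker retraining of the weights, which is the adaptation difficulty the abstract refers to.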