发明名称 |
LEARNING FRONT-END SPEECH RECOGNITION PARAMETERS WITHIN NEURAL NETWORK TRAINING |
摘要 |
Techniques for learning front-end speech recognition parameters as part of training a neural network classifier include obtaining an input speech signal, and applying front-end speech recognition parameters to extract features from the input speech signal. The extracted features may be fed through a neural network to obtain an output classification for the input speech signal, and an error measure may be computed for the output classification through comparison of the output classification with a known target classification. Back propagation may be applied to adjust one or more of the front-end parameters as one or more layers of the neural network, based on the error measure. |
申请公布号 |
US2015161995(A1) |
申请公布日期 |
2015.06.11 |
申请号 |
US201414561811 |
申请日期 |
2014.12.05 |
申请人 |
Nuance Communications, Inc. |
发明人 |
Sainath Tara N.;Kingsbury Brian E.D.;Mohamed Abdel-rahman;Ramabhadran Bhuvana |
分类号 |
G10L15/16;G10L15/06 |
主分类号 |
G10L15/16 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method comprising:
obtaining a power spectrum of an input frame of a speech signal; passing the power spectrum through a filter bank comprising a plurality of weights to create a filter bank output, each weight of the plurality of weights operating on a subset of frequency components of the power spectrum; processing the filter bank output to generate a set of features of the input frame; feeding the generated features through a neural network to obtain an output to classification; comparing the output classification to a target classification to compute an error measure; and applying back propagation to adjust the plurality of weights as a layer of the neural network based on the error measure. |
地址 |
Burlington MA US |