发明名称 LEARNING FRONT-END SPEECH RECOGNITION PARAMETERS WITHIN NEURAL NETWORK TRAINING
摘要 Techniques for learning front-end speech recognition parameters as part of training a neural network classifier include obtaining an input speech signal, and applying front-end speech recognition parameters to extract features from the input speech signal. The extracted features may be fed through a neural network to obtain an output classification for the input speech signal, and an error measure may be computed for the output classification through comparison of the output classification with a known target classification. Back propagation may be applied to adjust one or more of the front-end parameters as one or more layers of the neural network, based on the error measure.
申请公布号 US2015161995(A1) 申请公布日期 2015.06.11
申请号 US201414561811 申请日期 2014.12.05
申请人 Nuance Communications, Inc. 发明人 Sainath Tara N.;Kingsbury Brian E.D.;Mohamed Abdel-rahman;Ramabhadran Bhuvana
分类号 G10L15/16;G10L15/06 主分类号 G10L15/16
代理机构 代理人
主权项 1. A method comprising: obtaining a power spectrum of an input frame of a speech signal; passing the power spectrum through a filter bank comprising a plurality of weights to create a filter bank output, each weight of the plurality of weights operating on a subset of frequency components of the power spectrum; processing the filter bank output to generate a set of features of the input frame; feeding the generated features through a neural network to obtain an output to classification; comparing the output classification to a target classification to compute an error measure; and applying back propagation to adjust the plurality of weights as a layer of the neural network based on the error measure.
地址 Burlington MA US