发明名称 Multi-frame prediction for hybrid neural network/hidden Markov models
摘要 A method and system for multi-frame prediction in a hybrid neural network/hidden Markov model automatic speech recognition (ASR) system is disclosed. An audio input signal may be transformed into a time sequence of feature vectors, each corresponding to respective temporal frame of a sequence of periodic temporal frames of the audio input signal. The time sequence of feature vectors may be concurrently input to a neural network, which may process them concurrently. In particular, the neural network may concurrently determine for the time sequence of feature vectors a set of emission probabilities for a plurality of hidden Markov models of the ASR system, where the set of emission probabilities are associated with the temporal frames. The set of emission probabilities may then be concurrently applied to the hidden Markov models for determining speech content of the audio input signal.
申请公布号 US8442821(B1) 申请公布日期 2013.05.14
申请号 US201213560706 申请日期 2012.07.27
申请人 VANHOUCKE VINCENT;GOOGLE INC. 发明人 VANHOUCKE VINCENT
分类号 G10L15/14 主分类号 G10L15/14
代理机构 代理人
主权项
地址