发明名称 Posterior-based feature with partial distance elimination for speech recognition
摘要 A high-dimensional posterior-based feature with partial distance elimination may be utilized for speech recognition. The log likelihood values of a large number of Gaussians are needed to generate the high-dimensional posterior feature. Gaussians with very small log likelihoods are associated with zero posterior values. Log likelihoods for Gaussians for a speech frame may be evaluated with a partial distance elimination method. If the partial distance of a Gaussian is already too small, the Gaussian will have a zero posterior value. The partial distance may be calculated by sequentially adding individual dimensions in a group of dimensions. The partial distance elimination occurs when less than all of the dimensions in the group are sequentially added.
申请公布号 US9336775(B2) 申请公布日期 2016.05.10
申请号 US201313785168 申请日期 2013.03.05
申请人 MICROSOFT TECHNOLOGY LICENSING, LLC 发明人 Li Jinyu;Yan Zhijie;Huo Qiang;Gong Yifan
分类号 G10L15/28;G10L15/14;G10L15/00;G10L15/10 主分类号 G10L15/28
代理机构 代理人 Holmes Danielle Johnston;Spellman Steven;Minhas Micky
主权项 1. A method of utilizing a posterior-based feature with partial distance elimination for speech recognition, comprising: receiving, by a computer, an utterance comprising a plurality of speech frames; and evaluating, by the computer, a plurality of log likelihoods of Gaussians for a speech frame to calculate a partial distance by sequentially adding a plurality of dimensions, the partial distance elimination occurring when less than all of the plurality of dimensions are sequentially added; wherein the evaluating comprises maintaining a bound for the plurality of log likelihoods, the bound being determined by comparing the plurality of log likelihoods of Gaussians to a threshold value.
地址 Redmond WA US