发明名称 Generic framework for large-margin MCE training in speech recognition
摘要 A method and apparatus for training an acoustic model are disclosed. A training corpus is accessed and converted into an initial acoustic model. Scores are calculated for a correct class and competitive classes, respectively, for each token given the initial acoustic model. Also, a sample-adaptive window bandwidth is calculated for each training token. From the calculated scores and the sample-adaptive window bandwidth values, loss values are calculated based on a loss function. The loss function, which may be derived from a Bayesian risk minimization viewpoint, can include a margin value that moves a decision boundary such that token-to-boundary distances for correct tokens that are near the decision boundary are maximized. The margin can either be a fixed margin or can vary monotonically as a function of algorithm iterations. The acoustic model is updated based on the calculated loss values. This process can be repeated until an empirical convergence is met.
申请公布号 US8423364(B2) 申请公布日期 2013.04.16
申请号 US20070708440 申请日期 2007.02.20
申请人 YU DONG;ACERO ALEJANDRO;DENG LI;HE XIAODONG;MICROSOFT CORPORATION 发明人 YU DONG;ACERO ALEJANDRO;DENG LI;HE XIAODONG
分类号 G10L15/14;G10L15/00;G10L15/06 主分类号 G10L15/14
代理机构 代理人
主权项
地址
您可能感兴趣的专利