Title of invention: Model restructuring for client and server based automatic speech recognition
Abstract: Access is obtained to a large reference acoustic model for automatic speech recognition. The large reference acoustic model has L states modeled by L mixture models, and the large reference acoustic model has N components. A desired number of components Nc, less than N, to be used in a restructured acoustic model derived from the reference acoustic model, is identified. The desired number of components Nc is selected based on a computing environment in which the restructured acoustic model is to be deployed. The restructured acoustic model also has L states. For each given one of the L mixture models in the reference acoustic model, a merge sequence is built which records, for a given cost function, sequential mergers of pairs of the components associated with the given one of the mixture models. A portion of the Nc components is assigned to each of the L states in the restructured acoustic model. The restructured acoustic model is built by, for each given one of the L states in the restructured acoustic model, applying the merge sequence to a corresponding one of the L mixture models in the reference acoustic model until the portion of the Nc components assigned to the given one of the L states is achieved.
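The three steps in the abstract (build a merge sequence per state, assign a share of the Nc components to each state, replay each merge sequence until that share is reached) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: components are one-dimensional Gaussians stored as `(weight, mean, variance)` tuples, pairs are merged by moment matching, the cost function is the weighted log-variance increase caused by a merge, and `allocate` uses a simple greedy proportional rule. The function names `merge`, `merge_cost`, `build_merge_sequence`, `allocate`, `apply_merges`, and `restructure` are hypothetical.

```python
import math
from itertools import combinations

# A component is a (weight, mean, variance) tuple for a 1-D Gaussian.
# This representation is an illustrative assumption, not the patent's.

def merge(a, b):
    """Moment-matched merge of two weighted 1-D Gaussians."""
    (w1, m1, v1), (w2, m2, v2) = a, b
    w = w1 + w2
    m = (w1 * m1 + w2 * m2) / w
    v = (w1 * (v1 + m1 * m1) + w2 * (v2 + m2 * m2)) / w - m * m
    return (w, m, v)

def merge_cost(a, b):
    """Assumed cost: weighted log-variance increase caused by the merge."""
    w, _, v = merge(a, b)
    return 0.5 * (w * math.log(v) - a[0] * math.log(a[2]) - b[0] * math.log(b[2]))

def build_merge_sequence(mixture):
    """Greedily merge the cheapest pair until one component remains.

    Returns the recorded (i, j) index pairs, with i < j, in merge order.
    """
    comps = list(mixture)
    sequence = []
    while len(comps) > 1:
        i, j = min(combinations(range(len(comps)), 2),
                   key=lambda p: merge_cost(comps[p[0]], comps[p[1]]))
        comps[i] = merge(comps[i], comps[j])
        del comps[j]
        sequence.append((i, j))
    return sequence

def apply_merges(mixture, sequence, target):
    """Replay a recorded merge sequence until `target` components remain."""
    comps = list(mixture)
    for i, j in sequence[:len(comps) - target]:
        comps[i] = merge(comps[i], comps[j])
        del comps[j]
    return comps

def allocate(counts, nc):
    """Assumed allocation rule: one component per state, then hand out the
    rest one at a time to the state with the largest original/assigned ratio."""
    if not len(counts) <= nc <= sum(counts):
        raise ValueError("nc must lie between L and N")
    alloc = [1] * len(counts)
    for _ in range(nc - len(counts)):
        candidates = [s for s in range(len(counts)) if alloc[s] < counts[s]]
        s = max(candidates, key=lambda s: counts[s] / alloc[s])
        alloc[s] += 1
    return alloc

def restructure(mixtures, nc):
    """Build a model with the same L states and nc components in total."""
    alloc = allocate([len(m) for m in mixtures], nc)
    return [apply_merges(m, build_merge_sequence(m), k)
            for m, k in zip(mixtures, alloc)]

if __name__ == "__main__":
    state_a = [(0.25, 0.0, 1.0), (0.25, 0.1, 1.0), (0.25, 5.0, 1.0), (0.25, 5.3, 1.0)]
    state_b = [(0.5, -1.0, 1.0), (0.5, 1.0, 1.0)]
    for state in restructure([state_a, state_b], nc=3):
        print(state)
```

Because the merge sequence for each state depends only on the reference model and not on Nc, it could be computed once and replayed for different target sizes, which fits the abstract's point that Nc is chosen per deployment environment.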
Publication number: US8635067(B2) | Publication date: 2014.01.21
Application number: US20100964433 | Filing date: 2010.12.09
Applicant: DOGNIN PIERRE;GOEL VAIBHAVA;HERSHEY JOHN R.;OLSEN PEDER A.;INTERNATIONAL BUSINESS MACHINES CORPORATION | Inventor: DOGNIN PIERRE;GOEL VAIBHAVA;HERSHEY JOHN R.;OLSEN PEDER A.
Classification: G10L15/14 | Main classification: G10L15/14
Agency: | Agent:
Main claim:
Address: