发明名称 MODEL RESTRUCTURING FOR CLIENT AND SERVER BASED AUTOMATIC SPEECH RECOGNITION
摘要 Access is obtained to a large reference acoustic model for automatic speech recognition. The large reference acoustic model has L states modeled by L mixture models, and the large reference acoustic model has N components. A desired number of components Nc, less than N, to be used in a restructured acoustic model derived from the reference acoustic model, is identified. The desired number of components Nc is selected based on a computing environment in which the restructured acoustic model is to be deployed. The restructured acoustic model also has L states. For each given one of the L mixture models in the reference acoustic model, a merge sequence is built which records, for a given cost function, sequential mergers of pairs of the components associated with the given one of the mixture models. A portion of the Nc components is assigned to each of the L states in the restructured acoustic model. The restructured acoustic model is built by, for each given one of the L states in the restructured acoustic model, applying the merge sequence to a corresponding one of the L mixture models in the reference acoustic model until the portion of the Nc components assigned to the given one of the L states is achieved.
申请公布号 US2012150536(A1) 申请公布日期 2012.06.14
申请号 US20100964433 申请日期 2010.12.09
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 DOGNIN PIERRE;GOEL VAIBHAVA;HERSHEY JOHN R.;OLSEN PEDER A.
分类号 G10L15/00 主分类号 G10L15/00
代理机构 代理人
主权项
地址