Title of Invention |
METHOD, APPARATUS, AND SYSTEM FOR BUILDING A COMPACT MODEL FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION (LVCSR) SYSTEM |
Abstract |
According to one aspect of the invention, a method is provided in which a mean vector set and a variance vector set of a set of N Gaussians are divided into multiple mean sub-vector sets and variance sub-vector sets, respectively. Each mean sub-vector set contains a subset of the dimensions of the corresponding mean vector set and each variance sub-vector set contains a subset of the dimensions of the corresponding variance vector set. Each resultant sub-vector set is clustered to build a codebook for the respective sub-vector set using a modified K-means clustering process which dynamically merges and splits clusters based upon the size and average distortion of each cluster during each iteration in the modified K-means clustering process.
|
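The abstract's procedure can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not give the merge and split criteria, so the rules used here are assumptions. Clusters smaller than a hypothetical `min_size` are merged into the nearest surviving cluster, and the cluster with the highest average distortion is split until `k` clusters exist again. The function names, the contiguous equal-width split of dimensions, and the squared Euclidean distortion measure are likewise illustrative choices.

```python
import random


def split_dims(vectors, n_streams):
    """Split each vector into n_streams contiguous sub-vectors.

    Assumes the dimension is divisible by n_streams (illustrative choice).
    """
    step = len(vectors[0]) // n_streams
    return [[v[s * step:(s + 1) * step] for v in vectors]
            for s in range(n_streams)]


def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def centroid(points):
    return [sum(col) / len(points) for col in zip(*points)]


def avg_distortion(points):
    c = centroid(points)
    return sum(sq_dist(p, c) for p in points) / len(points)


def modified_kmeans(points, k, iters=10, min_size=2, seed=0):
    """K-means with a per-iteration merge/split step (assumed criteria)."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        # Assignment step: each point joins its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            j = min(range(len(centers)), key=lambda c: sq_dist(p, centers[c]))
            clusters[j].append(p)

        # Merge step (assumption): fold undersized clusters into the
        # nearest cluster that meets min_size; empty clusters are dropped.
        kept = [c for c in clusters if len(c) >= min_size]
        small = [c for c in clusters if 0 < len(c) < min_size]
        if not kept:
            kept, small = [list(points)], []
        for c in small:
            cen = centroid(c)
            target = min(kept, key=lambda big: sq_dist(cen, centroid(big)))
            target.extend(c)

        # Split step (assumption): split the cluster with the highest
        # average distortion until k clusters exist again.
        while len(kept) < k:
            i = max(range(len(kept)), key=lambda j: avg_distortion(kept[j]))
            worst = kept[i]
            if avg_distortion(worst) == 0.0:
                break  # all points identical, nothing to split
            c = centroid(worst)
            far = max(worst, key=lambda p: sq_dist(p, c))
            near_far = [p for p in worst if sq_dist(p, far) < sq_dist(p, c)]
            near_cen = [p for p in worst if sq_dist(p, far) >= sq_dist(p, c)]
            if not near_far or not near_cen:
                break
            kept[i:i + 1] = [near_far, near_cen]

        # Update step: the codebook entries are the cluster centroids.
        centers = [centroid(c) for c in kept]
    return centers


def build_codebooks(vectors, n_streams, k):
    """Build one codebook per sub-vector set."""
    return [modified_kmeans(sub, k) for sub in split_dims(vectors, n_streams)]
```

The same `build_codebooks` call would be applied once to the mean vector set and once to the variance vector set, giving one codebook per sub-vector set, so each Gaussian can then be stored as a short list of codebook indices rather than full vectors.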
Publication Number |
WO0229617(A1)
Publication Date |
2002.04.11
Application Number |
WO2000CN00306
Filing Date |
2000.09.30
Applicant |
INTEL CORPORATION;PAN, JIELIN;YUAN, BAOSHENG
Inventor |
PAN, JIELIN;YUAN, BAOSHENG
Classification |
G06F17/20;G10L15/06;(IPC1-7):G06F17/20
Main Classification |
G06F17/20
Agency |

Agent |

Principal Claim |

Address |
|