Title of Invention |
METHOD, APPARATUS, AND SYSTEM FOR BUILDING A COMPACT LANGUAGE MODEL FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION (LVCSR) SYSTEM |
Abstract |
According to one aspect of the invention, a method is provided in which a set of probabilistic attributes in an N-gram language model is classified into a plurality of classes. Each resultant class is clustered into a plurality of segments to build a codebook for the respective class, using a modified K-means clustering process that dynamically adjusts the size and centroid of each segment during each iteration. Each probabilistic attribute in a class is then represented by the centroid of the segment to which it belongs.
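The codebook idea in the abstract can be illustrated with a minimal sketch. It makes several assumptions that are not in the record: it uses plain 1-D K-means (Lloyd's algorithm) rather than the modified process, whose size-adjustment rule the abstract does not specify; the class name `bigram_logprob` and the helper names `nearest`, `build_codebook`, and `quantize` are hypothetical; and the sample values are invented for illustration.

```python
from typing import Dict, List, Tuple


def nearest(centroids: List[float], value: float) -> int:
    """Return the index of the centroid closest to value."""
    return min(range(len(centroids)), key=lambda j: abs(centroids[j] - value))


def build_codebook(values: List[float], k: int, iterations: int = 20) -> List[float]:
    """Plain 1-D K-means (NOT the patent's modified variant); returns sorted centroids."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    k = max(1, min(k, len(set(ordered))))
    # Initialise centroids at evenly spaced quantiles of the sorted data.
    centroids = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    for _ in range(iterations):
        sums = [0.0] * k
        counts = [0] * k
        for v in ordered:
            j = nearest(centroids, v)
            sums[j] += v
            counts[j] += 1
        # An empty segment keeps its previous centroid.
        updated = [sums[j] / counts[j] if counts[j] else centroids[j] for j in range(k)]
        if updated == centroids:
            break
        centroids = updated
    return sorted(centroids)


def quantize(classes: Dict[str, List[float]], k: int) -> Dict[str, Tuple[List[float], List[int]]]:
    """Build one codebook per class and replace each value by its centroid index."""
    result = {}
    for name, values in classes.items():
        codebook = build_codebook(values, k)
        codes = [nearest(codebook, v) for v in values]
        result[name] = (codebook, codes)
    return result


if __name__ == "__main__":
    # Hypothetical class of log-probabilities, invented for illustration.
    compact = quantize({"bigram_logprob": [-1.0, -1.1, -5.0, -5.2]}, k=2)
    codebook, codes = compact["bigram_logprob"]
    print(codebook, codes)
```

In this sketch each attribute is stored as a small integer index into its class codebook instead of a full floating-point value, which is where the space saving would come from under these assumptions.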
|
Publication Number |
WO02082310(A1) |
Publication Date |
2002.10.17 |
Application Number |
WO2001CN00541 |
Filing Date |
2001.04.03 |
Applicant |
INTEL CORPORATION;INTEL CHINA LTD.;LAI, CHUNRONG;ZHAO, QINGWEI;PAN, JIELIN |
Inventor |
LAI, CHUNRONG;ZHAO, QINGWEI;PAN, JIELIN |
Classification |
G10L15/06;G10L15/197;(IPC1-7):G06F17/20 |
Main Classification |
G10L15/06 |
Agency |
|
Agent |
|
Principal Claim |
|
Address |
|