发明名称 METHOD, APPARATUS, AND SYSTEM FOR BUILDING A COMPACT LANGUAGE MODEL FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION (LVCSR) SYSTEM
摘要 According to one aspect of the invention, a method is provided in which a set of probabilistic attributes in an N-gram language model is classified into a plurality of classes. Each resultant class is clustered into a plurality of segments to build a codebook for the respective class using a modified K-means clustering process which dynamically adjusts the size and centroid of each segment during each iteration in the modified K-means clustering process. A probabilistic attribute in each class is then represented by the centroid of the corresponding segment to which the respective probabilistic attribute belongs.
申请公布号 WO02082310(A1) 申请公布日期 2002.10.17
申请号 WO2001CN00541 申请日期 2001.04.03
申请人 INTEL CORPORATION;INTEL CHINA LTD.;LAI, CHUNRONG;ZHAO, QINGWEI;PAN, JIELIN 发明人 LAI, CHUNRONG;ZHAO, QINGWEI;PAN, JIELIN
分类号 G10L15/06;G10L15/197;(IPC1-7):G06F17/20 主分类号 G10L15/06
代理机构 代理人
主权项
地址