Title of invention: N-GRAM LANGUAGE MODEL COMPRESSION
Abstract: <p>A method for compressing a language model that comprises a plurality of N-grams and associated N-gram probabilities. The method comprises forming at least one group of N-grams from the plurality of N-grams; sorting the N-gram probabilities associated with the N-grams of the at least one group of N-grams; and determining a compressed representation of the sorted N-gram probabilities. The at least one group of N-grams may be formed from N-grams of the plurality of N-grams that are conditioned on the same (N-1)-tuple of preceding words. The compressed representation of the sorted N-gram probabilities may be a sampled representation of the sorted N-gram probabilities or may comprise an index into a codebook. The invention further relates to a corresponding computer program product and device, to a storage medium for at least partially storing a language model, and to a device for processing data at least partially based on a language model.</p>
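The three steps named in the abstract (grouping by shared context, sorting, and a sampled representation) can be illustrated with a minimal Python sketch. This is not the patented implementation: the function names, the fixed sampling `step`, the choice to always keep the last value, and the use of linear interpolation to rebuild the skipped values are all assumptions made for illustration. The codebook variant mentioned in the abstract is not shown.

```python
from collections import defaultdict


def group_by_context(ngram_probs):
    """Group N-grams by the (N-1)-tuple of preceding words (hypothetical helper)."""
    groups = defaultdict(list)
    for ngram, prob in ngram_probs.items():
        context, word = ngram[:-1], ngram[-1]
        groups[context].append((word, prob))
    return groups


def compress_group(entries, step=2):
    """Sort one group by descending probability and keep every `step`-th value.

    Assumption: the final value is always kept so the curve has an end point.
    """
    ordered = sorted(entries, key=lambda e: e[1], reverse=True)
    words = [w for w, _ in ordered]
    probs = [p for _, p in ordered]
    samples = probs[::step]
    if (len(probs) - 1) % step != 0:
        samples.append(probs[-1])
    return words, samples, len(probs)


def decompress_group(samples, count, step=2):
    """Rebuild approximate probabilities by linear interpolation (an assumption)."""
    positions = list(range(0, count, step))
    if positions[-1] != count - 1:
        positions.append(count - 1)
    probs = []
    for i in range(count):
        j = i // step
        if positions[j] == i:
            probs.append(samples[j])  # stored sample, returned exactly
        else:
            x0, x1 = positions[j], positions[j + 1]
            t = (i - x0) / (x1 - x0)
            probs.append(samples[j] + t * (samples[j + 1] - samples[j]))
    return probs


if __name__ == "__main__":
    # Toy bigram model; the values are invented for this example.
    model = {
        ("the", "cat"): 0.5,
        ("the", "dog"): 0.3,
        ("the", "end"): 0.1,
        ("a", "cat"): 0.6,
    }
    for context, entries in group_by_context(model).items():
        words, samples, count = compress_group(entries)
        print(context, words, samples, decompress_group(samples, count))
```

Sorting first is what makes the sampling plausible: within a group the sorted probabilities form a monotone sequence, so a few stored points can approximate the rest. In this sketch the words are stored in sorted order so that each rebuilt probability can be matched back to its N-gram.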
Publication number: WO2007039856(A1) Publication date: 2007.04.12
Application number: WO2006IB53538 Filing date: 2006.09.28
Applicant: NOKIA CORPORATION; OLSEN, JESPER Inventor: OLSEN, JESPER
Classification number: Main classification number:
Agency: Agent:
Principal claim:
Address: