发明名称 Language model compression
摘要 A method for compressing a language model that comprises a plurality of N-grams and associated N-gram probabilities. The method comprises forming at least one group of N-grams from the plurality of N-grams; sorting N-gram probabilities associated with the N-grams of the at least one group of N-grams; and determining a compressed representation of the sorted N-gram probabilities. The at least one group of N-grams may be formed from N-grams of the plurality of N-grams that are conditioned on the same (N-1)-tuple of preceding words. The compressed representation of the sorted N-gram probabilities may be a sampled representation of the sorted N-gram probabilities or may comprise an index into a codebook. The invention further relates to an according computer program product and device, to a storage medium for at least partially storing a language model, and to a device for processing data at least partially based on a language model.
申请公布号 US2007078653(A1) 申请公布日期 2007.04.05
申请号 US20050243447 申请日期 2005.10.03
申请人 NOKIA CORPORATION 发明人 OLSEN JESPER
分类号 G10L15/00 主分类号 G10L15/00
代理机构 代理人
主权项
地址
您可能感兴趣的专利