发明名称 INFORMATION RETRIEVAL AND SPEECH RECOGNITION BASED ON LANGUAGE MODELS
摘要 A language model (70) is used in a speech recognition system (60) which has access to a first, smaller data store (72) and a second, larger data store (74). The language model (70) is adapted by formulating an information retrieval query based on information contained in the first data store (72) and querying the second data store (74), Information retrieved from the second data store (74) is used in adapting the language model (70). Also, language models are used in retrieving information from the second data store (74). Language models are built based on information in the first data store (72), and based on information in the second data store (74). The perplexity of a document in the second data store (74) is determined, given the first language model, and given the second language model. Relevancy of the document is determined based upon the first and second perplexities. Documents are retrieved which have a relevancy measure that exceeds a threshold level.
申请公布号 CA2321112(C) 申请公布日期 2005.01.11
申请号 CA19992321112 申请日期 1999.02.09
申请人 MICROSOFT CORPORATION 发明人 HUANG, XUEDONG D.;MAHAJAN, MILIND V.
分类号 G06F17/30;G10L15/18;G10L15/22;(IPC1-7):G10L5/06 主分类号 G06F17/30
代理机构 代理人
主权项
地址