发明名称 BOOTSTRAP AND ADAPT A DOCUMENT SEARCH ENGINE
摘要 Architecture that employs a modeling technique based on language modeling to estimate a probability of a document matching the user need as expressed in the query. The modeling technique is based on the data mining results that various portions of a document (e.g., body, title, URL, anchor text, user queries) use different styles of human languages. Thus, the results based on a language can be adapted individually to match the language of query. Since the approach is based on adaptation, the framework also provides a natural means to progressively revise the model as user data are collected. Different styles of languages in a document can be recognized and adapted individually. Background language models are also employed that offer a fallback approach in case the document has incomplete fields of data, and can utilize topical or semantic hierarchy of the knowledge domain.
申请公布号 US2011231394(A1) 申请公布日期 2011.09.22
申请号 US20100726358 申请日期 2010.03.18
申请人 MICROSOFT CORPORATION 发明人 WANG KUANSAN;HSU BO-JUNE;LI XIAOLONG
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址