发明名称 Inference indexing
摘要 Methods, systems, and media are provided for facilitating generation of an inference index. In embodiments, a canonical entity is referenced. The canonical entity is associated with web documents. One or more queries that, when input, result in a selection of at least one of the web documents are identified. An entity document is generated for the canonical entity. The entity document includes the identified queries and/or associated text from the content of a document or from an entity title that result in the selection of the at least one of the web documents. The entity document and corresponding canonical entity can be combined with additional related entity documents and canonical entities to generate an inference index.
申请公布号 US8977625(B2) 申请公布日期 2015.03.10
申请号 US201012968481 申请日期 2010.12.15
申请人 Microsoft Technology Licensing, LLC 发明人 Buehrer Gregory T.;Jiang Li;Viola Paul Alfred;McGovern Andrew Paul;Szymanski Jakub Jan;Ahari Sanaz
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 代理人 Meyers Jessica;Barker Doug;Minhas Micky
主权项 1. A computer-implemented method of facilitating generation of an inference index using a computing system having processor, memory, and data storage subsystems, the computer-implemented method comprising: referencing a canonical entity that is associated with one or more web documents; identifying a plurality of queries that, when input, result in a selection of at least one web document of the one or more web documents associated with the canonical entity; generating, via the processor, an entity document for the canonical entity, the entity document including the plurality of identified queries and a representation of the at least one web document, wherein the plurality of identified queries resulted in the selection of the at least one web document corresponding with the canonical entity comprising a unique representation of an entity; generating an inference index using the canonical entity and the entity document along with other related canonical entities and corresponding entity documents, the inference index corresponding with a knowledge domain of related canonical entities; and utilizing the inference index in response to a real-time user query provided by a user after generation of the inference index to select a particular canonical entity that is most related to the real-time user query, the particular canonical entity comprising a unique representation of an entity that indicates a person or place and that is selected based on a cumulative score associated with the selected canonical entity being greater than cumulative scores associated with one or more other canonical entities within the inference index, wherein each of the cumulative scores for the canonical entities comprises an aggregate of entity document scores within the corresponding canonical entity, each entity document score calculated based on a frequency of at least a portion of the real-time user query occurring within queries of the corresponding entity document.
地址 Redmond WA US