Title of invention: FINDING AND DISAMBIGUATING REFERENCES TO ENTITIES ON WEB PAGES
Abstract: A system and method for disambiguating references to entities in a document. In one embodiment, an iterative process is used to disambiguate references to entities in documents. An initial model is used to identify documents referring to an entity based on features contained in those documents. The occurrence of various features in these documents is measured. From the number of occurrences of features in these documents, a second model is constructed. The second model is used to identify documents referring to the entity based on features contained in the documents. The process can be repeated, iteratively identifying documents referring to the entity and improving subsequent models based on those identifications. Additional features of the entity can be extracted from documents identified as referring to the entity.
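The iterative process described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the scoring rule (summed feature weights against a threshold), the reweighting rule (fraction of matched documents containing a feature), the function names, and the sample entity "Jane Doe" with its features are all assumptions made for illustration.

```python
from collections import Counter


def identify(docs, model, threshold):
    """Return ids of documents whose summed feature weights reach the threshold."""
    return {
        doc_id
        for doc_id, features in docs.items()
        if sum(model.get(f, 0.0) for f in features) >= threshold
    }


def build_model(docs, matched):
    """Weight each feature by the fraction of matched documents that contain it."""
    if not matched:
        return {}
    counts = Counter(f for doc_id in matched for f in docs[doc_id])
    return {f: n / len(matched) for f, n in counts.items()}


def disambiguate(docs, initial_model, threshold, rounds=5):
    """Alternate between identifying documents and rebuilding the model."""
    model = dict(initial_model)
    matched = set()
    for _ in range(rounds):
        new_matched = identify(docs, model, threshold)
        if not new_matched or new_matched == matched:
            break  # nothing matched, or the matched set stopped changing
        matched = new_matched
        model = build_model(docs, matched)
    return matched, model


# Hypothetical documents, each reduced to a set of extracted features.
docs = {
    "a": {"name:Jane Doe", "occupation:singer", "album:Blue Sky"},
    "b": {"name:Jane Doe", "album:Blue Sky", "born:1970"},
    "c": {"name:Jane Doe", "occupation:footballer"},
    "d": {"album:Blue Sky", "born:1970"},
}

# Hypothetical initial model: the features already known for the entity.
seed_model = {"name:Jane Doe": 1.0, "occupation:singer": 1.0}

matched, model = disambiguate(docs, seed_model, threshold=2.0)
print(sorted(matched))
print(sorted(model))
```

In this toy data the seed model matches only document "a". The model rebuilt from "a" then also matches "b", and the features of "b" (such as the birth year) enter the next model as additional features of the entity. Document "c" shares the name but no other weighted feature, so it stays below the threshold.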
Publication number: US2014379743(A1) | Publication date: 2014.12.25
Application number: US201414457869 | Filing date: 2014.08.12
Applicant: Google Inc. | Inventors: Laroco, Jr. Leonardo A.; Jevtic Nikola; Yakovenko Nikolai V.; Reynar Jeffrey
Classification: G06F17/30 | Main classification: G06F17/30
Agency: (not listed) | Agent: (not listed)
Main claim: 1. A method for identifying texts referring to an entity, the entity being associated with a first set of features, the method comprising: at a computer having one or more processors and memory storing programs for execution by the one or more processors: identifying a first set of text as associated with the entity in accordance with a first set of features that are sufficient for identifying a document referring to the entity; identifying a second set of text as associated with the entity in accordance with a second set of features that are sufficient for identifying a document referring to the entity, wherein the second set of features is distinct from the first set of features; identifying a representative feature associated with the entity, in accordance with the first set of features and the second set of features; wherein the first set of text and the second set of text are identified from a same audio file.
Address: Mountain View CA US