发明名称 GENERATING STRUCTURED INFORMATION
摘要 Structured and/or unstructured data about enterprises are acquired from one or more sources such as commercial data providers, enterprise web sites, and/or directory web sites. Strings are extracted from the unstructured data. The strings contain key, value pairs describing facts about the enterprises. The extracted strings are parsed to normalize the keys and values and place them in a machine-understandable structured representation. Some keys and/or values cannot be normalized. The facts are clustered with the enterprise to which they pertain. Normalized facts from different sources are compared and confidence levels and/or weights are assigned to the facts. These confidence levels and weights are used to select the facts that are displayed on a page for the enterprise in a directory.
申请公布号 WO2006094206(A3) 申请公布日期 2006.11.23
申请号 WO2006US07639 申请日期 2006.03.02
申请人 GOOGLE, INC.;PASZTOR, EGON;EGNOR, DANIEL 发明人 PASZTOR, EGON;EGNOR, DANIEL
分类号 G06F7/00 主分类号 G06F7/00
代理机构 代理人
主权项
地址