Title of invention: METHOD AND SYSTEM OF GENERATING AN AGGREGATE WEBSITE SEARCH DATABASE USING SMART INDEXES FOR SEARCHING
Abstract: Signature schema documents may be pre-defined in a query language to provide instructions that an engine applies to extract data from web pages of respective web sites. For a particular web page, the signature schema instructions identify the web page family to which the page belongs and extract the desired data from the page in accordance with that family. The instructions use signatures previously identified within web pages of the same family to distinguish the web page family from others of the web site and to distinguish the desired data from other data for the web page family. A server may make one or more requests to obtain web pages from various web sites and apply the respective signature schemas maintained in a repository coupled to the engine. Indexes can be generated based upon pre-defined data relationships to improve search capability. Extracted data and indexes can be stored in an aggregate database.
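The two-step flow described in the abstract (identify the page family, then extract the desired data for that family, then index it) can be illustrated with a minimal sketch. This is not the patented implementation: regular expressions stand in for the unspecified query language, and every name here (`FamilySchema`, `identify_family`, `extract`, `build_index`, the sample pages and patterns) is a hypothetical chosen for illustration.

```python
import re
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class FamilySchema:
    """Hypothetical signature schema for one web page family."""
    name: str
    family_signature: str             # regex that marks a page as a member of this family
    field_signatures: Dict[str, str]  # field name -> regex with one capture group


def identify_family(html: str, schemas: List[FamilySchema]) -> Optional[FamilySchema]:
    """Return the first schema whose family signature occurs in the page, else None."""
    for schema in schemas:
        if re.search(schema.family_signature, html):
            return schema
    return None


def extract(html: str, schema: FamilySchema) -> Dict[str, str]:
    """Pull each desired field out of the page using the family's field signatures."""
    record = {}
    for field_name, pattern in schema.field_signatures.items():
        match = re.search(pattern, html, re.S)
        if match:
            record[field_name] = match.group(1).strip()
    return record


def build_index(records: List[Dict[str, str]], key: str) -> Dict[str, List[int]]:
    """Map each lower-cased value of `key` to the positions of the records holding it."""
    index: Dict[str, List[int]] = {}
    for position, record in enumerate(records):
        if key in record:
            index.setdefault(record[key].lower(), []).append(position)
    return index


# Assumed sample data: two page families of one imaginary web site.
SCHEMAS = [
    FamilySchema(
        name="product",
        family_signature=r'<div class="product">',
        field_signatures={
            "title": r'<h1 class="title">(.*?)</h1>',
            "price": r'<span class="price">\$([\d.]+)</span>',
        },
    ),
    FamilySchema(
        name="article",
        family_signature=r"<article>",
        field_signatures={"headline": r'<h1 class="headline">(.*?)</h1>'},
    ),
]

PRODUCT_PAGE = (
    '<div class="product"><h1 class="title">Blue Kettle</h1>'
    '<span class="price">$24.99</span></div>'
)
ARTICLE_PAGE = '<article><h1 class="headline">Kettle Review</h1></article>'
```

In this sketch, a page that matches no family signature yields `None` and is skipped, while matched pages produce records that could be written to an aggregate store together with the index built over them.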
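Publication number: US2008288477(A1) Publication date: 2008.11.20
Application number: US20080119338 Filing date: 2008.05.12
Applicant: KIM SANG-HEUN;STINSON CHARLES LAURENCE Inventor: KIM SANG-HEUN;STINSON CHARLES LAURENCE
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Main claim:
Address: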