发明名称 INFORMATION RETRIEVAL SYSTEM, INFORMATION RETRIEVAL METHOD, STRUCTURAL ANALYSIS METHOD OF HTML DOCUMENT, AND PROGRAM
摘要 PROBLEM TO BE SOLVED: To effectively realize a flexible information retrieval by multiple strategies in response to an intended purpose of information in an information retrieval using a computer. SOLUTION: This system is provided with a document structure analyzing part 12 which analyzes the structure of a HTML document in consideration of a meaning in a prescribed web page, a level-of-importance calculation part 13 which calculates the level of importance of other web site which is linked from this web page according to a prescribed strategy, and a crawling execution part 14 which crawls a website according to the level of importance calculated by the level-of-importance calculation part 13. COPYRIGHT: (C)2004,JPO
申请公布号 JP2004054631(A) 申请公布日期 2004.02.19
申请号 JP20020211634 申请日期 2002.07.19
申请人 INTERNATL BUSINESS MACH CORP <IBM> 发明人 NOMIYAMA HIROSHI;IWAO TOSHITAKA
分类号 G06F17/30;G06F7/00;G06F12/00;G06F13/00;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址