Title of Invention |
INFORMATION RETRIEVAL SYSTEM, INFORMATION RETRIEVAL METHOD, STRUCTURAL ANALYSIS METHOD OF HTML DOCUMENT, AND PROGRAM |
Abstract |
PROBLEM TO BE SOLVED: To effectively realize flexible information retrieval using multiple strategies, chosen according to the intended purpose of the information, in computer-based information retrieval. SOLUTION: This system is provided with a document structure analyzing part 12 which analyzes the structure of an HTML document in a prescribed web page, taking its meaning into account; a level-of-importance calculation part 13 which calculates, according to a prescribed strategy, the level of importance of other websites linked from this web page; and a crawling execution part 14 which crawls websites according to the level of importance calculated by the level-of-importance calculation part 13. COPYRIGHT: (C)2004,JPO
|
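The three parts named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class `LinkExtractor`, the functions `keyword_strategy` and `crawl`, the keyword-count scoring rule, and the in-memory `pages` dictionary (used in place of real HTTP fetching) are all assumptions made for illustration. The sketch only shows the general idea of extracting links from an HTML document, scoring each linked page with a pluggable strategy, and visiting pages in descending order of that score.

```python
import heapq
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collects (href, anchor text) pairs; a simplified stand-in for structure analysis."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def keyword_strategy(keywords):
    """Hypothetical strategy: importance = number of keywords found in the anchor text."""
    def score(href, text):
        lowered = text.lower()
        return sum(1 for k in keywords if k in lowered)
    return score


def crawl(pages, start, strategy, limit=10):
    """Visit pages in descending importance; `pages` maps URL -> HTML (assumed, no network)."""
    frontier = [(0, 0, start)]  # (negated score, insertion order as tie-breaker, url)
    seen = {start}
    order = []
    counter = 1
    while frontier and len(order) < limit:
        _, _, url = heapq.heappop(frontier)
        order.append(url)
        parser = LinkExtractor()
        parser.feed(pages.get(url, ""))
        for href, text in parser.links:
            if href not in seen:
                seen.add(href)
                heapq.heappush(frontier, (-strategy(href, text), counter, href))
                counter += 1
    return order
```

Swapping in a different `strategy` callable changes the crawl order without touching the crawler itself, which is one plausible reading of "multiple strategies" in the abstract.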
Publication Number |
JP2004054631(A) |
Publication Date |
2004.02.19 |
Application Number |
JP20020211634 |
Filing Date |
2002.07.19 |
Applicant |
INTERNATL BUSINESS MACH CORP <IBM> |
Inventors |
NOMIYAMA HIROSHI;IWAO TOSHITAKA |
Classification Numbers |
G06F17/30;G06F7/00;G06F12/00;G06F13/00;(IPC1-7):G06F17/30 |
Main Classification Number |
G06F17/30 |
Agency |
|
Agent |
|
Principal Claim |
|
Address |
|