发明名称 |
DEVICE AND METHOD FOR RETRIEVING SIMILAR DOCUMENT BY STRUCTURE SPECIFICATION |
摘要 |
PROBLEM TO BE SOLVED: To add the specification of an object structure to be retrieved to retrieval conditions and to improve the retrieval precision when a document which is similar to a seed document (document specified as a retrieval condition) is retrieved. SOLUTION: A retrieval condition expression analyzing program 130 receives the specification of a seed document and the input of an object structure to be retrieved as retrieval conditions. A featured character string extracting program 150 extracts a featured character string from the text of the specified seed document. A retrieval object structure ID acquiring program 151 converts the specified structure into its ID. A similarity calculating program 152 performs retrieval from an appearance frequency file 181 to acquire the appearance frequency of a document whose structure ID matches the featured character string and calculates the similarity of the similar document based upon the seed document. A retrieval result output program 132 displays the identifier and similarity of the similar document as the retrieval result. |
申请公布号 |
JP2001014326(A) |
申请公布日期 |
2001.01.19 |
申请号 |
JP19990183349 |
申请日期 |
1999.06.29 |
申请人 |
HITACHI LTD |
发明人 |
MATSUBAYASHI TADATAKA;TADA KATSUMI;SUGAYA NATSUKO;INABA YASUHIKO;YAMAGUCHI AKIHIKO;GOCHI YOSUKE |
分类号 |
G06F17/21;G06F17/27;G06F17/30 |
主分类号 |
G06F17/21 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|