发明名称 |
Registration method and search method for structured documents |
摘要 |
A registration/search method for structured documents where correspondence data is prepared between a fixed-length-string and a string occurrence position within a structured document for all fixed-length-strings in the document and for each structured document. A list of a character and all hierarchical elements containing the character and element lengths is prepared. An occurrence frequency and an occurrence position of a search term is obtained using the plurality of fixed-length-substrings and the occurrence frequency extracting index. A search character is selected from the search term. A hierarchical element containing the search character is obtained using the character from the element length index. A length of the element corresponding to a search range is extracted using the obtained occurrence position. A matching degree for the search term is calculated from the obtained occurrence frequency of the search term and the extracted element length of the element corresponding to the search range.
|
申请公布号 |
US6826567(B2) |
申请公布日期 |
2004.11.30 |
申请号 |
US20020218495 |
申请日期 |
2002.08.15 |
申请人 |
HITACHI, LTD. |
发明人 |
TADA KATSUMI;SUGAYA NATSUKO;MATSUBAYASHI TADATAKA;OKAMOTO TAKUYA;KAWASHIMO YASUSHI |
分类号 |
G06F17/30;(IPC1-7):G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|