发明名称 Registration method and search method for structured documents
摘要 A registration/search method for structured documents where correspondence data is prepared between a fixed-length-string and a string occurrence position within a structured document for all fixed-length-strings in the document and for each structured document. A list of a character and all hierarchical elements containing the character and element lengths is prepared. An occurrence frequency and an occurrence position of a search term is obtained using the plurality of fixed-length-substrings and the occurrence frequency extracting index. A search character is selected from the search term. A hierarchical element containing the search character is obtained using the character from the element length index. A length of the element corresponding to a search range is extracted using the obtained occurrence position. A matching degree for the search term is calculated from the obtained occurrence frequency of the search term and the extracted element length of the element corresponding to the search range.
申请公布号 US6826567(B2) 申请公布日期 2004.11.30
申请号 US20020218495 申请日期 2002.08.15
申请人 HITACHI, LTD. 发明人 TADA KATSUMI;SUGAYA NATSUKO;MATSUBAYASHI TADATAKA;OKAMOTO TAKUYA;KAWASHIMO YASUSHI
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址