Title of Invention: Recording medium and character string collating apparatus for full-text character data
Abstract: From a registration character string in which many meaningless special characters occur frequently, one of two sets of character chains is detected: either all two-character chains consisting of two general characters together with all three-character chains consisting of one special character between two general characters, or all two-character chains consisting of two general or symbolic characters, taken from a converted registration character string produced by changing each special character to one type of symbolic character determined by a general character adjacent to that special character. The occurrence frequencies of the general or symbolic characters of each chain are counted and stored in a recording medium together with the registration character chains. When a retrieval character string is input, retrieval character chains are detected from it in the same manner, the occurrence frequencies of the particular character chains corresponding to them are read out from the recording medium and collated with each other, and a particular character string agreeing with the retrieval character string is retrieved from the registration character string. Because the occurrence frequency of the special characters is not counted, or because the special characters are changed to various types of symbolic characters, the recording area required for the occurrence frequencies of the registration character chains can be reduced.
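The first chain-detection variant described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not specify a storage layout, so counting occurrences per chain in a `Counter` is a simplification, whitespace is assumed to play the role of the special characters, and the names `SPECIAL`, `detect_chains`, and `may_contain` are hypothetical. The second variant (replacing special characters with symbolic characters) is not shown.

```python
from collections import Counter

# Assumption: whitespace stands in for the meaningless "special characters".
SPECIAL = frozenset(" \t\n")

def detect_chains(text, special=SPECIAL):
    """Count two-character chains (general, general) and three-character
    chains (general, special, general). Chains never start with a special
    character, so no entry is kept for special characters alone."""
    chains = Counter()
    n = len(text)
    for i in range(n - 1):
        first = text[i]
        if first in special:
            continue
        second = text[i + 1]
        if second not in special:
            chains[first + second] += 1          # general + general
        elif i + 2 < n and text[i + 2] not in special:
            chains[text[i:i + 3]] += 1           # general + special + general
    return chains

def may_contain(index, query, special=SPECIAL):
    """Necessary (not sufficient) test: every chain detected in the query
    must occur in the registered text at least as often."""
    query_chains = detect_chains(query, special)
    return all(index[chain] >= count for chain, count in query_chains.items())

# Build the chain table for a registered string, then screen two queries.
index = detect_chains("ab cd ab")
print(index)
print(may_contain(index, "b cd"), may_contain(index, "ac"))
```

A screen of this kind only rules out strings that cannot match; a text that passes would still need a direct comparison to confirm that the retrieval string actually occurs in it.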
Publication No.: US6260051(B1) Publication Date: 2001.07.10
Application No.: US19980114284 Filing Date: 1998.07.13
Applicant: MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. Inventors: KATAYAMA OSAMU;OYAMA TAKAMASA;KIKUCHI CHUICHI;FUJITA TOMOKO;SHIRASAKI YASUYO
Classification: G06F17/30;(IPC1-7):G06F17/21;G06F17/22;G06F17/27 Main Classification: G06F17/30
Agency: Agent:
Main Claim:
Address: