发明名称 METHOD AND APPARATUS FOR CREATING SAMPLE CHARACTER STRING DICTIONARY
摘要 <p>PROBLEM TO BE SOLVED: To improve locality of reference when restoring data.SOLUTION: In a method and an apparatus for creating sample character string dictionary, appearance frequencies of substrings in an input character string N are counted, and the substrings with the appearance frequencies of predetermined times k or more are combined before being stored in sample character string storing means. Then, a maximum matching length Lbetween a substring starting from an i-th starting position of the input character string N and a sample character string M, and an appearance position Pof the substring are calculated. Also, a maximum matching length Lbetween a substring starting from the i-th starting point of the input character string N and a substring appeared between 0 to (i-1)th in the input character string N, and an appearance position of the appeared substring are calculated. In the case of L<L, a replace pointer is set to indicate a past position of the input character string N, and in the case of L≥L, the replace pointer is set to indicate a position on the sample character string M. Thereafter, the substring of [i...i+L+1] in the input character string N is set as the substring already appeared and stored in dynamic dictionary storing means, before outputting a replace pointer string and the sample character string.</p>
申请公布号 JP2014204357(A) 申请公布日期 2014.10.27
申请号 JP20130080292 申请日期 2013.04.08
申请人 NIPPON TELEGR & TELEPH CORP <NTT> 发明人 YAMAMURO TAKESHI;KONISHI FUMIKAZU
分类号 H03M7/42 主分类号 H03M7/42
代理机构 代理人
主权项
地址