Abstract |
<P>PROBLEM TO BE SOLVED: To provide a method for creating an inverted index that can reduce the data size of an encrypted N-gram inverted index built without using a language dictionary. <P>SOLUTION: In a method for creating an inverted index of encrypted documents, performed by an inverted index creation system, a document is segmented at the boundaries where the character type changes, tokens are generated by applying a tokenization rule that differs for each character type, each generated token is encrypted, each encrypted token is replaced with a compression code, and an inverted index is created that associates the token represented by each code with the places where it appears, namely the document number and the position information. <P>COPYRIGHT: (C)2013,JPO&INPIT |
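The steps named in the SOLUTION paragraph can be sketched roughly as follows. This is a minimal illustration under several assumptions, not the patented method: the character classes in `char_type`, the per-type rules in `tokenize` (whole runs for Latin alphanumerics, overlapping bigrams for kana and kanji), the use of HMAC-SHA256 as a deterministic keyed stand-in for the unspecified cipher, and the sequential integers used as "compression codes" are all hypothetical choices made for the example.

```python
import hashlib
import hmac
from collections import defaultdict
from itertools import groupby


def char_type(ch):
    """Coarse character classes (assumed; the abstract does not list them)."""
    if ch.isascii() and ch.isalnum():
        return "latin"
    if ch.isspace():
        return "space"
    if "\u3040" <= ch <= "\u30ff":
        return "kana"
    if "\u4e00" <= ch <= "\u9fff":
        return "kanji"
    return "other"


def tokenize(text, n=2):
    """Yield (token, offset) pairs, splitting where the character type changes."""
    pos = 0
    for ctype, group in groupby(text, key=char_type):
        run = "".join(group)
        if ctype == "latin":
            # Assumed rule: a Latin alphanumeric run is one lowercased token.
            yield run.lower(), pos
        elif ctype in ("kana", "kanji"):
            # Assumed rule: overlapping n-grams, no language dictionary needed.
            if len(run) < n:
                yield run, pos
            else:
                for i in range(len(run) - n + 1):
                    yield run[i:i + n], pos + i
        # Whitespace and other symbols are skipped in this sketch.
        pos += len(run)


def encrypt_token(token, key):
    """Deterministic keyed digest standing in for the unspecified cipher."""
    return hmac.new(key, token.encode("utf-8"), hashlib.sha256).hexdigest()


def build_index(docs, key):
    """Return (codes, index) for a mapping of document number to text."""
    codes = {}                 # encrypted token -> short integer code
    index = defaultdict(list)  # code -> [(document number, offset), ...]
    for doc_id, text in docs.items():
        for token, offset in tokenize(text):
            enc = encrypt_token(token, key)
            code = codes.setdefault(enc, len(codes))
            index[code].append((doc_id, offset))
    return codes, dict(index)


def search(term, codes, index, key):
    """Look up a single token by encrypting it and following its code."""
    code = codes.get(encrypt_token(term, key))
    return index.get(code, []) if code is not None else []
```

In this sketch the size reduction comes from storing a short integer per posting list instead of the full encrypted token; the long ciphertext appears only once, in the `codes` table. A query is handled by encrypting the search term with the same key, mapping it to its code, and reading the postings for that code.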