摘要 |
PROBLEM TO BE SOLVED: To facilitate document text retrieval by a user by easily and automatically extracting keywords for large amounts of document texts, especially, existing Japanese document texts, and applying those keywords to the document texts. SOLUTION: The document indexing device is provided with: a character code identifying part (131) for identifying the character type of characters configuring a Japanese document text based on a character code from the text, and for respectively extracting a Kanji character string and a Katakana character string; character string appearance frequency counting parts (132, 134) for counting the appearance frequency of the extracted character string; and keyword generating parts (133, 135) for acquiring the character string whose appearance frequency is a predetermined rate or more to the total number of respective character strings in the Japanese document text as keywords. COPYRIGHT: (C)2007,JPO&INPIT
|