Title of invention: DOCUMENT INDEXING DEVICE, DOCUMENT INDEXING METHOD AND DOCUMENT INDEXING PROGRAM
Abstract: PROBLEM TO BE SOLVED: To make document retrieval easier for users by automatically extracting keywords from large volumes of document text, in particular existing Japanese document text, and attaching those keywords to the documents. SOLUTION: The document indexing device comprises: a character code identifying part (131) that identifies the character type of each character in a Japanese document text from its character code and extracts Kanji character strings and Katakana character strings separately; character string appearance frequency counting parts (132, 134) that count how often each extracted character string appears; and keyword generating parts (133, 135) that take as keywords those character strings whose appearance frequency is at least a predetermined ratio of the total number of the respective character strings in the Japanese document text. COPYRIGHT: (C)2007,JPO&INPIT
Publication number: JP2007128224(A) Publication date: 2007.05.24
Application number: JP20050319454 Filing date: 2005.11.02
Applicant: RESEARCH ORGANIZATION OF INFORMATION & SYSTEMS; EXCELLEAD TECHNOLOGY:KK Inventor: SONEHARA NOBORU; KAMAE NAOHIKO; NUMATA HIDEHO; IKEDA YOSHIYO
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Main claim:
Address:
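
The pipeline described in the abstract (classify characters by code, extract Kanji and Katakana strings, count them, keep the frequent ones) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration, not taken from the patent: the function name `extract_keywords`, the parameter `min_rate`, the use of Unicode ranges as the character-code test, and the reading of "total number of respective character strings" as the total count of extracted strings of the same type.

```python
import re
from collections import Counter

# Assumption: Unicode code-point ranges stand in for the "character code"
# test; the record does not say which encoding the device inspects.
KANJI_RUN = re.compile(r"[\u4e00-\u9fff\u3005]+")     # CJK unified ideographs plus the iteration mark
KATAKANA_RUN = re.compile(r"[\u30a1-\u30fa\u30fc]+")  # katakana letters plus the long-vowel mark


def extract_keywords(text, min_rate=0.1):
    """Return frequent Kanji and Katakana strings found in `text`.

    A string is kept when its share of all extracted strings of the same
    type is at least `min_rate` (a hypothetical stand-in for the
    "predetermined rate" in the abstract).
    """
    keywords = {}
    for name, pattern in (("kanji", KANJI_RUN), ("katakana", KATAKANA_RUN)):
        runs = pattern.findall(text)   # maximal runs of one character type
        counts = Counter(runs)         # appearance frequency per string
        total = len(runs)              # total number of strings of this type
        keywords[name] = sorted(
            s for s, n in counts.items() if total and n / total >= min_rate
        )
    return keywords


if __name__ == "__main__":
    sample = "文書検索はキーワードで文書を探す。キーワードは文書から抽出する。"
    print(extract_keywords(sample, min_rate=0.3))
```

In the sample sentence, hiragana and punctuation act as separators, so no morphological analyzer is needed; this appears to be the point of splitting on character type, though the record itself gives no implementation detail.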