Title of invention: DOCUMENT ANALYSIS DEVICE AND DOCUMENT ANALYSIS PROGRAM
Abstract: PROBLEM TO BE SOLVED: To provide a document analysis device and program that extract a word causing a rise or fall in order, on the basis of the order relations among categories. SOLUTION: A document analysis device according to the embodiment includes: a document storage part that stores document data; a category storage part that stores a plurality of ordered categories into which the document data is classified, together with the hierarchical structure of those categories; and a factorial word extraction part that extracts, from the group of words included in one of the categories, a word whose frequency of appearance in that category is greater than its frequency of appearance in the other categories belonging to the same layer, and whose frequency of appearance in another category decreases the further that category is, in terms of order, from the category in question.
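The extraction condition in the abstract can be illustrated with a minimal sketch. This is not the patented implementation; it is one reading of the abstract under stated assumptions: only a single layer of the category hierarchy is considered, "frequency of appearance" is taken to be the fraction of a category's documents that contain the word, and "decreases further" is taken to mean strictly lower than the target category in the adjacent category and non-increasing after that. The names `relative_frequencies`, `extract_factor_words`, and the sample categories are hypothetical.

```python
from collections import Counter


def relative_frequencies(docs):
    """Fraction of documents in one category that contain each word (assumed measure)."""
    counts = Counter()
    for words in docs:
        counts.update(set(words))  # count each word once per document
    n = len(docs)
    return {w: c / n for w, c in counts.items()} if n else {}


def _side_ok(f_target, seq):
    """seq holds frequencies in categories moving away from the target on one side."""
    if not seq:
        return True
    if seq[0] >= f_target:  # the target category must have the strictly highest frequency
        return False
    # frequency must not rise again as the categories get further from the target
    return all(b <= a for a, b in zip(seq, seq[1:]))


def extract_factor_words(ordered_categories, target):
    """ordered_categories: list of (name, docs) in category order, all in one layer.

    Each doc is a list of words. Returns the sorted words of the target category
    that satisfy the assumed extraction condition.
    """
    names = [name for name, _ in ordered_categories]
    freqs = [relative_frequencies(docs) for _, docs in ordered_categories]
    t = names.index(target)
    result = []
    for word, f_target in freqs[t].items():
        lower = [freqs[i].get(word, 0.0) for i in range(t - 1, -1, -1)]
        higher = [freqs[i].get(word, 0.0) for i in range(t + 1, len(freqs))]
        if _side_ok(f_target, lower) and _side_ok(f_target, higher):
            result.append(word)
    return sorted(result)


# Hypothetical ordered categories (e.g. satisfaction levels) with tokenized documents.
sample = [
    ("low", [["slow", "crash"], ["crash", "bug"]]),
    ("mid", [["crash", "ok"], ["ok", "fast"]]),
    ("high", [["fast", "great"], ["great", "ok"]]),
]
print(extract_factor_words(sample, "high"))
```

In this sample, "fast" is not extracted for the "high" category because it is just as frequent in "mid", whereas "crash" is extracted for "low" because its frequency falls steadily toward "high".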
Publication number: JP2013190988(A) Publication date: 2013.09.26
Application number: JP20120056518 Filing date: 2012.03.13
Applicant: TOSHIBA CORP; TOSHIBA SOLUTIONS CORP Inventors: MIYABE YASUNARI; MATSUMOTO SHIGERU; GOTO KAZUYUKI; IWASAKI HIDEKI; KOBAYASHI MIKITO
Classification: G06F17/30 Main classification: G06F17/30