Title of Invention |
DOCUMENT ANALYSIS APPARATUS AND PROGRAM |
Abstract |
In a document analysis apparatus according to an embodiment, an acquisition unit acquires a plurality of words by analyzing a text included in each of a plurality of documents stored in a document storage unit. A first determination unit determines, for each of the acquired words, the presence/absence of a correlation between the word and at least two attributes designated by a user out of a plurality of attributes of the plurality of documents stored in the document storage unit. A second determination unit determines whether a determination result by the first determination unit matches a pattern designated by the user out of a plurality of patterns stored in a pattern storage unit. A presentation unit presents a word whose determination result by the first determination unit is determined to match the pattern designated by the user. |
Application Publication Number |
US2015199427(A1) |
Application Publication Date |
2015.07.16 |
Application Number |
US201514669721 |
Filing Date |
2015.03.26 |
Applicant |
Kabushiki Kaisha Toshiba; Toshiba Solutions Corporation |
Inventor |
MIYABE Yasunari; MATSUMOTO Shigeru; GOTO Kazuyuki; IWASAKI Hideki; ISOBE Shozo |
Classification Number |
G06F17/30 |
Main Classification Number |
G06F17/30 |
Agency |
|
Agent |
|
Main Claim |
1. A document analysis apparatus comprising:
a document storage unit which stores a plurality of documents each of which includes a text formed from a plurality of words, has a plurality of attributes, and includes attribute values of the attributes;
a pattern storage unit which stores a plurality of patterns each representing presence/absence of a correlation between a word and each of at least two attributes out of the plurality of attributes;
an acquisition unit which acquires a plurality of words by analyzing the text included in each of the plurality of documents stored in the document storage unit;
a first determination unit which determines, for each of the acquired words, the presence/absence of the correlation between the word and at least two attributes designated by a user out of the plurality of attributes of the plurality of documents stored in the document storage unit;
a second determination unit which determines whether a determination result by the first determination unit matches a pattern designated by the user out of the plurality of patterns stored in the pattern storage unit; and
a presentation unit which presents a word whose determination result by the first determination unit is determined to match the pattern designated by the user. |
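The flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the claim does not say how the presence/absence of a correlation is computed, so the frequency-difference test and its `threshold` below are assumptions made only for illustration. The document layout, the sample data, the `PATTERNS` table, and all function names are likewise hypothetical.

```python
from collections import defaultdict

# Document storage unit (hypothetical layout): text plus attribute values.
documents = [
    {"text": "battery drains fast", "attributes": {"product": "phone", "month": "jan"}},
    {"text": "battery overheats", "attributes": {"product": "phone", "month": "feb"}},
    {"text": "screen flickers", "attributes": {"product": "laptop", "month": "jan"}},
    {"text": "screen cracked", "attributes": {"product": "laptop", "month": "feb"}},
]

# Pattern storage unit (hypothetical names): one presence/absence flag per attribute.
PATTERNS = {
    "first_only": (True, False),
    "second_only": (False, True),
    "both": (True, True),
}


def acquire_words(docs):
    """Acquisition unit: collect every word found in the document texts."""
    words = set()
    for doc in docs:
        words.update(doc["text"].lower().split())
    return words


def has_correlation(word, attribute, docs, threshold=0.3):
    """First determination unit, using an ASSUMED test: the word is treated as
    correlated with the attribute if its document frequency within some
    attribute value differs from its overall frequency by more than threshold."""
    contains = [word in doc["text"].lower().split() for doc in docs]
    overall = sum(contains) / len(docs)
    hits_by_value = defaultdict(list)
    for doc, hit in zip(docs, contains):
        hits_by_value[doc["attributes"][attribute]].append(hit)
    return any(
        abs(sum(hits) / len(hits) - overall) > threshold
        for hits in hits_by_value.values()
    )


def matching_words(docs, attributes, pattern, threshold=0.3):
    """Second determination unit and presentation unit: return the words whose
    per-attribute determination result equals the designated pattern."""
    presented = []
    for word in sorted(acquire_words(docs)):
        result = tuple(has_correlation(word, a, docs, threshold) for a in attributes)
        if result == pattern:
            presented.append(word)
    return presented


# The user designates two attributes and one stored pattern.
print(matching_words(documents, ("product", "month"), PATTERNS["first_only"]))
# prints ['battery', 'screen']
```

In this toy data, "battery" and "screen" each appear only for one product but equally often in both months, so their determination result is (presence, absence), which matches the designated pattern. A real system would likely replace the assumed threshold test with a proper statistical measure.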
地址 |
Minato-ku JP |