Title of invention: DOCUMENT CLASSIFICATION DEVICE, DOCUMENT CLASSIFICATION METHOD, AND COMPUTER READABLE MEDIUM
Abstract: A document classification device includes a characteristic extraction unit, a clustering unit, and a category update unit. The characteristic extraction unit extracts characteristic information from each of plural document data which are classified in advance into specific categories. The clustering unit classifies the document data with similar appearance frequency of the characteristic information into a same cluster. The category update unit assigns the document data which is classified into the same cluster with a category of different document data which is classified into the same cluster as a category of the document data.
Publication number: US2015254332(A1)  Publication date: 2015.09.10
Application number: US201514717034  Filing date: 2015.05.20
Applicant: FUJI XEROX CO., LTD.  Inventors: HATTORI Keigo; MASUICHI Hiroshi
Classification: G06F17/30  Main classification: G06F17/30
Agency:  Agent:
Principal claim: 1. A document classification device comprising: a characteristic extraction unit that extracts characteristic information from each of a plurality of document data which are classified in advance into specific categories; a clustering unit that classifies the document data with similar appearance frequency of the characteristic information into a same cluster; and a category update unit that assigns the document data which is classified into the same cluster with a category of different document data which is classified into the same cluster as a category of the document data.
Address: Tokyo JP
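
The three units named in the principal claim (characteristic extraction, clustering by similar appearance frequency, category update) can be illustrated with a minimal Python sketch. This is not the patented implementation, and the record does not say which features, similarity measure, or update rule are used. The sketch assumes word counts as the characteristic information, cosine similarity with a greedy single-pass grouping as the clustering step, and a majority vote within each cluster as the category update. All function names, the threshold value, and the sample documents are hypothetical.

```python
import math
from collections import Counter


def extract_features(text):
    """Characteristic extraction (assumed): lowercase word counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_documents(features, threshold=0.5):
    """Greedy single-pass clustering (assumed): a document joins the first
    cluster whose seed document is at least `threshold` similar to it."""
    clusters = []  # each cluster is a list of document indices; index 0 is the seed
    for i, feat in enumerate(features):
        for members in clusters:
            if cosine(feat, features[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters


def update_categories(categories, clusters):
    """Category update (assumed rule): every document in a cluster takes the
    most common pre-assigned category found in that cluster."""
    updated = list(categories)
    for members in clusters:
        majority = Counter(categories[i] for i in members).most_common(1)[0][0]
        for i in members:
            updated[i] = majority
    return updated


# Hypothetical documents with categories assigned in advance.
docs = [
    ("invoice total amount due payment", "finance"),
    ("invoice payment amount due date", "finance"),
    ("payment invoice amount total due", "legal"),  # likely mislabeled
    ("contract clause party agreement term", "legal"),
    ("agreement contract party clause signed", "legal"),
]

features = [extract_features(text) for text, _ in docs]
categories = [category for _, category in docs]
clusters = cluster_documents(features)
updated = update_categories(categories, clusters)
print(updated)  # ['finance', 'finance', 'finance', 'legal', 'legal']
```

In this sketch the third document shares its vocabulary with the two finance documents, so it lands in their cluster and its category is changed from "legal" to "finance". A majority vote is only one way to pick the "category of different document data" in the same cluster.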