发明名称 DOCUMENT CLASSIFICATION ASSISTING APPARATUS, METHOD AND PROGRAM
摘要 According to one embodiment, a document classification assisting apparatus includes an input unit, an extracting unit, an amount calculator, a setting unit, a calculator, and a storage. The input unit inputs documents including stroke information. The extracting unit extracts, from the stroke information, at least one of figure, annotation and text information. The amount calculator calculates, from the information extracted, feature amounts that enable comparison in similarity between the documents. The setting unit sets clusters including representative vectors that indicate features of the clusters and each include the feature amounts, and detects to which one of the clusters each of the documents belongs. The calculator calculates, as a classification rule, at least one of the feature amounts included in the representative vectors and characterizing the representative vectors. The storage stores the classification rule.
申请公布号 US2015199567(A1) 申请公布日期 2015.07.16
申请号 US201514668638 申请日期 2015.03.25
申请人 KABUSHIKI KAISHA TOSHIBA 发明人 Fume Kosei;Suzuki Masaru;Cho Kenta;Okamoto Masayuki
分类号 G06K9/00;G06K9/46;G06K9/62;G06K9/18 主分类号 G06K9/00
代理机构 代理人
主权项 1. A document classification assisting apparatus comprising: a document input unit configured to input a plurality of documents including stroke information; an extracting unit configured to extract, from the stroke information, at least one of figure information, annotation information and text information; a feature amount calculator configured to calculate, from the information extracted, feature amounts that enable comparison in similarity between the documents; a setting unit configured to set a plurality of clusters including representative vectors that indicate features of the clusters and each include the feature amounts, and to detect to which one of the clusters each of the documents belongs; a calculator configured to calculate, as a classification rule, at least one of the feature amounts included in the representative vectors and characterizing the representative vectors; and a storage configured to store the classification rule.
地址 Tokyo JP