发明名称 Auto-Maintained Document Classification
摘要 Machines, systems and methods for maintaining a representative data set in a document classification system, the method comprising: including an initial set of seed representative data in a representative data set (RDS) implemented for a knowledge base (KB), wherein the KB is trained to classify documents provided to a document classification system based on analysis of the representative documents included in the RDS and a set of rules, wherein the seed representative data includes a balanced number of representative data across a plurality of classes; updating the RDS by adding or removing representative data from the RDS based on feedback received about accuracy of classification of one or more documents by the classification system; and retraining the KB, wherein the retraining is performed based on occurrence of one or more events.
申请公布号 US2014348419(A1) 申请公布日期 2014.11.27
申请号 US201313900605 申请日期 2013.05.23
申请人 International Business Machines Corporation 发明人 Dayan Yigal S.;Fuchs Gil;Magdalen Josemina M.;Maharian Irit;Tzaban Yariv
分类号 G06K9/62 主分类号 G06K9/62
代理机构 代理人
主权项
地址 Armonk NY US