发明名称 Reducing human overhead in text categorization
摘要 A unique multi-stage classification system and method that facilitates reducing human resources or costs associated with text classification while still obtaining a desired level of accuracy is provided. The multi-stage classification system and method involve a pattern-based classifier and a machine learning classifier. The pattern-based classifier is trained on discriminative patterns as identified by humans rather than machines which allow a smaller training set to be employed. Given humans' superior abilities to reason over text, discriminative patterns can be more accurately and more readily identified by them. Unlabeled items can be initially processed by the pattern-based classifier and if no pattern match exists, then the unlabeled data can be processed by the machine learning classifier. By employing the classifiers in this manner, less human involvement is required in the classification process. Even more, classification accuracy is maintained and/or improved.
申请公布号 US7894677(B2) 申请公布日期 2011.02.22
申请号 US20060350701 申请日期 2006.02.09
申请人 MICROSOFT CORPORATION 发明人 KOENIG ARND CHRISTIAN;BRILL ERIC D.
分类号 G06K9/64 主分类号 G06K9/64
代理机构 代理人
主权项
地址