发明名称 |
Systems, methods, and software for classifying documents |
摘要 |
<p>To reduce cost and improve accuracy, the inventors devised systems, methods, and software to aid classification of text, such as headnotes and other documents, to target classes in a target classification system. For example, one system computes composite scores based on: similarity of input text to text assigned to each of the target classes; similarly of non-target classes assigned to the input text and target classes; probability of a target class given a set of one or more non-target classes assigned to the input text; and/or probability of the input text given text assigned to the target to the target classes. The exemplary system then evaluates the composite scores using class-specific decision criteria, such as thresholds, ultimately assigning or recommending assignment of the input text to one or more of the target classes. The exemplary system is particularly suitable for classification systems having thousands of classes.</p> |
申请公布号 |
EP2012240(A1) |
申请公布日期 |
2009.01.07 |
申请号 |
EP20080017291 |
申请日期 |
2002.11.01 |
申请人 |
THOMSON REUTERS GLOBAL RESOURCES |
发明人 |
AL-KOFAHI, KHALID;JACKSON, PETER;TRAVERS, TIMOTHY, EARL;TYRELL, ALEX |
分类号 |
G06F17/30;G06F7/00;G06K9/62;G06K9/68 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|