Title of invention |
Systems, methods, and software for classifying documents |
Abstract |
To reduce cost and improve accuracy, the inventors devised systems, methods, and software to aid classification of text, such as headnotes and other documents, to target classes in a target classification system. For example, one method comprises: identifying a first set of noun-word pairs in the input text, with the first set including at least one noun-word pair formed from a noun and a non-adjacent word in the input text; identifying two or more second sets of noun-word pairs, with each second set including at least one noun-word pair formed from a noun and a non-adjacent word in text associated with a respective one of the target classes; determining a set of scores based on the first and second sets of noun-word pairs; and classifying or recommending classification of the input text to one or more of the target classes based on the set of scores. |
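The method in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not say how nouns are identified, how far apart a noun and its paired word may be, or how the scores are computed, so the fixed noun list, the `window` parameter, the Jaccard overlap score, the `threshold` value, and all function names below are assumptions made purely for illustration.

```python
from typing import Dict, List, Set, Tuple

Pair = Tuple[str, str]


def noun_word_pairs(text: str, nouns: Set[str], window: int = 3) -> Set[Pair]:
    """Pair each noun with every other word within `window` tokens.

    Pairs therefore include non-adjacent words. The noun list stands in
    for a real part-of-speech tagger (an assumption of this sketch).
    """
    tokens = [t.strip(".,;:()").lower() for t in text.split()]
    tokens = [t for t in tokens if t]
    pairs: Set[Pair] = set()
    for i, tok in enumerate(tokens):
        if tok not in nouns:
            continue
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.add((tok, tokens[j]))
    return pairs


def score_classes(input_pairs: Set[Pair],
                  class_pairs: Dict[str, Set[Pair]]) -> Dict[str, float]:
    """Score each target class by Jaccard overlap of pair sets (assumed metric)."""
    scores: Dict[str, float] = {}
    for name, pairs in class_pairs.items():
        union = input_pairs | pairs
        scores[name] = len(input_pairs & pairs) / len(union) if union else 0.0
    return scores


def recommend(scores: Dict[str, float], threshold: float = 0.1) -> List[str]:
    """Return classes scoring at or above the (assumed) threshold, best first."""
    hits = [c for c, s in scores.items() if s >= threshold]
    return sorted(hits, key=lambda c: -scores[c])


# Toy data, invented for illustration only.
nouns = {"contract", "breach", "damages", "negligence", "duty"}
class_text = {
    "contracts": "breach of contract gives rise to damages",
    "torts": "negligence requires a duty of care",
}
class_pairs = {c: noun_word_pairs(t, nouns) for c, t in class_text.items()}
input_pairs = noun_word_pairs("damages awarded for breach of the contract", nouns)
scores = score_classes(input_pairs, class_pairs)
print(recommend(scores))
```

A set-overlap score keeps the sketch short. A production system would more plausibly weight pairs by frequency and combine several scores per class, but the abstract leaves those details open.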
Publication number |
AU2009202974(B2) |
Publication date |
2012.07.19 |
Application number |
AU20090202974 |
Filing date |
2009.07.23 |
Applicant |
THOMSON REUTERS GLOBAL RESOURCES |
Inventors |
AL-KOFAHI, KHALID;JACKSON, PETER;TRAVERS, TIMOTHY EARL;TYRELL, JAMES ALEXANDER |
Classification codes |
G06F17/30;G06F7/00;G06K9/62;G06K9/68 |
Main classification code |
G06F17/30 |
Patent agency |
|
Patent agent |
|
Principal claim |
|
Address |
|