Title of invention |
AUTOMATIC DATA CLEANING FOR MACHINE LEARNING CLASSIFIERS |
Abstract |
Systems and techniques for improving the training of machine learning classifiers are disclosed. A classifier is trained using a set of validated documents that are accurately associated with a set of class labels. A subset of non-validated documents is also identified and is used to further train and improve accuracy of the classifier. |
Publication number |
EP2678806(A2) |
Publication date |
2014.01.01 |
Application number |
EP20120708205 |
Filing date |
2012.02.21 |
Applicant |
THOMSON REUTERS GLOBAL RESOURCES |
Inventors |
MALIK, HASSAN, H.;OLOF-ORS, MANS |
Classification codes |
G06K9/62;G06N5/00;G06N5/02 |
Main classification code |
G06K9/62 |
Patent agency |
|
Agent |
|
Principal claim |
|
Address |
|