发明名称 |
Method and system for using OCR data for grouping and classifying documents |
摘要 |
A document template for classifying documents is created for each document class. The document template includes a set of keywords and the spatial relations of the keywords. A document to be classified is received. The spatial relations of the template keywords of a template are compared with the spatial relations of corresponding words in the document. If the spatial relations are the same, the document may be classified in the document class of the template. |
申请公布号 |
US8724907(B1) |
申请公布日期 |
2014.05.13 |
申请号 |
US201213432251 |
申请日期 |
2012.03.28 |
申请人 |
SAMPSON STEVEN;PRUDENT YANN;EMC CORPORATION |
发明人 |
SAMPSON STEVEN;PRUDENT YANN |
分类号 |
G06K9/68 |
主分类号 |
G06K9/68 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|