发明名称 Methods and apparatus for automated image classification
摘要 A system for automated classification of an image of an electronic document such as a facsimile document. The image is converted to a textual representation, and at least some of the terms in the textual representation may be associated with one or more predefined classification types, thereby enabling the document to be classified, and for multi-page documents, determining boundaries used to split the document into sections. The development of associations between terms and classification types may result from providing, to the system, a training set of manually-classified documents. A training module analyzes the training set to calculate probabilities that particular terms may appear in documents of a particular classification type. Probabilities established during training are used during automated document processing to assign a classification type to the document. A confidence score associated with the assigned classification type provides a metric for assessing the accuracy of the automated process.
申请公布号 US8671112(B2) 申请公布日期 2014.03.11
申请号 US20080138181 申请日期 2008.06.12
申请人 AMAR ANSHUL;SALLASKA NYE JOREL;ATHENAHEALTH, INC. 发明人 AMAR ANSHUL;SALLASKA NYE JOREL
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址