发明名称 Systems and methods for classifying objects in digital images captured using mobile devices
摘要 A method includes receiving or capturing a digital image using a mobile device, and using a processor of the mobile device to: determine whether an object depicted in the digital image belongs to a particular object class among a plurality of object classes; determine one or more object features of the object based at least in part on the particular object class at least partially in response to determining the object belongs to the particular object class; build or select an extraction model based at least in part on the one or more determined object features; and extract data from the digital image using the extraction model. The extraction model excludes, and/or the extraction process does not utilize, optical character recognition (OCR) techniques. Related systems and computer program products are also disclosed.
申请公布号 US9311531(B2) 申请公布日期 2016.04.12
申请号 US201414209825 申请日期 2014.03.13
申请人 Kofax, Inc. 发明人 Amtrup Jan W.;Macciola Anthony;Thompson Stephen Michael;Ma Jiyong
分类号 G06K9/00 主分类号 G06K9/00
代理机构 Zilka-Kotab, PC 代理人 Zilka-Kotab, PC
主权项 1. A method, comprising: receiving or capturing a digital image using a mobile device; using a processor of the mobile device to: determine whether an object depicted in the digital image belongs to a particular object class among a plurality of object classes based on feature-space discrimination wherein the feature space discrimination utilizes one or more of support-vector-machine (SVM) techniques, transductive classification techniques, and maximum entropy discrimination (MED) techniques;determine one or more object features of the object based at least in part on the particular object class at least partially in response to determining the object belongs to the particular object class;build or select an extraction model based at least in part on the one or more determined object features; andextract data from the digital image using the extraction model, the extracting comprising detecting one or more lines of text in the object, and the detecting comprising: projecting the digital image onto a single dimension;projecting each color channel of the digital image onto a single channel along the single dimensiondetermining a distribution of light and dark areas along the projection;determining a plurality of dark pixel densities, each dark pixel density corresponding to a position along the projection;determining whether each dark pixel density is greater than a probable text line threshold; anddesignating each position as a text line upon determining the corresponding dark pixel density is greater than the probable text line threshold.
地址 Irvine CA US