Title of invention: Systems and methods for mobile image capture and processing
Abstract: In various embodiments, methods, systems, and computer program products for processing digital images captured by a mobile device are disclosed. Myriad features enable and/or facilitate processing of such digital images using a mobile device that would otherwise be technically impossible or impractical, and furthermore address unique challenges presented by images captured using a camera rather than a traditional flat-bed scanner, paper-feed scanner, or multifunction peripheral. Particularly advantageous features include robustly detecting edges of one or more documents depicted in the digital image data, and defining/locating document pages at least partially on this basis. The statistical approaches employed enable robust yet computationally efficient techniques to accomplish page detection, and associated functions, using hardware typically included in mobile devices and within practical (especially temporal) limits imposed by device manufacturers, users, associated and/or downstream computational and/or business processes.
Publication number: US8971587(B2) | Publication date: 2015.03.03
Application number: US201414334558 | Filing date: 2014.07.17
Applicant: Kofax, Inc. | Inventors: Macciola Anthony;Shustorovich Alexander;Thrasher Christopher W.
Classification: G06K9/00;G06T7/40;H04N1/40 | Main classification: G06K9/00
Agency: Zilka-Kotab, PC | Agent: Zilka-Kotab, PC
Principal claim: 1. A method, comprising: capturing one or more of image data depicting a digital representation of a document and audio data relating to the digital representation of the document; defining a plurality of candidate edge points within the image data; removing one or more outlier candidate edge points from the plurality of candidate edge points; defining a second plurality of candidate edge points excluding the one or more outlier candidate edge points; and defining four sides of a tetragon based on one or more of the plurality of candidate edge points and the second plurality of candidate edge points, wherein each side of the tetragon corresponds to a different side of the document, and wherein the tetragon bounds the digital representation of the document.
Address: Irvine CA US
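
Illustrative sketch (not part of the patent record): the steps recited in claim 1 can be pictured as a small pipeline that takes candidate edge points per side, drops outliers, and derives a four-sided figure from what remains. The record does not say how outliers are identified or how the sides are fitted, so everything specific below is an assumption made for illustration: the total-least-squares line fit, the median-absolute-deviation rejection rule, the parameters `k` and `min_tol`, and the function names `fit_line`, `remove_outliers`, `intersect`, and `define_tetragon` are all hypothetical. The sketch also assumes the candidate points have already been grouped by side.

```python
import math
from statistics import median


def fit_line(points):
    """Total-least-squares line fit (an assumed technique, not the patent's).

    Returns (a, b, c) describing the line a*x + b*y = c, with (a, b) a unit normal.
    """
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points)
    syy = sum((y - my) ** 2 for _, y in points)
    sxy = sum((x - mx) * (y - my) for x, y in points)
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)  # angle of the principal axis
    a, b = -math.sin(theta), math.cos(theta)        # unit normal to that axis
    return a, b, a * mx + b * my


def remove_outliers(points, k=3.0, min_tol=0.5):
    """Return the 'second plurality' of points: the inputs minus outliers.

    Hypothetical rule: drop points whose residual from an initial fit deviates
    from the median residual by more than max(k * MAD, min_tol).
    """
    a, b, c = fit_line(points)
    residuals = [a * x + b * y - c for x, y in points]
    med = median(residuals)
    mad = median([abs(r - med) for r in residuals])
    tol = max(k * mad, min_tol)
    return [p for p, r in zip(points, residuals) if abs(r - med) <= tol]


def intersect(line1, line2):
    """Intersection of two lines given as (a, b, c), via Cramer's rule."""
    a1, b1, c1 = line1
    a2, b2, c2 = line2
    det = a1 * b2 - a2 * b1
    if abs(det) < 1e-12:
        raise ValueError("sides are parallel; no unique corner")
    return (c1 * b2 - c2 * b1) / det, (a1 * c2 - a2 * c1) / det


def define_tetragon(candidates):
    """candidates maps 'top', 'right', 'bottom', 'left' to lists of (x, y).

    Returns the four corners in the order top-left, top-right,
    bottom-right, bottom-left.
    """
    sides = {name: fit_line(remove_outliers(pts)) for name, pts in candidates.items()}
    return [
        intersect(sides["top"], sides["left"]),
        intersect(sides["top"], sides["right"]),
        intersect(sides["bottom"], sides["right"]),
        intersect(sides["bottom"], sides["left"]),
    ]
```

For example, calling `define_tetragon` with points sampled along the four sides of a rectangle, plus one stray point per side, should return corners close to the rectangle's vertices, because each stray point is rejected before the final fit. Fitting a line per side and intersecting adjacent lines is only one way to obtain four sides that bound the document; the claim itself permits the sides to be based on either the original or the filtered point set.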