Title of invention: Processing digital images including character recognition using ontological rules
Abstract: Embodiments of methods, systems, and storage medium associated with processing of digital images including character recognition are disclosed herein. In one instance, the method may include identifying at least some components of a plurality of characters included in a digital image of content, based at least in part on comparison of a vector representation of each component with predefined component shape patterns; and determining one or more characters from the identified components. The determining may be based at least in part on evaluating the identified components using predetermined combination rules that define the one or more characters based at least in part on relationships between the one or more components in the identified plurality of characters. Other embodiments may be described and/or claimed.
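The comparison of a component's vector representation with predefined component shape patterns can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the pattern names, the four-element feature vectors, the `identify_component` helper, and the 0.9 threshold do not come from the patent. Cosine similarity is used here as a simple stand-in for the matching step; the principal claim describes a trained neural network for that purpose.

```python
import math

# Hypothetical shape patterns: label -> fixed-length feature vector.
# Neither the labels nor the vectors come from the patent.
PATTERNS = {
    "vertical_stroke": [0.0, 1.0, 0.0, 1.0],
    "horizontal_stroke": [1.0, 0.0, 1.0, 0.0],
    "dot": [0.5, 0.5, 0.5, 0.5],
}


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def identify_component(vector, patterns=PATTERNS, threshold=0.9):
    """Return the label of the best-matching pattern, or None if no
    pattern reaches the (assumed) similarity threshold."""
    best_label, best_score = None, threshold
    for label, pattern in patterns.items():
        score = cosine_similarity(vector, pattern)
        if score >= best_score:
            best_label, best_score = label, score
    return best_label
```

Calling `identify_component([0.1, 0.9, 0.0, 1.0])` selects the hypothetical `vertical_stroke` pattern, while a vector that resembles none of the patterns yields `None` and would be left unidentified.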
Publication number: US9208381(B1) Publication date: 2015.12.08
Application number: US201213714326 Filing date: 2012.12.13
Applicant: Amazon Technologies, Inc. Inventors: Shanmugasundaram Satishkumar Kothandapani;Jayakar Niranjan
Classification: G06K9/72;G06K9/00 Main classification: G06K9/72
Agency: Schwabe Williamson & Wyatt PC Agent: Schwabe Williamson & Wyatt PC
Principal claim: 1. A computer-implemented method for processing a digital image including recognizing characters with one or more components, comprising: obtaining, with a computing device, a digital image of content including multiple characters, wherein a first character of the multiple characters includes components; generating, with the computing device, a vectorized version of the digital image of the content, wherein the vectorized version of the digital image has one or more vector representations corresponding to the components; identifying, with the computing device, a first component of the components in the digital image, the identifying including evaluating the vectorized representations of the one or more components with a neural network having been trained to recognize components by matching the first component to one or more predefined component shape patterns, wherein a first component shape pattern of the one or more component shape patterns visually represents a component shape; determining, with the computing device, one or more characters from the first component, the determining including evaluating the identified components using a plurality of combination rules configured to detect relationships between the first component and a second identified component based at least in part on spatial positions of the first component relative to the second component, and to determine the characters from the detected relationships, the determining further including determining the characters based in part on positions of the characters on a page of text in the digital image of the content; and sending, with the computing device, character codes for the determined one or more characters, the character codes specified by the plurality of combination rules.
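The combination-rule step of the claim, in which the spatial position of one identified component relative to another determines a character and its character code, might be sketched as follows. This is an illustrative assumption rather than the claimed implementation: the `Component` type, the `relation` helper, the rule table, and the component labels are all hypothetical, and the page-position factor mentioned in the claim is omitted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Component:
    """An identified component (hypothetical representation)."""
    label: str
    x: float  # center x in image coordinates
    y: float  # center y, increasing downward


def relation(a, b):
    """Classify the position of component a relative to component b."""
    dx, dy = a.x - b.x, a.y - b.y
    if abs(dy) >= abs(dx):
        return "above" if dy < 0 else "below"
    return "left_of" if dx < 0 else "right_of"


# Hypothetical combination rules:
# (first label, spatial relation, second label) -> Unicode code point.
RULES = {
    ("dot", "above", "vertical_stroke"): 0x0069,                # 'i'
    ("horizontal_stroke", "above", "vertical_stroke"): 0x0054,  # 'T'
}


def determine_character(first, second, rules=RULES):
    """Return the character code specified by the matching rule, or None."""
    return rules.get((first.label, relation(first, second), second.label))
```

For example, a `dot` centered above a `vertical_stroke` matches the first rule and yields the code point for the letter i; a pair with no matching rule returns `None`. Storing the character code in the rule itself mirrors the claim's statement that the codes are specified by the combination rules.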
Address: Reno NV US