Title of invention |
Session-based character recognition for document reconstruction |
Abstract |
Systems and methods are provided for sharing a screen from a mobile device. For example, a method includes receiving an image from a mobile device, performing recognition on the image to identify space-delimited strings, and generating a content graph for the image, the content graph having content nodes that represent at least some of the strings and the content graph having edges that represent a relative position of strings associated with the content nodes connected by the edges. The method may also include repeating the receiving, performing recognition, and generating for a plurality of images, the plurality of images belonging to a session, and generating a combined graph from the plurality of content graphs based on similarity of content nodes between content graphs, the combined graph representing text from the plurality of images in reading order. |
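The per-image step described in the abstract can be illustrated with a minimal sketch. It assumes recognition has already produced space-delimited strings with pixel coordinates; the `Word` type, the `build_content_graph` function, the `line_tolerance` parameter, and the `"right_of"` / `"below"` edge labels are hypothetical names chosen for illustration and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    """One recognized, space-delimited string (hypothetical type)."""
    text: str
    x: float  # left edge of the string, in pixels
    y: float  # top edge of the string, in pixels


def build_content_graph(words, line_tolerance=5.0):
    """Return (nodes, edges) for a single image.

    nodes: list of Word in reading order.
    edges: (i, j, relation) tuples over node indices, where relation
           records the position of node j relative to node i.
    """
    # Group words into text lines: words whose top edges are within
    # line_tolerance of the line's first word share a line (assumption).
    lines = []
    for w in sorted(words, key=lambda w: (w.y, w.x)):
        if lines and abs(lines[-1][0].y - w.y) <= line_tolerance:
            lines[-1].append(w)
        else:
            lines.append([w])
    # Order every line left to right before any edge is created.
    for line in lines:
        line.sort(key=lambda w: w.x)

    nodes, edges, line_starts = [], [], []
    for line in lines:
        line_starts.append(len(nodes))
        for k, w in enumerate(line):
            nodes.append(w)
            if k > 0:
                # Connect horizontally adjacent strings.
                edges.append((len(nodes) - 2, len(nodes) - 1, "right_of"))
    # Connect the first string of each line to the first of the next.
    for a, b in zip(line_starts, line_starts[1:]):
        edges.append((a, b, "below"))
    return nodes, edges
```

For example, `build_content_graph([Word("world", 60, 10), Word("hello", 0, 12), Word("again", 0, 40)])` places "hello" and "world" on one line and "again" on the next. A fuller implementation would likely use bounding boxes and richer relative-position edges than this sketch does.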
Publication number |
US9424668(B1) |
Publication date |
2016.08.23 |
Application number |
US201414532692 |
Filing date |
2014.11.04 |
Applicant |
Google Inc. |
Inventors |
Petrou David;Chaudhury Krishnendu;Goschin Sergiu;Bridges Matthew John |
Classification codes |
G06T11/20;G06T11/60 |
Main classification code |
G06T11/20 |
Patent agency |
Brake Hughes Bellermann LLP |
Agent |
Brake Hughes Bellermann LLP |
Principal claim |
1. A mobile device comprising:
at least one processor; and memory storing instructions that, when executed by the at least one processor, cause the mobile device to:
for a plurality of images associated with a session:
perform recognition on the image to identify space-delimited strings, and
generate a content graph for the image, the content graph having content nodes that represent at least some of the strings, an edge in the graph representing a relative position of the content nodes the edge connects;
generate a combined graph from the plurality of content graphs by clustering content nodes between the content graphs based on similarity metrics, wherein a cluster represents a node in the combined graph; and
select a best string from the cluster as a label for the node in the combined graph, wherein the combined graph represents text from the plurality of images in reading order. |
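The clustering and label-selection steps of the claim can be sketched as follows. This is a simplified illustration under stated assumptions, not the patented method: each image is reduced to a list of `(text, confidence)` pairs already in reading order, the similarity metric is the standard-library `difflib.SequenceMatcher` ratio, clustering is greedy against each cluster's first member, and the "best string" is the one with the highest summed confidence. The names `combine_session` and `best_string`, the confidence values, and the `threshold` default are all hypothetical. Relative-position edges are ignored here, so repeated words in a document would be merged incorrectly.

```python
from collections import defaultdict
from difflib import SequenceMatcher


def best_string(cluster):
    """Pick the label for a cluster: highest summed confidence (assumption)."""
    scores = defaultdict(float)
    for text, confidence in cluster:
        scores[text] += confidence
    return max(scores, key=scores.get)


def combine_session(images, threshold=0.75):
    """Merge per-image strings into one list of labels in reading order.

    images: list of images; each image is a list of (text, confidence)
            pairs in reading order.
    """
    clusters = []  # each cluster is a list of (text, confidence) pairs
    for image in images:
        for text, confidence in image:
            for cluster in clusters:
                representative = cluster[0][0]
                similarity = SequenceMatcher(None, representative, text).ratio()
                if similarity >= threshold:
                    # Similar enough: same node in the combined graph.
                    cluster.append((text, confidence))
                    break
            else:
                # No similar cluster yet: start a new combined-graph node.
                clusters.append([(text, confidence)])
    # Clusters are kept in first-seen order, which approximates reading
    # order when successive images overlap (e.g. while scrolling).
    return [best_string(cluster) for cluster in clusters]
```

Usage: with two overlapping screenshots, `combine_session([[("the", 0.9), ("quick", 0.9), ("br0wn", 0.6), ("fox", 0.9)], [("brown", 0.9), ("fox", 0.8), ("jumps", 0.9), ("over", 0.9)]])` groups the misrecognized "br0wn" with "brown" and keeps the higher-confidence spelling as the label.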
Address |
Mountain View CA US |