发明名称 Sharpness-based frame selection for OCR
摘要 A system to select video frames for optical character recognition (OCR) based on feature metrics associated with blur and sharpness. A device captures a video frame including text characters. An edge detection filter is applied to the frame to determine gradient features in perpendicular directions. An “edge map” is created from the gradient features, and points along edges in the edge map are identified. Edge transition widths are determined at each of the edge points based in local intensity minimum and maximum on opposite sides of the respective edge point in the frame. Sharper edges have smaller edge transition widths than blurry images. Statistics are determined from the edge transition widths, and the statistics are processed by a trained classifier to determine if the frame is or is not sufficiently sharp for text processing.
申请公布号 US9576210(B1) 申请公布日期 2017.02.21
申请号 US201414500005 申请日期 2014.09.29
申请人 AMAZON TECHNOLOGIES, INC. 发明人 Liu Yue;Yu Qingfeng;Liu Xing;Natarajan Pradeep
分类号 G06K9/18;G06K9/46;G06T5/00;G06T11/60;G06T5/10 主分类号 G06K9/18
代理机构 Seyfarth Shaw LLP 代理人 Seyfarth Shaw LLP ;Barzilay Ilan N.;Cartwright Tyrus S.
主权项 1. A system comprising: a camera; at least one processor; a display; and a first memory including instructions operable to be executed by the at least one processor to perform a set of actions to configure the at least one processor to: capture a frame of video via the camera;apply an edge detection filter to the captured frame to determine image intensity gradients in horizontal and vertical directions;combine the horizontal and vertical image intensity gradients at each point in the frame to produce an edge map of the frame, wherein each point of the edge map corresponds to a point in the captured frame;select a plurality of edge points of the captured frame by comparing corresponding points of the edge map to a threshold;determine an edge transition width for each edge point of the plurality of edge points by: determine a local intensity maximum and a local intensity minimum along a line intersecting the edge point, andcomputing the edge transition width as a distance between the local intensity minimum and the local intensity maximum along the line intersecting the edge point;determine a first statistic using the edge transitions width for each edge point of the plurality of edge points;determine a sharpness quality of the frame using the first statistic;based on the sharpness quality, transmit the frame to a server computing device for optical character recognition (OCR) processing;receive OCR results corresponding to text identified in the frame;determine, from the plurality of edge points, a set of edge points that are proximate to the text;determine respective edge transition widths for each edge point of the set of edge points;determine a second statistic using the respective edge transition widths;perform OCR on the text based on the second statistic; andoutput the OCR results via the display.
地址 Seattle WA US
您可能感兴趣的专利