Title: Method of Perspective Correction For Devanagari Text
Abstract: An electronic device and method identify regions that are likely to be text in a natural image or video frame, then process each selected text region as follows. Lines that are nearly vertical, i.e. oriented relative to the vertical axis within a predetermined range −max_theta to +max_theta, are automatically identified in the region; an angle θ of the identified lines is determined; and the angle θ is used to perform perspective correction by warping the selected text region. After perspective correction in this manner, each text region is processed further to recognize the text therein, by performing OCR on each block in a sequence of blocks obtained by slicing the potential text region. The result of text recognition is then used to display to the user either the recognized text or any other information obtained by use of the recognized text.
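The angle-estimation step in the abstract can be illustrated with a minimal sketch. It assumes the candidate lines are already available as segments `(x0, y0, x1, y1)` in image coordinates, and it combines the near-vertical angles with a median; the record does not say how θ is derived from the identified lines, so the median, the function names, and the default `max_theta` of 30 degrees are all illustrative assumptions.

```python
import math
from statistics import median

def line_angle_from_vertical(x0, y0, x1, y1):
    """Signed angle in degrees between a segment and the vertical axis."""
    dx, dy = x1 - x0, y1 - y0
    if dy < 0:
        # Orient the segment so it always points down the image.
        dx, dy = -dx, -dy
    return math.degrees(math.atan2(dx, dy))

def estimate_theta(segments, max_theta=30.0):
    """Estimate the slant angle from segments within +/- max_theta of vertical.

    Assumption: the median is used to combine the angles; the source only
    states that an angle theta of the identified lines is determined.
    """
    angles = [line_angle_from_vertical(*s) for s in segments]
    near_vertical = [a for a in angles if -max_theta <= a <= max_theta]
    if not near_vertical:
        return 0.0  # no usable lines: leave the region uncorrected
    return median(near_vertical)

# Two slightly slanted strokes and one horizontal line (the header line
# of a Devanagari word would be rejected by the range test in this way).
segments = [(0, 0, 1, 10), (5, 0, 6, 10), (0, 0, 10, 0)]
theta = estimate_theta(segments)
```

A median is chosen here only because it tolerates a few spurious line detections; a mean or a histogram peak would fit the same description.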
Publication number: US2014161365(A1) Publication date: 2014.06.12
Application number: US201313842985 Filing date: 2013.03.15
Applicant: QUALCOMM INCORPORATED Inventors: Acharya Hemanth P.; Baheti Pawan Kumar
Classification: G06K9/00 Main classification: G06K9/00
Agency: Agent:
Main claim: 1. A method to improve automatic recognition of text, the method comprising: receiving a plurality of regions of text in an image of a scene of real world captured by a camera; wherein a plurality of pixels of a common binary value, in a word in a region in said plurality of regions of text, are arranged along a first line oriented in a predetermined direction; wherein a first height at a first end of said word along said predetermined direction is different from a second height at a second end of said word along said predetermined direction; detecting a plurality of second lines that satisfy at least a predetermined test and pass through a portion of the word having a predetermined relationship to said first line; determining an angle θ based on a plurality of angles of the plurality of second lines relative to a common direction; using the angle θ to change first coordinates of at least said plurality of pixels in said word, whereby the first height and the second height remain unchanged after the using; and storing in a memory, at least changed first coordinates generated by the using; wherein the receiving, the processing, the determining, the using and the storing are performed by one or more processors.
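The coordinate-changing step of the claim can be sketched as a horizontal shear, which is one transformation that changes the first (x) coordinates while leaving heights unchanged. The claim does not name the transformation, so the shear, the function name `shear_coordinates`, and the `y_ref` parameter are assumptions made for illustration; θ is taken as the signed angle from the vertical axis, in degrees.

```python
import math

def shear_coordinates(pixels, theta_deg, y_ref=0.0):
    """Shift each pixel's x coordinate so lines slanted by theta become vertical.

    Only x changes; y is returned untouched, so the height of the word at
    either end is the same before and after the shear.
    """
    t = math.tan(math.radians(theta_deg))
    return [(x - (y - y_ref) * t, y) for (x, y) in pixels]

# A stroke running from (0, 0) to (1, 10) leans by atan(0.1) from vertical.
lean = math.degrees(math.atan(0.1))
corrected = shear_coordinates([(0.0, 0.0), (1.0, 10.0)], lean)
```

After the shear both endpoints share the same x coordinate, and their y coordinates are the ones they started with.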
Address: San Diego, CA, US