Title of Invention |
Camera Based Method For Text Input And Keyword Detection |
Abstract |
The present invention relates to a camera based method for text input and detection of a keyword or of a text-part within a printed page or a screen, comprising the steps of: directing a camera module on the printed page or the screen and capturing an image thereof; digital image filtering of the captured image; detection of word blocks contained in the image, each word block most likely containing a recognizable word; performing OCR within each word block; determination of A-blocks among the word blocks according to a keyword probability determination rule, wherein each of the A-blocks most likely contains the keyword; assignment of an attribute to each A-block; indication of the A-blocks in the display by a frame or the like for a further selection of the keyword; further selection of the A-block containing the keyword based on the displayed attribute of the keyword; forwarding the text content as text input to an application. |
Publication Number |
US2015278621(A1) |
Publication Date |
2015.10.01 |
Application Number |
US201514639549 |
Filing Date |
2015.03.05 |
Applicant |
Nuance Communications, Inc. |
Inventors |
Goktekin Cuneyt; Tenchio Oliver |
Classification |
G06K9/18; H04N7/18; H04N5/232; G06T5/00 |
Main Classification |
G06K9/18 |
Agency |
|
Agent |
|
Main Claim |
1. Camera based method for text input and detection of a keyword or of a text-part within a printed page or a screen containing the keyword/text-part, comprising the steps of:
a) directing a camera module on the printed page or on the screen and continuously displaying the image acquired by the camera module;
b) capturing an image with the camera module, containing the keyword or text-part therein, and displaying the captured image;
c) digital image filtering of the captured image including contrast enhancement, shadow compensation, unwarping and rotation of the captured image in order to obtain an artifact reduced image with a substantially horizontal text alignment;
d) detection of word blocks contained in the artifact reduced image, each word block most likely containing a recognizable word;
e) performing OCR within each word block to get its text content;
f) determination of A-blocks among the word blocks according to a keyword probability determination rule, wherein each A-block most likely contains the keyword, such that the A-blocks are preselected among the word blocks for a further selection;
g) assignment of an attribute to each A-block, which can preferably be an enumerated number;
h) indication of the A-blocks in the display by a frame or a background color and displaying their attributes as overlays within the artifact reduced and displayed image for the further selection of the keyword;
i) further selection of an A-block containing the keyword based on the displayed attribute of the keyword;
j) displaying the text content of the further selected A-block and forwarding the text content as text input to a current application running on a mobile communication device containing the camera or to a camera connected electronic device. |
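Steps f) to i) of the claim can be illustrated with a minimal Python sketch. The claim does not specify the keyword probability determination rule, so the rule used here is purely an assumption for illustration: stop words and very short words are excluded, and longer words closer to the image centre score higher. The names `WordBlock`, `keyword_score`, `select_a_blocks`, `further_select`, and `STOP_WORDS` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from math import hypot
from typing import Dict, List, Tuple

# Hypothetical filler-word list; the claim does not define one.
STOP_WORDS = {"the", "a", "an", "of", "and", "or", "to", "in", "on", "is"}


@dataclass(frozen=True)
class WordBlock:
    """A word block after OCR (step e): recognized text plus block centre."""
    text: str
    x: float  # horizontal centre of the block, in pixels
    y: float  # vertical centre of the block, in pixels


def keyword_score(block: WordBlock, focus: Tuple[float, float], diag: float) -> float:
    """Assumed stand-in for the 'keyword probability determination rule'."""
    word = block.text.strip().lower()
    if len(word) < 3 or word in STOP_WORDS or not word.isalpha():
        return 0.0  # unlikely to be the keyword
    # Distance from the image centre, normalized by the image diagonal.
    distance = hypot(block.x - focus[0], block.y - focus[1]) / diag
    return len(word) / (1.0 + 10.0 * distance)


def select_a_blocks(blocks: List[WordBlock],
                    image_size: Tuple[int, int],
                    max_blocks: int = 3) -> Dict[int, WordBlock]:
    """Steps f) and g): preselect A-blocks and assign enumerated attributes."""
    width, height = image_size
    focus = (width / 2, height / 2)
    diag = hypot(width, height)
    scored = [(keyword_score(b, focus, diag), b) for b in blocks]
    candidates = [(s, b) for s, b in scored if s > 0]
    candidates.sort(key=lambda sb: sb[0], reverse=True)
    # Attribute 1 goes to the most likely block, 2 to the next, and so on.
    return {i: b for i, (_, b) in enumerate(candidates[:max_blocks], start=1)}


def further_select(a_blocks: Dict[int, WordBlock], attribute: int) -> str:
    """Step i): the user picks an A-block by its displayed attribute."""
    return a_blocks[attribute].text


blocks = [
    WordBlock("the", 50, 50),
    WordBlock("camera", 50, 50),
    WordBlock("keyword", 90, 90),
    WordBlock("of", 10, 10),
    WordBlock("text", 10, 10),
]
a_blocks = select_a_blocks(blocks, image_size=(100, 100), max_blocks=2)
chosen = further_select(a_blocks, 2)  # text forwarded as input in step j)
```

In a real device the attributes would be drawn as overlays on the displayed image (step h) and the selection would come from a key press or touch; here the selection is simply a dictionary lookup.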
Address |
Burlington MA US |