发明名称 Camera Based Method For Text Input And Keyword Detection
摘要 The present invention relates to a camera based method for text input and detection of a keyword or of a text-part within page or a screen comprising the steps of: directing a camera module on the printed page and capturing an image thereof; digital image filtering of the captured image; detection of word blocks contained in the image, each word block containing most likely a recognizable word; performing OCR within each word block; determination of A-blocks among the word blocks according to a keyword probability determination rule, wherein each of the A-blocks contains most likely the keyword; assignment of an attribute to each A-block; indication of the A-blocks in the display by a frame or the like for a further selection of the keyword; further selection of the A-block containing the keyword based on the displayed attribute of the keyword; forwarding the text content as text input to an application.
申请公布号 US2015278621(A1) 申请公布日期 2015.10.01
申请号 US201514639549 申请日期 2015.03.05
申请人 Nuance Communications, Inc. 发明人 Goktekin Cuneyt;Tenchio Oliver
分类号 G06K9/18;H04N7/18;H04N5/232;G06T5/00 主分类号 G06K9/18
代理机构 代理人
主权项 1. Camera based method for text input and detection of a keyword or of a text-part within a printed page or a screen containing the keyword/text-part comprising the steps of: a) directing a camera module on the printed page or on the screen and continuously displaying the image acquired by the camera module; b) capturing an image with the camera module, containing the keyword or text-part therein and displaying the captured image; c) digital image filtering of the captured image including contrast enhancement, shadow compensation, unwarping and rotation of the captured image in order to obtain an artifact reduced image with a substantially horizontal text alignment; d) detection of word blocks contained in the artifact reduced image, each word block containing most likely a recognizable word; e) performing OCR within each word block to get its text content; f) determination of A-blocks among the word blocks according to a keyword probability determination rule, wherein each A-block containing most likely the keyword, such that the A-blocks are preselected among the word blocks for a further selection; g) assignment to each A-block an attribute, which can preferably be an enumerated number; h) indication of the A-blocks in the display by a frame or a background color and displaying their attributes as overlays within the artifact reduced and displayed image for the further selection of the keyword; i) further selection of an A-block containing the keyword based on the displayed attribute of the keyword; j) displaying the text content of the further selected A-block and forwarding the text content as text input to a current application running on a mobile communication device containing the camera or to a camera connected electronic device.
地址 Burlington MA US