Title: Camera based method for text input and keyword detection
Abstract: The present invention relates to a camera based method for text input and detection of a keyword or of a text-part within a printed page or a screen, comprising the steps of: directing a camera module on the printed page and capturing an image thereof; digital image filtering of the captured image; detection of word blocks contained in the image, each word block containing most likely a recognizable word; performing OCR within each word block; determination of A-blocks among the word blocks according to a keyword probability determination rule, wherein each of the A-blocks contains most likely the keyword; assignment of an attribute to each A-block; indication of the A-blocks in the display by a frame or the like for a further selection of the keyword; further selection of the A-block containing the keyword based on the displayed attribute of the keyword; forwarding the text content as text input to an application.
Publication number: US9589198(B2) Publication date: 2017.03.07
Application number: US201514639549 Filing date: 2015.03.05
Applicant: Nuance Communications, Inc. Inventors: Goktekin Cuneyt; Tenchio Oliver
Classification: H04N5/228; G06K9/18; G06K9/00; G06K9/32; H04N1/00; H04N1/32; G06T5/00; H04N5/232; H04N7/18 Main classification: H04N5/228
Agency: Hamilton, Brook, Smith & Reynolds, P.C. Agent: Hamilton, Brook, Smith & Reynolds, P.C.
Main claim: 1. A method for text input and detection of a keyword within a printed page or a screen, the method comprising: detecting a plurality of word blocks contained in a captured image of a printed page or a screen; determining candidate keyword blocks among the plurality of word blocks according to a keyword probability determination rule that results in a respective probability value for each word block, wherein the keyword probability determination rule is based at least in part on a spatial analysis of the word blocks relative to at least a portion of the captured image, the spatial analysis being relative to an indication of a user intention of a keyword to obtain for selection and where the candidate keyword blocks are identified among the plurality of word blocks based upon each respective probability value of the candidate keyword blocks being above a threshold; and upon selection of a candidate keyword block, forwarding content of the selected keyword block as text input to an application.
Address: Burlington MA US
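
Illustrative sketch (not part of the patent record): the main claim describes scoring each detected word block with a keyword probability rule based on spatial analysis, keeping the blocks whose probability exceeds a threshold, and forwarding the selected block's content to an application. The Python below is one possible reading of those steps, not the patented implementation. The record does not specify the rule, so everything here is an assumption: the `WordBlock` type, the function names, the choice of a focus point (for example a crosshair at the image center) as the indication of user intention, the linear distance-based score, and the 0.8 threshold are all hypothetical. Word block detection and OCR are assumed to have already happened.

```python
import math
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class WordBlock:
    """Hypothetical word block: OCR text plus its bounding box in pixels."""
    text: str
    x: float  # left edge
    y: float  # top edge
    w: float  # width
    h: float  # height

    def center(self) -> Tuple[float, float]:
        return (self.x + self.w / 2, self.y + self.h / 2)


def keyword_probability(block: WordBlock,
                        focus: Tuple[float, float],
                        image_size: Tuple[float, float]) -> float:
    """Assumed spatial rule: 1.0 at the focus point, falling linearly
    to 0.0 at a distance equal to the image diagonal."""
    cx, cy = block.center()
    dist = math.hypot(cx - focus[0], cy - focus[1])
    diag = math.hypot(image_size[0], image_size[1])
    return max(0.0, 1.0 - dist / diag)


def candidate_blocks(blocks: List[WordBlock],
                     focus: Tuple[float, float],
                     image_size: Tuple[float, float],
                     threshold: float = 0.8) -> List[Tuple[float, WordBlock]]:
    """Keep blocks whose probability is above the threshold,
    ordered from most to least likely."""
    scored = [(keyword_probability(b, focus, image_size), b) for b in blocks]
    kept = [(p, b) for p, b in scored if p > threshold]
    return sorted(kept, key=lambda pb: pb[0], reverse=True)


def forward_selected(block: WordBlock,
                     application: Callable[[str], None]) -> None:
    """Forward the selected block's text as input to the application."""
    application(block.text)


if __name__ == "__main__":
    image_size = (640.0, 480.0)
    focus = (320.0, 240.0)  # assumed: user aims the image center at the keyword
    blocks = [
        WordBlock("camera", 290, 230, 60, 20),
        WordBlock("keyword", 300, 270, 80, 20),
        WordBlock("footer", 10, 450, 60, 20),
    ]
    candidates = candidate_blocks(blocks, focus, image_size)
    for p, b in candidates:
        print(f"{b.text}: {p:.3f}")
    # The user picks the first candidate; here the "application" is print.
    forward_selected(candidates[0][1], print)
```

In this sketch the focus point stands in for the claim's "indication of a user intention"; a different indicator (a tap position, a pointer, a cursor) could be substituted by passing other coordinates as `focus`.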