发明名称 IDENTIFYING MATCHING CANONICAL DOCUMENTS IN RESPONSE TO A VISUAL QUERY
摘要 <p>A server system receives a visual query from a client system. The visual query is an image containing text such as a picture of a document. At the receiving server or another server, optical character recognition (OCR) is performed on the visual query to produce text recognition data representing textual characters. Each character in a contiguous region of the visual query is individually scored according to its quality. The quality score of a respective character is influenced by the quality scores of neighboring or nearby characters. Using the scores, one or more high quality strings of characters are identified. Each high quality string has a plurality of high quality characters. A canonical source document matching the visual query that contains the one or more high quality textual strings is identified and retrieved. Then at least a portion of the canonical document is sent to the client system.</p>
申请公布号 CA2819369(A1) 申请公布日期 2012.06.07
申请号 CA20112819369 申请日期 2011.12.01
申请人 GOOGLE INC. 发明人 PETROU, DAVID;POPAT, ASHOK C.;CASEY, MATTHEW R.
分类号 G06K9/72;G06F17/30 主分类号 G06K9/72
代理机构 代理人
主权项
地址