发明名称 Identifying matching canonical documents in response to a visual query
摘要 A server system receives a visual query from a client system. The visual query is an image containing text such as a picture of a document. At the receiving server or another server, optical character recognition (OCR) is performed on the visual query to produce text recognition data representing textual characters. Each character in a contiguous region of the visual query is individually scored according to its quality. The quality score of a respective character is influenced by the quality scores of neighboring or nearby characters. Using the scores, one or more high quality strings of characters are identified. Each high quality string has a plurality of high quality characters. A canonical document containing the one or more high quality textual strings is retrieved. At least a portion of the canonical document is sent to the client system.
申请公布号 US9183224(B2) 申请公布日期 2015.11.10
申请号 US201012852189 申请日期 2010.08.06
申请人 Google Inc. 发明人 Petrou David;Popat Ashok C.;Casey Matthew R.
分类号 G06K9/18;G06F17/30;G06K9/03;G06K9/00;G06K9/72 主分类号 G06K9/18
代理机构 Fish & Richardson P.C. 代理人 Fish & Richardson P.C.
主权项 1. A computer-implemented method of processing a visual query comprising: on a server system having one or more processors and memory storing one or more programs for execution by the one or more processors: receiving a visual query from a client system;performing optical character recognition (OCR) on the visual query to produce text recognition data representing a plurality of textual characters, including a plurality of textual characters in a contiguous region of the visual query;scoring each textual character in the plurality of textual characters based on at least high quality textual character scores and low quality textual character scores of other textual characters surrounding the textual character, wherein the scoring of each textual character is based, in part, on a transition cost associated with each textual character such that each textual character will score more similarly to the other textual characters surrounding the textual character as the transition cost increases;identifying one or more high quality text segments in the contiguous region of the visual query in accordance with a determination that a respective high quality text segment of the one or more high quality text segments comprises a plurality of high scoring textual characters;identifying a document containing at least one high quality text segment of the one or more high quality text segments;retrieving the document containing the at least one high quality text segment; andsending at least a portion of the document to the client system.
地址 Mountain View CA US