发明名称 Database for mixed media document system
摘要 A Mixed Media Reality (MMR) system and associated techniques are disclosed. The MMR system provides mechanisms for forming a mixed media document that includes media of at least two types (e.g., printed paper as a first medium and digital content and/or web link as a second medium). In one particular embodiment, the MMR system includes a content-based retrieval database configured with an index table to represent two-dimensional geometric relationships between objects extracted from a printed document in a way that allows look-up using a text-based index. A ranked set of document, page and location hypotheses can be computed given data from the index table. The techniques effectively transform features detected in an image patch into textual terms (or other searchable features) that represent both the features themselves and the geometric relationship between them. A storage facility can be used to store additional characteristics about each document image patch.
申请公布号 US9405751(B2) 申请公布日期 2016.08.02
申请号 US200611461164 申请日期 2006.07.31
申请人 Ricoh Co., Ltd. 发明人 Hull Jonathan J.;Lee Dar-Shyang;Piersol Kurt W.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Patent Law Works LLP 代理人 Patent Law Works LLP
主权项 1. A database system for providing mixed media documents, comprising: one or more processors; an index table, stored on a memory and accessible by the one or more processors, that stores electronic descriptions of features extracted from paper documents, wherein the features include word bounding boxes, feature location information for the features, and association information for each of the paper documents and locations with a mixed media document that combines printed and digital media; a feature extraction module, stored on the memory and executable by the one or more processors to: receive an image patch;determine word bounding boxes from the image patch by aligning the image patch with a horizontal axis, detecting text lines in the image patch based on the aligned image patch, locating an area within each text line that is above a threshold as a word, and identifying the bounding boxes for words within the text lines;generate a query from the image patch, at least one query term of the query comprising a two-dimensional geometric relationship between the word bounding boxes determined from the image patch, the two-dimensional geometric relationship specifying one or more of a direction, an angle, a distance between the word bounding boxes determined from the image patch, and geometric shape and contour of the word bounding boxes; and an accumulator module, stored on the memory and executable by the one or more processors to: locate at least one mixed media document that contains the word bounding boxes determined from the image patch; anddetermine that the at least one mixed media document is a potential match to the query based on determining a two-dimensional geometric relationship between the features stored in the index table, comparing the two-dimensional geometric relationship between the word bounding boxes determined from the image patch with the two-dimensional geometric relationship between the features stored in the index table, computing a matching score for the at least one mixed media document, and returning the at least one mixed media document as a match to the query if the matching score is above a threshold.
地址 Tokyo JP