Title of invention: Near duplicate images
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining image search results. One of the methods includes generating a plurality of feature vectors for each image in a collection of images, wherein each feature vector is associated with an image tile of an image, wherein each feature vector corresponds to one of a plurality of predetermined visual words. All images in the collection of images that share at least a threshold number of matching visual words associated with matching image tiles are classified as near-duplicate images.
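The classification step in the abstract can be sketched as follows. This is an illustrative sketch only, not the patented implementation: it assumes each image has already been reduced to a set of (tile index, visual word id) pairs, and it interprets "share at least a threshold number" as a pairwise comparison. The function name `near_duplicate_pairs` and the sample data are hypothetical.

```python
from itertools import combinations


def near_duplicate_pairs(signatures, threshold):
    """Return pairs of image ids sharing >= threshold (tile, word) matches.

    signatures: dict mapping image id -> set of (tile_index, visual_word_id).
    A match requires the same visual word in the same tile, so the set
    intersection counts exactly the matching words in matching tiles.
    """
    pairs = []
    for a, b in combinations(sorted(signatures), 2):
        shared = len(signatures[a] & signatures[b])
        if shared >= threshold:
            pairs.append((a, b))
    return pairs


# Hypothetical sample data: img_c has the same words as img_a, but in
# different tiles, so it does not count as a near duplicate.
signatures = {
    "img_a": {(0, 17), (1, 4), (2, 9), (3, 30)},
    "img_b": {(0, 17), (1, 4), (2, 9), (3, 12)},
    "img_c": {(0, 4), (1, 17), (2, 30), (3, 9)},
}
result = near_duplicate_pairs(signatures, threshold=3)
```

Comparing every pair is quadratic in the number of images; a large collection would more plausibly use an inverted index keyed on (tile, word), which is outside what the abstract states.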
Publication number: US9063954(B2) | Publication date: 2015.06.23
Application number: US201313832122 | Filing date: 2013.03.15
Applicant: Google Inc. | Inventors: Ioffe Sergey; Aly Mohamed; Rosenberg Charles J.
Classification: G06F17/30; G06K9/46; G06K9/62 | Main classification: G06F17/30
Agency: Fish & Richardson P.C. | Agent: Fish & Richardson P.C.
Principal claim: 1. A computer-implemented method comprising: generating a plurality of feature vectors for each image in a collection of images, wherein each feature vector is associated with an image tile of an image, wherein each feature vector corresponds to one of a plurality of predetermined visual words and wherein generating a feature vector for a particular image in the collection of images comprises: determining a feature region in the particular image; computing the feature vector from the feature region in the particular image; quantizing the feature vector to one of the plurality of visual words; determining an image tile to which the feature region is located; associating the visual word with the image tile for the feature region; and classifying as near-duplicate images all images in the collection of images that share at least a threshold number of matching visual words associated with matching image tiles.
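The per-image steps of the claim (quantize the feature vector to a visual word, find the tile containing the feature region, associate the two) might look like the sketch below. It rests on several assumptions the claim does not state: feature regions are represented by a centre point, quantization is nearest-centroid by squared Euclidean distance, and tiles form a uniform grid. Region detection and descriptor computation are taken as given inputs. All function names are hypothetical.

```python
def nearest_visual_word(vector, vocabulary):
    """Quantize a feature vector to the index of the closest visual word.

    Assumption: visual words are centroids and closeness is squared
    Euclidean distance; the claim does not specify the quantizer.
    """
    def sq_dist(centroid):
        return sum((x - y) ** 2 for x, y in zip(vector, centroid))

    return min(range(len(vocabulary)), key=lambda i: sq_dist(vocabulary[i]))


def tile_index(x, y, width, height, grid=2):
    """Map a point to a tile index in a grid x grid layout (row-major).

    Assumption: tiles are a uniform grid; points on the right or bottom
    edge are clamped into the last column or row.
    """
    col = min(int(x * grid / width), grid - 1)
    row = min(int(y * grid / height), grid - 1)
    return row * grid + col


def image_signature(features, width, height, vocabulary, grid=2):
    """Build the set of (tile_index, visual_word_id) pairs for one image.

    features: list of ((x, y), vector), where (x, y) is the centre of a
    detected feature region and vector is its precomputed descriptor.
    """
    return {
        (tile_index(x, y, width, height, grid),
         nearest_visual_word(vec, vocabulary))
        for (x, y), vec in features
    }


# Hypothetical toy data: three 2-D visual words and three feature regions
# in a 100 x 100 image.
vocabulary = [(0.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
features = [
    ((10, 10), (0.1, 0.0)),
    ((90, 10), (0.9, 1.1)),
    ((10, 90), (0.1, 0.8)),
]
signature = image_signature(features, 100, 100, vocabulary)
```

Two images whose signatures share at least the threshold number of pairs would then be candidates for the near-duplicate classification in the final step of the claim.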
Address: Mountain View CA US