Title of invention: Retrieval system and method leveraging category-level labels
Abstract: An instance-level retrieval method and system are provided. A representation of a query image is embedded in a multi-dimensional space using a learned projection. The projection is learned using category-labeled training data to optimize a classification rate on the training data. The joint learning of the projection and the classifiers improves the computation of similarity/distance between images by embedding them in a subspace where the similarity computation outputs more accurate results. An input query image can thus be used to retrieve similar instances in a database by computing the comparison measure in the embedding space.
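The retrieval step described in the abstract can be illustrated with a minimal sketch. It assumes the learned projection is a matrix U applied to the original representations and that the comparison measure is a dot product in the embedding space; the record itself only says "similarity/distance", so both choices, along with the function names embed and retrieve, are hypothetical.

```python
import numpy as np

def embed(U, x):
    """Project an original representation x (shape D) into the embedding space (shape R)."""
    return U @ x

def retrieve(U, query, database, top_k=3):
    """Rank database rows by dot-product similarity to the query, computed in the embedding space.

    The dot product is an assumed comparison measure, not one named in the record.
    """
    q = embed(U, query)            # embedded query, shape (R,)
    db = database @ U.T            # embedded database, shape (N, R)
    scores = db @ q                # one similarity score per database item
    order = np.argsort(-scores, kind="stable")[:top_k]
    return order, scores[order]
```

In practice the database embeddings would typically be computed once and cached, so that each query costs one projection plus N dot products.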
Publication number: US9075824(B2); Publication date: 2015.07.07
Application number: US201213458183; Filing date: 2012.04.27
Applicant: XEROX CORPORATION; Inventors: Gordo Albert; Rodriguez Serrano Jose Antonio; Perronnin Florent
Classification: G06F15/18; G06F17/30; G06K9/62; Main classification: G06F15/18
Agency: Fay Sharpe LLP; Agent: Fay Sharpe LLP
Principal claim: 1. A retrieval method comprising: learning a projection for embedding an original image representation in an embedding space, the original image representation being based on features extracted from the image, the projection being learned from category-labeled training data to optimize a classification rate on the training data, the learning of the projection including, for a plurality of iterations: selecting a sample from the training data; embedding the sample with a current projection; scoring the embedded sample with current first and second classifiers, the first classifier corresponding to a category of the label of the sample, the second classifier corresponding to a different category, selected from a set of categories; updating the current projection and at least one of the current first and second classifiers for iterations where the second classifier generates a higher score than the first classifier, the updated projection serving as the current projection for a subsequent iteration, each of the updated classifiers serving as the current classifier for the respective category for a subsequent iteration; and storing one of the updated projections as the learned projection; and with a processor, for each of a plurality of database images, computing a comparison measure between a query image and the database image, the comparison measure being computed in the embedding space, respective original image representations of the query image and the database image being embedded in the embedding space with the projection; and providing for retrieving at least one of the database images based on the comparison.
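The iterative learning recited in the claim can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the classifiers are assumed to be linear in the embedding space, the competing category is drawn uniformly at random, and the update is a plain gradient step on the score gap. The names sgd_step and learn_projection and the parameters lr, margin and seed are hypothetical; margin=0.0 reproduces the claim's condition that the second classifier scores higher than the first.

```python
import numpy as np

def sgd_step(U, W, x, pos, neg, lr):
    """Update projection U and classifiers W[pos], W[neg] in place for one violated sample."""
    z = U @ x                      # sample embedded with the current projection
    d = W[pos] - W[neg]            # classifier difference, taken before any update
    U += lr * np.outer(d, x)       # adjust the projection to widen the score gap
    W[pos] += lr * z               # raise the score of the labeled category
    W[neg] -= lr * z               # lower the score of the competing category

def learn_projection(X, y, n_categories, dim, n_iters=1000, lr=0.05, margin=0.0, seed=0):
    """Jointly learn a projection U (dim x D) and one linear classifier per category."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    U = rng.normal(scale=1.0 / np.sqrt(d), size=(dim, d))   # current projection
    W = rng.normal(scale=0.01, size=(n_categories, dim))    # current classifiers
    for _ in range(n_iters):
        i = rng.integers(n)                                  # select a training sample
        pos = int(y[i])                                      # category of its label
        neg = int(rng.choice([c for c in range(n_categories) if c != pos]))
        z = U @ X[i]                                         # embed with current projection
        # Update only when the competing classifier scores higher (plus optional margin).
        if W[neg] @ z + margin > W[pos] @ z:
            sgd_step(U, W, X[i], pos, neg, lr)
    return U, W                                              # store the last projection
```

The returned U would then be used to embed query and database representations before computing the comparison measure. For a small enough step size, each update increases the gap between the score of the labeled category and that of the competing category on the sampled item.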
Address: Norwalk, CT, US