发明名称 |
Learning beautiful and ugly visual attributes |
摘要 |
A method for learning visual attribute labels for images includes, from textual comments associated with a corpus of images, identifying a set of candidate textual labels that are predictive of aesthetic scores associated with images in the corpus. The candidate labels in the set are clustered into a plurality of visual attribute clusters based on similarity and each of the clusters assigned a visual attribute label. For each of the visual attribute labels, a classifier is trained using visual representations of images in the corpus and respective visual attribute labels. The visual attribute labels are evaluated, based on performance of the trained classifier. A subset of the visual attribute labels is retained, based on the evaluation. The visual attribute labels can be used in processes such as image retrieval, image labeling, and the like. |
申请公布号 |
US9082047(B2) |
申请公布日期 |
2015.07.14 |
申请号 |
US201313971092 |
申请日期 |
2013.08.20 |
申请人 |
XEROX CORPORATION |
发明人 |
Marchesotti Luca |
分类号 |
G06K9/62 |
主分类号 |
G06K9/62 |
代理机构 |
Fay Sharpe LLP |
代理人 |
Fay Sharpe LLP |
主权项 |
1. A method for learning visual attribute labels for images comprising:
from textual comments associated with a corpus of images, identifying a set of candidate textual labels that are predictive of aesthetic scores associated with images in the corpus; clustering the candidate labels in the set into a plurality of visual attribute clusters based on similarity and assigning each of the clusters a visual attribute label; for each of the visual attribute labels, training a visual attribute classifier using visual representations of images in the corpus and respective visual attribute labels; evaluating the visual attribute labels based on performance of the trained visual attribute classifiers, the evaluating comprising comparing performance of each of the visual attribute classifiers with a predefined threshold; and retaining a subset of the visual attribute labels based on the evaluation, including retaining the visual attribute labels for the visual attribute classifiers that meet the performance threshold; wherein at least one of the identifying a set of candidate textual labels, clustering the candidate labels, training the classifier, and evaluating the classifier performance is performed with a processor. |
地址 |
Norwalk CT US |