Independent claim |
1. A system comprising:
a database for storing a plurality of media items;
one or more computers connected to the database and configured to:
obtain the plurality of media items, each of the plurality of media items being identified as either (i) a concept media item that has been classified as a media item in which a particular visual concept appears or (ii) a non-concept media item that has been classified as a media item in which the particular visual concept does not appear;
obtain a plurality of concept segments, wherein each of the concept segments is a segment that has been extracted from a concept media item;
obtain a plurality of non-concept segments, wherein each non-concept segment is a segment that has been extracted from a non-concept media item, wherein each concept segment and each non-concept segment is represented in a feature space;
for each non-concept segment, identify a closest concept segment, wherein the closest concept segment is the concept segment that is closest to the non-concept segment of any of the plurality of concept segments, wherein the closest concept segment is identified based upon pairwise distances between all of the concept segments and all of the non-concept segments in the feature space;
determine, for each concept segment, a respective count of instances in which the concept segment is identified as the closest concept segment to one of the non-concept segments;
rank each concept segment such that the ranking reflects a respective likelihood that the concept segment contains the particular visual concept by ranking the concept segments such that concept segments having lower counts are favored over concept segments having higher counts; and
label concept segments that are below a threshold rank in the ranking as not containing the particular visual concept. |
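The counting, ranking, and labeling steps recited in the claim can be illustrated with a minimal Python sketch. It is not the claimed implementation. It assumes segments are already represented as numeric feature vectors and that distance is Euclidean (the claim only requires pairwise distances in a feature space). The function name `rank_concept_segments`, the parameter `keep` (the number of top-ranked segments that stay labeled as containing the concept), and the tie-breaking by original index are all hypothetical choices made for illustration.

```python
from math import dist  # Euclidean distance, Python 3.8+


def rank_concept_segments(concept, non_concept, keep):
    """Illustrative sketch: rank concept segments by nearest-neighbor counts.

    concept, non_concept: lists of feature vectors (tuples of floats).
    keep: hypothetical threshold rank; segments ranked at or after this
          position are labeled as NOT containing the visual concept.
    """
    # Count how often each concept segment is the closest concept
    # segment to some non-concept segment.
    counts = [0] * len(concept)
    for q in non_concept:
        nearest = min(range(len(concept)), key=lambda i: dist(concept[i], q))
        counts[nearest] += 1

    # Lower counts are favored: a segment that resembles few non-concept
    # segments is more likely to show the concept. Python's sort is
    # stable, so ties keep their original index order (an assumption).
    ranking = sorted(range(len(concept)), key=lambda i: counts[i])

    # True = still considered to contain the concept, False = labeled
    # as not containing it.
    labels = {seg: pos < keep for pos, seg in enumerate(ranking)}
    return counts, ranking, labels


# Toy data: concept segment 0 sits among the non-concept segments,
# so it collects the most "closest" hits and ranks last.
concept = [(0.0, 0.0), (5.0, 5.0), (6.0, 5.0)]
non_concept = [(0.2, 0.1), (-0.1, 0.3), (0.4, -0.2), (5.6, 5.1)]
counts, ranking, labels = rank_concept_segments(concept, non_concept, keep=2)
print(counts, ranking, labels)
```

In this toy example, the concept segment that looks most like background material (index 0) receives the highest count and falls below the threshold, which matches the intuition behind the claim: segments from a concept media item that resemble non-concept segments probably do not show the concept themselves.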