发明名称 System and method for using segmentation to identify object location in images
摘要 A computing device segments an image into a plurality of segments, wherein each segment of the plurality of segments has a segment location and a set of pixels that share visual characteristics. The computing device determines an initial set of bounding boxes for the image based on the plurality of segments. The computing device determines a reduced set of bounding boxes based on combining bounding boxes of the initial set of bounding boxes, the reduced set of bounding boxes corresponding to one or more objects in the image, each of the one or more objects having an object class and an object location.
申请公布号 US9483701(B1) 申请公布日期 2016.11.01
申请号 US201113299281 申请日期 2011.11.17
申请人 GOOGLE INC. 发明人 Kwatra Vivek;Yagnik Jay;Toshev Alexander T.
分类号 G06K9/62;G06K9/00;G06K9/32 主分类号 G06K9/62
代理机构 Lowenstein Sandler LLP 代理人 Lowenstein Sandler LLP
主权项 1. A method comprising: maintaining a data structure having a plurality of entries associated with a training set of images, each entry identifying a set of visual characteristics of a prototypical segment from the training set of images and an associated set of potential bounding boxes, the prototypical segment representing a combination of similar segments across different images from the training set of images; segmenting, by a computing device, a current image into a plurality of segments, wherein each segment of the plurality of segments of the current image has a segment location and a set of pixels that share visual characteristics; for each segment of the current image, finding in the data structure an entry identifying a set of visual characteristics of a respective prototypical segment that are most similar to visual characteristics of the segment of the current image to determine a set of potential bounding boxes associated with the entry; determining, by the computing device, an initial set of bounding boxes for the current image based on sets of potential bounding boxes for the plurality of segments of the current image; and determining a reduced set of bounding boxes based on combining bounding boxes of the initial set of bounding boxes, the reduced set of bounding boxes corresponding to one or more objects in the current image, each of the one or more objects having an object class and an object location.
地址 Mountain View CA US