发明名称 |
Method for determining similarity of objects represented in images |
摘要 |
A method re-identifies objects in a pair of images by applying a convolutional neural network (CNN). Each layer in the network operates on an output of a previous layer. The layers include a first convolutional layer and a first max pooling layer to determine a feature map, a cross-input neighborhood differences layer to produce neighborhood difference maps, a patch summary layer to produce patch summary feature maps, a first fully connected layer to produce a feature vector representing higher order relationships in the patch summary feature maps, a second fully connected layer to produce two scores representing positive pair and negative pair classes, and a softmax layer to produce positive pair and negative pair probabilities. Then, the positive pair probability is output to signal whether the two images represent the same object or not. |
申请公布号 |
US9436895(B1) |
申请公布日期 |
2016.09.06 |
申请号 |
US201514678102 |
申请日期 |
2015.04.03 |
申请人 |
Mitsubishi Electric Research Laboratories, inc. |
发明人 |
Jones Michael;Marks Tim;Ahmed Ejaz |
分类号 |
G06K9/00;G06K9/66;G06K9/62 |
主分类号 |
G06K9/00 |
代理机构 |
|
代理人 |
Vinokur Gene;McAleenan James;Tsukamoto Hironori |
主权项 |
1. A method for re-identification of objects, comprising steps:
obtaining a pair of images, wherein each image represents an object; applying a convolutional neural network (CNN) to the pair of images, wherein the CNN comprises: a first convolutional layer; a first max pooling layer, which follows the first convolutional layer, wherein the first convolutional layer and the first max pooling layer are applied to each image separately to determine a feature map for each image; a cross-input neighborhood differences layer, which is applied to the feature maps to produce neighborhood difference maps; a patch summary layer, which is applied to the neighborhood difference maps to produce patch summary feature maps; a first fully connected layer, which is applied to the patch summary feature maps to produce a feature vector representing higher order relationships in the patch summary feature maps; a second fully connected layer, which is applied to the feature vector representing higher order relationships to produce two scores representing positive pair and negative pair classes; a softmax layer, which is applied to the two scores to produce positive pair and negative pair probabilities; and outputting the positive pair probability to signal whether the two images represent the same object or not, wherein the steps are performed in a processor. |
地址 |
Cambridge MA US |