发明名称 Method for determining similarity of objects represented in images
摘要 A method re-identifies objects in a pair of images by applying a convolutional neural network (CNN). Each layer in the network operates on an output of a previous layer. The layers include a first convolutional layer and a first max pooling layer to determine a feature map, a cross-input neighborhood differences layer to produce neighborhood difference maps, a patch summary layer to produce patch summary feature maps, a first fully connected layer to produce a feature vector representing higher order relationships in the patch summary feature maps, a second fully connected layer to produce two scores representing positive pair and negative pair classes, and a softmax layer to produce positive pair and negative pair probabilities. Then, the positive pair probability is output to signal whether the two images represent the same object or not.
申请公布号 US9436895(B1) 申请公布日期 2016.09.06
申请号 US201514678102 申请日期 2015.04.03
申请人 Mitsubishi Electric Research Laboratories, inc. 发明人 Jones Michael;Marks Tim;Ahmed Ejaz
分类号 G06K9/00;G06K9/66;G06K9/62 主分类号 G06K9/00
代理机构 代理人 Vinokur Gene;McAleenan James;Tsukamoto Hironori
主权项 1. A method for re-identification of objects, comprising steps: obtaining a pair of images, wherein each image represents an object; applying a convolutional neural network (CNN) to the pair of images, wherein the CNN comprises: a first convolutional layer; a first max pooling layer, which follows the first convolutional layer, wherein the first convolutional layer and the first max pooling layer are applied to each image separately to determine a feature map for each image; a cross-input neighborhood differences layer, which is applied to the feature maps to produce neighborhood difference maps; a patch summary layer, which is applied to the neighborhood difference maps to produce patch summary feature maps; a first fully connected layer, which is applied to the patch summary feature maps to produce a feature vector representing higher order relationships in the patch summary feature maps; a second fully connected layer, which is applied to the feature vector representing higher order relationships to produce two scores representing positive pair and negative pair classes; a softmax layer, which is applied to the two scores to produce positive pair and negative pair probabilities; and outputting the positive pair probability to signal whether the two images represent the same object or not, wherein the steps are performed in a processor.
地址 Cambridge MA US
您可能感兴趣的专利