发明名称 SPATIAL PYRAMID POOLING NETWORKS FOR IMAGE PROCESSING
摘要 Spatial pyramid pooling (SPP) layers are combined with convolutional layers and partition an input image into divisions from finer to coarser levels, and aggregate local features in the divisions. A fixed-length output may be generated by the SPP layer(s) regardless of the input size. The multi-level spatial bins used by the SPP layer(s) may provide robustness to object deformations. An SPP layer based system may pool features extracted at variable scales due to the flexibility of input scales making it possible to generate a full-image representation for testing. Moreover, SPP networks may enable feeding of images with varying sizes or scales during training, which may increase scale-invariance and reduce the risk of over-fitting.
申请公布号 US2016104056(A1) 申请公布日期 2016.04.14
申请号 US201514617936 申请日期 2015.02.10
申请人 Microsoft Technology Licensing, LLC 发明人 He Kaiming;Sun Jian;Zhang Xiangyu;Ren Shaoqing
分类号 G06K9/62;G06K9/46;G06N3/04;G06K9/66 主分类号 G06K9/62
代理机构 代理人
主权项 1. A method to perform image processing, the method comprising: receiving an input image; generating feature maps by one or more filters on one or more convolutional layers of a neural network; spatially pooling responses of each filter of a top convolutional layer at a spatial pyramid pooling (SPP) network following the top convolutional layer, wherein the SPP network comprises one or more layers; and providing outputs of a top SPP network layer to a fully-connected layer as fixed dimensional vectors.
地址 Redmond WA US