Title of invention: Hierarchical Interlinked Multi-scale Convolutional Network for Image Parsing
Abstract: A disclosed facial recognition system (and method) includes face parsing. In one approach, the face parsing is based on a hierarchical interlinked multiscale convolutional neural network (HIM) to identify locations and/or footprints of components of a face image. The HIM generates multiple levels of image patches from different resolution images of the face image, where image patches for different levels have different resolutions. Moreover, the HIM integrates the image patches for different levels to generate interlinked image patches for different levels, where interlinked image patches for different levels have different resolutions. Furthermore, the HIM combines the interlinked image patches to identify refined locations and/or footprints of components.
Publication number: US2016104053(A1) Publication date: 2016.04.14
Application number: US201414402030 Filing date: 2014.10.10
Applicant: Beijing Kuangshi Technology Co., Ltd. Inventors: Yin Qi; Cao Zhimin; Zhou Yisu
Classification: G06K9/46; G06T7/00; G06T11/60; G06T3/00; G06K9/00; G06K9/66; G06T3/40 Main classification: G06K9/46
Agency: Agent:
Main claim: 1. A system for parsing an image into components, the system comprising: a hierarchical interlinked multiscale convolutional neural network (HIM) for locating the components from the image, the HIM comprising: a level generator configured to receive the image and to generate N levels of image patches from the image, N>2, wherein the image patches for different levels n have different resolutions R(n) and the image patches for a level n are generated from the image resampled to resolution R(n); an interlinked combiner configured to receive the N levels of image patches from the level generator and to generate M levels of interlinked image patches from the N levels of image patches, 2<M≦N, wherein the interlinked image patches for different levels m have different resolutions R(m) and the interlinked image patches for a level m are generated from an input group m of image patches comprising (i) image patches from level n with R(n)=R(m), and (ii) image patches from one or more levels n with R(n)≠R(m) where such image patches have been resampled to resolution R(m); and an aggregator configured to locate the components by combining the M levels of interlinked image patches.
Address: Beijing CN
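
The data flow recited in the main claim (level generator, interlinked combiner, aggregator) can be illustrated with a minimal sketch. This is not the patented implementation: the convolutional layers and all learned parameters are omitted, each level is treated as a single square map rather than a set of patches, and the names `resample`, `level_generator`, `interlinked_combiner`, `aggregator` and the `neighbours` parameter are hypothetical. Nearest-neighbour resampling and pixel-wise averaging are placeholders chosen only to keep the example self-contained.

```python
# Illustrative sketch only: shows how levels at resolutions R(n) are grouped
# and resampled to R(m). No convolution or training is performed here.

def resample(img, size):
    """Nearest-neighbour resample of a square 2-D list to size x size."""
    src = len(img)
    return [[img[r * src // size][c * src // size] for c in range(size)]
            for r in range(size)]

def level_generator(image, resolutions):
    """Build one level per resolution R(n): the image resampled to R(n)."""
    return {res: resample(image, res) for res in resolutions}

def interlinked_combiner(levels, neighbours=1):
    """For each target resolution R(m), form an input group made of
    (i) the level already at R(m) and (ii) adjacent levels resampled to R(m).
    Restricting to adjacent levels is an assumption of this sketch."""
    ordered = sorted(levels, reverse=True)
    groups = {}
    for i, target in enumerate(ordered):
        lo = max(0, i - neighbours)
        hi = min(len(ordered), i + neighbours + 1)
        groups[target] = [resample(levels[ordered[j]], target)
                          for j in range(lo, hi)]
    return groups

def aggregator(groups, out_size):
    """Placeholder combination: resample every interlinked map to out_size
    and average pixel-wise (a real system would use learned layers)."""
    maps = [resample(m, out_size) for g in groups.values() for m in g]
    return [[sum(m[r][c] for m in maps) / len(maps) for c in range(out_size)]
            for r in range(out_size)]

# Usage with N = M = 3 levels, satisfying N > 2 and 2 < M <= N.
image = [[1.0] * 8 for _ in range(8)]
levels = level_generator(image, [8, 4, 2])
groups = interlinked_combiner(levels)
combined = aggregator(groups, 8)
```

In this sketch the middle level receives inputs from both of its neighbours, while the highest and lowest resolution levels each receive one resampled neighbour in addition to their own map.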