发明名称 Facial landmark localization using coarse-to-fine cascaded neural networks
摘要 The present invention overcomes the limitations of the prior art by performing facial landmark localization in a coarse-to-fine manner with a cascade of neural network levels, and enforcing geometric constraints for each of the neural network levels. In one approach, the neural network levels may be implemented with deep convolutional neural network. One aspect concerns a system for localizing landmarks on face images. The system includes an input for receiving a face image, and an output for presenting landmarks identified by the system. Neural network levels are coupled in a cascade from the input to the output for the system. Each neural network level produces an estimate of landmarks. The estimate of landmarks is more refined than an estimate of landmark of a previous neural network level.
申请公布号 US9400922(B2) 申请公布日期 2016.07.26
申请号 US201414375674 申请日期 2014.05.29
申请人 Beijing Kuangshi Technology Co., Ltd. 发明人 Zhou Erjin;Fan Haoqiang;Cao Zhimin;Jiang Yuning;Yin Qi
分类号 G06K9/00;G06T1/00;G06T7/00 主分类号 G06K9/00
代理机构 Fenwick & West LLP 代理人 Fenwick & West LLP
主权项 1. A system for localizing landmarks on face images, the system comprising: an input for receiving a face image; an output for presenting landmarks identified by the system; and a plurality of neural network levels coupled in a cascade from the input to the output; wherein each neural network level produces an estimate of landmarks that is more refined than an estimate of landmarks of a previous neural network level, wherein the plurality of neural network levels comprise: at least three cascaded neural network levels for predicting inner points defining landmarks within a face of the face image, the at least three cascaded neural network levels including the following in order from input to output: a first bounding box estimator that receives the face image as input and produces a first cropped face image as output, the first cropped face image estimating a location of the face within the face image for purposes of estimating inner points,a first initial prediction module that receives the first cropped face image as input and produces a first landmarked face image as output, the first landmarked face image containing an initial prediction of inner points within the face image, andfor each of the landmarks to be predicted, a component refinement module that receives the first landmarked face image as input and produces a landmarked component image as output, the landmarked component image containing a refined estimate of inner points defining the landmark, andtwo cascaded neural network levels for predicting outer points defining a contour of the face of the face image, the two cascaded neural network levels including the following in order from input to output: a second bounding box estimator that receives the face image as input and produces a second cropped face image as output, the second cropped face image estimating a location of the face within the face image for purposes of estimating outer points, anda second initial prediction module that receives the second cropped face image as input and produces a second landmarked face image as output, the second landmarked face image containing a prediction of outer points within the face image.
地址 Beijing CN