Abstract |
A system detects a face within an image by receiving the image, which includes a plurality of pixels, where a plurality of the pixels of the image are represented by respective groups of at least three values. The image is filtered by transforming a plurality of the respective groups of the at least three values to respective groups of less than three values, where the respective groups of the less than three values have less dependency on brightness than the respective groups of the at least three values. Regions of the image representative of skin-tones are determined based on the filtering. A first distribution of the regions of the image representative of the skin-tones in a first direction is calculated. A second distribution of the regions of the image representative of the skin-tones in a second direction is calculated, where the first direction and the second direction are different. The face within the image is located based on the first distribution and the second distribution. The estimated face location may also be used for tracking the face between frames of a video. |
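
The steps summarized above can be illustrated with a minimal sketch. It is not the claimed implementation, and several details are assumptions that the abstract does not state: the three values per pixel are taken to be RGB, the brightness-reducing transform is taken to be normalized chromaticity (r = R/(R+G+B), g = G/(R+G+B)), the two directions are taken to be the image columns and rows, and the face is located from the mean and spread of each distribution. The function name `locate_face` and the skin-tone bounds `r_range` and `g_range` are hypothetical placeholders chosen only for illustration.

```python
import numpy as np


def locate_face(image, r_range=(0.36, 0.47), g_range=(0.28, 0.36)):
    """Estimate a face location from skin-tone distributions.

    image: H x W x 3 uint8 array, assumed to be RGB.
    r_range, g_range: hypothetical skin-tone bounds in chromaticity space.
    Returns (x_center, y_center, x_spread, y_spread), or None if no
    skin-tone pixel is found.
    """
    rgb = image.astype(np.float64)

    # Filter: map each (R, G, B) triple to two values (r, g). Dividing by
    # the channel sum removes most of the dependency on brightness.
    total = rgb.sum(axis=2)
    total[total == 0] = 1.0  # avoid division by zero on black pixels
    r = rgb[..., 0] / total
    g = rgb[..., 1] / total

    # Skin-tone regions: pixels whose (r, g) pair falls inside the bounds.
    skin = (
        (r >= r_range[0]) & (r <= r_range[1])
        & (g >= g_range[0]) & (g <= g_range[1])
    )
    if not skin.any():
        return None

    # First distribution: skin-pixel count per column (x direction).
    col_hist = skin.sum(axis=0)
    # Second distribution: skin-pixel count per row (y direction).
    row_hist = skin.sum(axis=1)

    xs = np.arange(skin.shape[1])
    ys = np.arange(skin.shape[0])

    # Locate the face from the mean and standard deviation of each
    # distribution (an assumed choice of statistics).
    x_center = np.average(xs, weights=col_hist)
    y_center = np.average(ys, weights=row_hist)
    x_spread = np.sqrt(np.average((xs - x_center) ** 2, weights=col_hist))
    y_spread = np.sqrt(np.average((ys - y_center) ** 2, weights=row_hist))
    return x_center, y_center, x_spread, y_spread
```

For tracking across video frames, one option under the same assumptions is to call `locate_face` on each frame and restrict the search to a window around the previous estimate. Because the chromaticity values are ratios, uniformly dimming the image leaves the skin mask, and therefore the estimate, essentially unchanged.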