Abstract |
A method for identifying a set of key frames from a video sequence including a time sequence of video frames, the method executed at least in part by a data processor, comprising: selecting a set of video frames from the video sequence; identifying a plurality of visually homogeneous regions from each of the selected video frames; defining a set of basis functions, wherein each basis function is associated with a different visually homogeneous region; determining a feature vector for each of the selected video frames; representing each of the determined feature vectors as a sparse combination of the basis functions; for each of the determined feature vectors, determining a sparse set of video frames that contain the visually homogeneous regions corresponding to the basis functions included in the corresponding sparse combination of the basis functions; and analyzing the sparse sets of video frames to identify the set of key frames.
|
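The steps recited in the abstract can be illustrated with a minimal sketch. It is not the patented implementation, and several choices are assumptions made purely for illustration: each visually homogeneous region is assumed to be already summarized as a numeric descriptor, a frame's feature vector is taken to be the mean of its region descriptors, a plain greedy matching pursuit stands in for whatever sparse solver is actually used, a frame is not allowed to be explained by its own regions, and key frames are picked by counting how often a frame appears in other frames' sparse sets. The names `matching_pursuit` and `select_key_frames` are hypothetical.

```python
import math
import random


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v] if norm > 0 else list(v)


def matching_pursuit(atoms, target, n_nonzero):
    """Greedily pick up to n_nonzero distinct unit-norm atoms for target."""
    residual = list(target)
    chosen = []
    for _ in range(n_nonzero):
        best, best_coef = None, 0.0
        for i, atom in enumerate(atoms):
            if i in chosen:
                continue  # illustrative choice: use each atom at most once
            coef = dot(atom, residual)
            if abs(coef) > abs(best_coef):
                best, best_coef = i, coef
        if best is None:
            break  # nothing correlates with the residual any more
        chosen.append(best)
        residual = [r - best_coef * a for r, a in zip(residual, atoms[best])]
    return chosen


def select_key_frames(frame_regions, n_nonzero=2, n_key=2):
    """frame_regions[f] is a list of region descriptors for frame f."""
    # One basis function (atom) per region; remember which frame owns it.
    atoms, owner = [], []
    for f, regions in enumerate(frame_regions):
        for region in regions:
            atoms.append(normalize(region))
            owner.append(f)

    votes = [0] * len(frame_regions)
    sparse_sets = []
    for f, regions in enumerate(frame_regions):
        # Assumption: the frame feature vector is the mean region descriptor.
        dim = len(regions[0])
        feature = [sum(r[d] for r in regions) / len(regions) for d in range(dim)]
        # Assumption: exclude the frame's own regions from the dictionary.
        candidates = [i for i, o in enumerate(owner) if o != f]
        picked = matching_pursuit([atoms[i] for i in candidates], feature, n_nonzero)
        # Sparse set: frames containing the regions behind the chosen atoms.
        frames = sorted({owner[candidates[p]] for p in picked})
        sparse_sets.append(frames)
        for g in frames:
            votes[g] += 1

    # Assumption: key frames are those referenced most often by other frames.
    ranked = sorted(range(len(votes)), key=lambda g: -votes[g])
    return sorted(ranked[:n_key]), sparse_sets


# Synthetic stand-in for a video: 6 frames, 3 regions each, 8-D descriptors.
random.seed(0)
video = [[[random.random() for _ in range(8)] for _ in range(3)] for _ in range(6)]
key_frames, sparse_sets = select_key_frames(video)
print(key_frames, sparse_sets)
```

Voting is only one possible way to analyze the sparse sets; a clustering of the sets, for example, would fit the same description.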