发明名称 Video summarization using group sparsity analysis
摘要 A method for identifying a set of key video frames from a video sequence comprising extracting feature vectors for each video frame and applying a group sparsity algorithm to represent the feature vector for a particular video frame as a group sparse combination of the feature vectors for the other video frames. Weighting coefficients associated with the group sparse combination are analyzed to determine video frame clusters of temporally-contiguous, similar video frames. A summary is formed based on the determined video frame clusters.
申请公布号 US9076043(B2) 申请公布日期 2015.07.07
申请号 US201213565926 申请日期 2012.08.03
申请人 Kodak Alaris Inc. 发明人 Kumar Mrityunjay;Loui Alexander C.;Pillman Bruce Harold
分类号 H04N5/445;G06K9/00;G11B27/034;G11B27/28;G06K9/62 主分类号 H04N5/445
代理机构 Hogan Lovells US LLP 代理人 Hogan Lovells US LLP
主权项 1. A method for forming a video summary from a video sequence including a time sequence of video frames, each video frame including an array of image pixels having pixel values, comprising: a) selecting a set of video frames from the video sequence; b) extracting a feature vector for each video frame in the set of video frames; c) applying a group sparsity algorithm to represent the feature vector for a particular video frame as a group sparse combination of the feature vectors for the other video frames in the set of video frames, each feature vector for the other video frames in the group sparse combination having an associated weighting coefficient, wherein the weighting coefficients for feature vectors corresponding to other video frames that are most similar to the particular video frame are non-zero, and the weighting coefficients for feature vectors corresponding to other video frames that are most dissimilar from the particular video frame are zero; d) analyzing the weighting coefficients to determine a video frame cluster of temporally-contiguous, similar video frames that includes the particular video frame; e) repeating steps c)-d) for a plurality of particular video frames to provide a plurality of video frame clusters; f) selecting a subset of the video frame clusters; g) forming the video summary by combining video frames from the selected video frame clusters; and h) storing the video summary in a processor-accessible memory; wherein the method is performed, at least in part, using a data processor, and wherein the video summary is stored by extracting video frames from the video sequence corresponding to the selected video frame clusters and storing the extracted frames in a separate video file.
地址 Rochester NY US