Title of invention: METHODS AND APPARATUSES FOR VIDEO SEGMENTATION, CLASSIFICATION, AND RETRIEVAL USING IMAGE CLASS STATISTICAL MODELS
Abstract: Techniques for classifying video frames using statistical models of transform coefficients are disclosed. After optionally being decimated in time and space, image frames are transformed using a discrete cosine transform or a Hadamard transform. The disclosed methods model image composition and operate on grayscale images. The resulting transform matrices are reduced by truncation, principal component analysis, or linear discriminant analysis to produce feature vectors. Feature vectors of training images for each image class are used to compute image class statistical models. Once the image class statistical models are derived, individual frames are classified by maximum likelihood: the probability that the feature vector derived from a frame would be produced by each image class statistical model is computed, and the frame is assigned to the image class whose model yields the highest probability. Optionally, frame sequence information is taken into account by applying a hidden Markov model to represent image class transitions from the previous frame to the current frame. After all class probabilities have been computed for all frames in the video or sequence of frames using the image class statistical models and the image class transition probabilities, the final class is selected as the one having the maximum likelihood. The classes of previous frames are then selected in reverse order based upon their likelihood given the already determined current states.
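The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. It assumes truncation as the reduction step, a single diagonal-covariance Gaussian as the statistical model for each image class (the abstract does not name the model family), and a Viterbi-style search for the hidden Markov step. All function names, the `keep` parameter, and the variance floor are hypothetical choices made for illustration.

```python
import math

def dct2(block):
    """Orthonormal 2-D DCT-II of a square grayscale block (list of rows)."""
    n = len(block)
    def basis(k, i):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        return scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
    c = [[basis(k, i) for i in range(n)] for k in range(n)]
    # Compute C * X * C^T: transform along one axis, then the other.
    tmp = [[sum(c[u][i] * block[i][j] for i in range(n)) for j in range(n)]
           for u in range(n)]
    return [[sum(tmp[u][j] * c[v][j] for j in range(n)) for v in range(n)]
            for u in range(n)]

def features(block, keep=2):
    """Reduce by truncation: keep the keep x keep low-frequency coefficients."""
    coeffs = dct2(block)
    return [coeffs[u][v] for u in range(keep) for v in range(keep)]

def fit_gaussian(vectors, floor=1e-3):
    """Assumed class model: per-dimension mean and (floored) variance."""
    n, d = len(vectors), len(vectors[0])
    mean = [sum(v[k] for v in vectors) / n for k in range(d)]
    var = [max(sum((v[k] - mean[k]) ** 2 for v in vectors) / n, floor)
           for k in range(d)]
    return mean, var

def log_likelihood(x, model):
    """Log-probability of feature vector x under a diagonal Gaussian."""
    mean, var = model
    return sum(-0.5 * (math.log(2 * math.pi * s) + (xi - m) ** 2 / s)
               for xi, m, s in zip(x, mean, var))

def classify(x, models):
    """Frame-by-frame maximum-likelihood classification."""
    return max(models, key=lambda name: log_likelihood(x, models[name]))

def viterbi(frames, models, log_trans, log_prior):
    """Most likely class sequence given class transition log-probabilities."""
    names = list(models)
    score = {c: log_prior[c] + log_likelihood(frames[0], models[c])
             for c in names}
    back = []
    for x in frames[1:]:
        prev = score
        # Best predecessor class for each current class.
        ptr = {c: max(names, key=lambda p: prev[p] + log_trans[p][c])
               for c in names}
        score = {c: prev[ptr[c]] + log_trans[ptr[c]][c]
                    + log_likelihood(x, models[c]) for c in names}
        back.append(ptr)
    # Pick the final class by maximum likelihood, then trace back in reverse.
    path = [max(names, key=lambda c: score[c])]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    return path[::-1]
```

In this sketch, `classify` labels each frame independently, while `viterbi` lets the transition probabilities overrule an isolated frame whose features sit slightly closer to another class, which is the effect the optional hidden Markov step is meant to provide.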
Publication number: US2002028021(A1) Publication date: 2002.03.07
Application number: US19990266637 Filing date: 1999.03.11
Applicant: FOOTE JONATHAN T.;WILCOX LYNN;GIRGENSOHN ANDREAS Inventor: FOOTE JONATHAN T.;WILCOX LYNN;GIRGENSOHN ANDREAS
Classification: H04N5/76;G06F17/30;G06K9/62;G06K9/72;G06T7/00;G10L15/10;G10L15/14;H04N7/15;H04N7/30;(IPC1-7):G06K9/62 Main classification: H04N5/76
Agency: Agent:
Main claim:
Address: