Independent claim |
1. A method for tracking a moving object in a compressed video stream, the method comprising:
receiving, by a computer system having a processor coupled to a memory device, a compressed video stream that comprises a plurality of frames, each frame including motion vector data and DC coefficient data;
using a bit stream parser stored in the memory device to parse the compressed video stream, thereby extracting motion vector data and DC coefficient data for a selected frame of the compressed video stream, the selected frame comprising a plurality of macro-blocks;
using an object detection sub-module stored in the memory device and the extracted motion vector data to identify a plurality of foreground macro-blocks from amongst the plurality of macro-blocks, the foreground macro-blocks corresponding to motion that is distinguishable from a background motion model;
using a grouping and labeling sub-module stored in the memory device to group a subset of the plurality of foreground macro-blocks based on a feature map that depends on the extracted motion vector data and DC coefficient data that is associated with the grouped subset of foreground macro-blocks; and
using a validation and refinement sub-module stored in the memory device to validate the grouped subset of foreground macro-blocks based on a comparison of a cost metric between the selected frame and a temporally adjacent frame, the cost metric depending on frame-to-frame motion and variance of the grouped subset of foreground macro-blocks. |
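
The detection, grouping, and validation steps recited above can be illustrated with a minimal sketch. It is not the claimed implementation. It assumes the bit stream parser has already produced, for each frame, a grid of per-macro-block motion vectors `mvs` and DC values `dc`. The function names, the thresholds, the use of the global median motion vector as the background motion model, the 4-neighbour similarity test standing in for the feature map, and the particular cost formula (mean motion magnitude plus DC variance) are all hypothetical choices made for illustration.

```python
from collections import deque
from statistics import median, pvariance


def detect_foreground(mvs, threshold=2.0):
    """Return coordinates of macro-blocks whose motion departs from the background.

    Assumption: the background motion model is the median motion vector
    of the frame, and distance is measured with the L1 norm.
    """
    flat = [mv for row in mvs for mv in row]
    bg = (median(dx for dx, _ in flat), median(dy for _, dy in flat))
    return {
        (r, c)
        for r, row in enumerate(mvs)
        for c, (dx, dy) in enumerate(row)
        if abs(dx - bg[0]) + abs(dy - bg[1]) > threshold
    }


def group_blocks(fg, mvs, dc, mv_tol=2.0, dc_tol=32.0):
    """Group adjacent foreground blocks whose motion and DC values are similar.

    Assumption: the feature map is simply (motion vector, DC value), and
    grouping is a breadth-first connected-component search.
    """
    def similar(a, b):
        (ra, ca), (rb, cb) = a, b
        mv_gap = (abs(mvs[ra][ca][0] - mvs[rb][cb][0])
                  + abs(mvs[ra][ca][1] - mvs[rb][cb][1]))
        return mv_gap <= mv_tol and abs(dc[ra][ca] - dc[rb][cb]) <= dc_tol

    seen, groups = set(), []
    for seed in sorted(fg):
        if seed in seen:
            continue
        seen.add(seed)
        members, queue = [seed], deque([seed])
        while queue:
            r, c = queue.popleft()
            for nb in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
                if nb in fg and nb not in seen and similar((r, c), nb):
                    seen.add(nb)
                    members.append(nb)
                    queue.append(nb)
        groups.append(sorted(members))
    return groups


def group_cost(group, mvs, dc):
    """Hypothetical cost: mean L1 motion magnitude plus variance of DC values."""
    motion = sum(abs(mvs[r][c][0]) + abs(mvs[r][c][1]) for r, c in group) / len(group)
    return motion + pvariance([dc[r][c] for r, c in group])


def validate(group, cur, adj, rel_tol=0.5):
    """Accept the group if its cost is stable between two adjacent frames.

    `cur` and `adj` are (mvs, dc) pairs for the selected and adjacent frame.
    """
    c_cur = group_cost(group, *cur)
    c_adj = group_cost(group, *adj)
    return abs(c_cur - c_adj) <= rel_tol * max(c_cur, c_adj, 1e-9)
```

In this sketch a group is kept only when its cost changes by no more than `rel_tol` relative to the larger of the two frame costs, so a cluster of blocks that moves in one frame but is static in the next is rejected as noise.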