发明名称 Method and apparatus for processing video sequences
摘要 A method for processing a video sequence having a plurality of frames includes the steps of: extracting features from each of the frames, determining correspondences between the extracted features from two of the frames, estimating motion in the video sequence based on the determined correspondences, generating a background mosaic for the video sequence based on the estimated motion, and performing foreground-background segmentation on each of the frames based on the background mosaic.
申请公布号 US9214030(B2) 申请公布日期 2015.12.15
申请号 US200812451264 申请日期 2008.04.25
申请人 Thomson Licensing 发明人 Sole Joel;Huang Yu;Llach Joan
分类号 H04N7/30;H04N7/26;H04N7/64;G06T7/20;G06T5/00 主分类号 H04N7/30
代理机构 代理人 Shedd Robert D.;Lin Reitseng
主权项 1. A method for processing a video sequence comprised of a plurality of frames, said method comprising: extracting a feature from each of said frames; determining correspondences between said extracted feature from said frames; determining motion in said video sequence based on said determined correspondences, said determining motion using a modified random sample consensus algorithm that selects samples from buckets, iterates a model estimation multiple times including all inliers obtained so far in each iteration, and finds a model that maximizes a data likelihood, wherein a motion hypothesis is derived with a least squares method when the obtained inliers are determined to be less than a number, and the motion hypothesis is derived with a weighted total least squares method otherwise; generating a forward warping matrix and a background warping matrix for each of said frames based on said determined motion; generating a forward warping error and a backward warping error for each of said frames based on said forward warping matrix and said background warping matrix; generating a foreground/background mask for each of said frames based on said forward warping error and said backward warping error; and generating a background mosaic by mapping said frames to a common coordinate system; and extracting foreground information from each of said frames based on said background mosaic.
地址 Boulogne-Billancourt FR