Title: Image processing apparatus, image processing method, program, and recording medium for learning from moving images
Abstract: An image processing apparatus includes: an image feature outputting unit that outputs each of image features in correspondence with a time of the frame; a foreground estimating unit that estimates a foreground image at a time s by executing a view transform as a geometric transform on a foreground view model and outputs an estimated foreground view; a background estimating unit that estimates a background image at the time s by executing a view transform as a geometric transform on a background view model and outputs an estimated background view; a synthesized view generating unit that generates a synthesized view by synthesizing the estimated foreground and background views; a foreground learning unit that learns the foreground view model based on an evaluation value obtained by comparison between the synthesized view and the image feature at the time s by updating the parameter of the foreground view model; and a background learning unit that learns the background view model based on the evaluation value by updating the parameter of the background view model.
Publication number: US8849017(B2) Publication date: 2014.09.30
Application number: US201213427199 Filing date: 2012.03.22
Applicant: Sony Corporation Inventors: Ito Masato; Sabe Kohtaro; Yokono Jun
Classification: G06K9/62; G06T7/00 Main classification: G06K9/62
Agency: Finnegan, Henderson, Farabow, Garrett & Dunner, L.L.P. Agent: Finnegan, Henderson, Farabow, Garrett & Dunner, L.L.P.
Main claim: 1. An image processing apparatus comprising: an image feature outputting unit that outputs each of image features, which are formed as features of a plurality of feature points of images of each frame in data of an input moving image, in correspondence with a time of the frame; a foreground estimating unit that estimates a foreground image at a time s by executing a view transform as a geometric transform on a foreground view model having the image feature of a foreground image in the image as a parameter in regard to the image feature at the time s, and then outputs an estimated foreground view; a background estimating unit that estimates a background image at the time s by executing a view transform as a geometric transform on a background view model having the image feature of a background image in the image as a parameter in regard to the image feature at the time s, and then outputs an estimated background view; a synthesized view generating unit that generates a synthesized view by synthesizing the estimated foreground view and the estimated background view; a foreground learning unit that learns the foreground view model based on an evaluation value obtained by comparison between the synthesized view and the image feature at the time s by updating the parameter of the foreground view model based on a stochastic generation model; and a background learning unit that learns the background view model based on the evaluation value by updating the parameter of the background view model based on a stochastic generation model.
Address: Tokyo JP
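
The estimate, synthesize, evaluate, and update loop recited in claim 1 can be illustrated with a toy sketch. This is not the patented method: it assumes 1-D frames of scalar features, restricts the view transform to an integer translation, replaces the stochastic generation model with an exhaustive search over transforms, and uses a plain running-average update. All names (`transform`, `synthesize`, `evaluate`, `learn_step`, `lr`) are hypothetical, and the background model is assumed to be at least as long as the frame.

```python
from typing import List, Optional, Tuple


def transform(model: List[float], offset: int, size: int) -> List[Optional[float]]:
    """View transform (here only a translation): place `model` into a frame of
    length `size`, shifted by `offset`. Positions not covered stay None."""
    view: List[Optional[float]] = [None] * size
    for i, value in enumerate(model):
        if 0 <= i + offset < size:
            view[i + offset] = value
    return view


def synthesize(fg_view, bg_view):
    """Synthesized view: foreground where present, background elsewhere."""
    return [b if f is None else f for f, b in zip(fg_view, bg_view)]


def evaluate(synth, observed) -> float:
    """Evaluation value: negative squared error, so higher is better."""
    return -sum((s - o) ** 2 for s, o in zip(synth, observed))


def learn_step(fg: List[float], bg: List[float], observed: List[float],
               lr: float = 0.5) -> Tuple[int, int, float]:
    """Pick the best-scoring pair of translations, then move both view models
    toward the observed frame. Returns (fg_offset, bg_offset, score), where
    score is the evaluation value before the update."""
    size = len(observed)
    best = None
    # Assumption: len(bg) >= size, so every candidate offset covers the frame.
    for bg_off in range(size - len(bg), 1):
        bg_view = transform(bg, bg_off, size)
        for fg_off in range(0, size - len(fg) + 1):
            synth = synthesize(transform(fg, fg_off, size), bg_view)
            score = evaluate(synth, observed)
            if best is None or score > best[2]:
                best = (fg_off, bg_off, score)
    fg_off, bg_off, _ = best
    covered = set(range(fg_off, fg_off + len(fg)))
    # Foreground update: pixels under the foreground correct the foreground model.
    for i in range(len(fg)):
        fg[i] += lr * (observed[i + fg_off] - fg[i])
    # Background update: only pixels not hidden by the foreground are used.
    for j in range(size):
        if j not in covered:
            bg[j - bg_off] += lr * (observed[j] - bg[j - bg_off])
    return best


# One observed frame: background cropped by one position, foreground at index 3.
observed = [1.0, 2.0, 3.0, 9.0, 9.0, 6.0]
fg_model = [8.0, 8.0]                      # slightly wrong foreground estimate
bg_model = [float(i) for i in range(8)]    # background model wider than the frame
first = learn_step(fg_model, bg_model, observed)
second = learn_step(fg_model, bg_model, observed)
```

In this toy run both steps select the same pair of translations, and the evaluation value rises from the first step to the second because the foreground model has moved toward the observed features. A model in the spirit of the claim would instead weight the updates by posterior probabilities over the candidate view transforms.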