Title: Information processing device, information processing method, and program
Abstract: The present invention relates to an information processing device, an information processing method, and a program capable of easily adding an annotation to content. A feature amount extracting unit 21 extracts an image feature amount of each frame of an image of learning content and extracts word frequency information regarding the frequency of appearance of each word in a description text describing the content of the image of the learning content (for example, the text of a caption) as a text feature amount of the description text. A model learning unit 22 learns an annotation model, which is a multi-stream HMM, by using an annotation sequence for annotation, which is a multi-stream including the image feature amount of each frame and the text feature amount. The present invention may be applied when adding an annotation to content such as a television broadcast program, for example.
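As an illustration of the text feature amount described in the abstract, the following is a minimal Python sketch that turns a caption text into word frequency information. The function name `word_frequency_feature`, the fixed vocabulary, and the whitespace tokenization are assumptions made for this example and are not taken from the patent.

```python
from collections import Counter


def word_frequency_feature(text, vocabulary):
    """Return the relative frequency of each vocabulary word in `text`.

    Assumption: words are separated by whitespace and compared in lower case.
    Words outside the vocabulary are ignored.
    """
    vocab_set = set(vocabulary)
    words = [w for w in text.lower().split() if w in vocab_set]
    counts = Counter(words)
    total = sum(counts.values())
    if total == 0:
        # No vocabulary word appears in the text: return an all-zero vector.
        return [0.0] * len(vocabulary)
    return [counts[w] / total for w in vocabulary]


# Hypothetical caption text and vocabulary, for illustration only.
feature = word_frequency_feature("Goal goal kick", ["goal", "kick", "news"])
```

In this sketch the resulting vector has one entry per vocabulary word, so it could be paired frame by frame with an image feature vector to form a two-stream observation sequence.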
Publication number: US9280709(B2) Publication date: 2016.03.08
Application number: US201113814170 Filing date: 2011.08.02
Applicant: SONY CORPORATION Inventors: Suzuki Hirotaka; Ito Masato
Classification: G06F17/24; G06K9/00; G06K9/18; G06F17/30 Main classification: G06F17/24
Agency: Hazuki International, LLC Agent: Hazuki International, LLC
Principal claim: 1. An information processing device, comprising: one or more processors configured to: extract an image feature amount of each frame of an image of learning content; extract word frequency information regarding frequency of appearance of each word in a description text describing a content of the image of the learning content as a text feature amount of the description text; learn an annotation model, which is a multi-stream HMM (hidden Markov model), by using an annotation sequence for annotation, which is a multi-stream including the image feature amount and the text feature amount; and obtain an inter-state distance from one state to another state of the annotation model such that an error is minimized between i) the inter-state distance and ii) a Euclidean distance from the one state to the another state on a model map on which states of the annotation model are arranged.
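The last step of the claim, arranging states on a model map so that Euclidean distances on the map approximate the inter-state distances, can be sketched as follows. This is only an illustrative assumption: it minimizes a plain sum of squared errors by gradient descent with NumPy, whereas the claim does not specify the error function or the optimizer. The names `map_error` and `fit_model_map` and the example distance matrix are hypothetical.

```python
import numpy as np


def map_error(dist, coords):
    """Sum of squared differences between given distances and map distances."""
    diff = coords[:, None, :] - coords[None, :, :]
    eucl = np.sqrt((diff ** 2).sum(axis=-1))
    upper = np.triu_indices(len(dist), k=1)  # count each state pair once
    return float(((dist[upper] - eucl[upper]) ** 2).sum())


def fit_model_map(dist, dim=2, steps=500, lr=0.01, seed=0):
    """Place states on a `dim`-dimensional map by gradient descent.

    Assumption: `dist` is a symmetric matrix of inter-state distances
    with zeros on the diagonal.
    """
    rng = np.random.default_rng(seed)
    coords = rng.normal(size=(len(dist), dim))
    for _ in range(steps):
        diff = coords[:, None, :] - coords[None, :, :]
        eucl = np.sqrt((diff ** 2).sum(axis=-1))
        eucl = np.maximum(eucl, 1e-12)  # guard against division by zero
        # Gradient of the squared error with respect to each state's coordinates.
        coef = (eucl - dist) / eucl
        grad = 2.0 * (coef[:, :, None] * diff).sum(axis=1)
        coords -= lr * grad
    return coords


# Hypothetical inter-state distances for three states (a 3-4-5 triangle).
dist = np.array([[0.0, 3.0, 4.0],
                 [3.0, 0.0, 5.0],
                 [4.0, 5.0, 0.0]])
coords = fit_model_map(dist)
```

Calling `map_error(dist, coords)` after fitting gives a rough measure of how faithfully the map reproduces the inter-state distances; a weighted error such as the one used in a Sammon map could be substituted in the same structure.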
Address: Tokyo JP