发明名称 APPARATUS, METHOD AND PROGRAM FOR PROCESSING VIDEO DATA WITH SOUND
摘要 PROBLEM TO BE SOLVED: To provide an apparatus, method and program for processing video data with sound in which voices of a photographer can be effectively utilized in reproduction. SOLUTION: A voice signal analyzing section 52 converts human voices that can be converted into characters, from photographer voice data 66 read from a photographer voice signal recording section 50, into characters through voice recognition processing and outputs them as speech content information. Furthermore, the voice signal analyzing section 52 acquires information of a speech time during which the voices converted into characters are generated. The speech time information is an information (frame) number specifying frames of video data (motion pictures) when starting and completing a speech, speech start time and end time, and the like. A meta-data generating section 54 stores the speech time information, the speech content information and the like in meta-data of a predetermined file format (e.g., xml format). These meta-data are associated with photographer voice data 66 and recorded in a photographer voice signal recording section 50. COPYRIGHT: (C)2007,JPO&INPIT
申请公布号 JP2007104405(A) 申请公布日期 2007.04.19
申请号 JP20050292486 申请日期 2005.10.05
申请人 FUJIFILM CORP 发明人 TERAYOKO SUNAO;SAWANO TETSUYA
分类号 H04N5/928;G10L15/00;H04N5/225 主分类号 H04N5/928
代理机构 代理人
主权项
地址