发明名称 |
PERSONAL NAME ASSIGNMENT APPARATUS AND METHOD |
摘要 |
<p><P>PROBLEM TO BE SOLVED: To specify a scene where a desired performer appears based only on a received video picture. <P>SOLUTION: The personal name assignment apparatus includes: a means 103 for obtaining a speaker time period as a first section including a speaker name specified by information indicating the speaker name and an utterance time period of the speaker; a means 101 for obtaining a second section including an utterance from a non-silent section in the video picture; a means 105 for extracting a first feature amount characterizing the speaker from a speech waveform of the second section when the second section is included in the first section and associating the speaker name corresponding to the first section with the feature amount; a means 106 for creating a speaker model of the speaker from the feature amount for each speaker; a means 108 for obtaining a third section that is an utterance time period to be recognized; a means 109 for extracting a second feature amount characterizing the speaker from a speech waveform of the second section when the second section is included in the third section; a means 110 for calculating similarity between the feature amount of the speaker model for each speaker and the second feature amount; and a means 111 for recognizing the speaker name of the speaker model of a setting condition as the performer, in the similarity. <P>COPYRIGHT: (C)2010,JPO&INPIT</p> |
申请公布号 |
JP2009237285(A) |
申请公布日期 |
2009.10.15 |
申请号 |
JP20080083430 |
申请日期 |
2008.03.27 |
申请人 |
TOSHIBA CORP |
发明人 |
SHIMOMORI HIROSHI;UEHARA TATSUYA |
分类号 |
G10L17/00;G10L15/00;G10L15/04;G10L15/06 |
主分类号 |
G10L17/00 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|