Title of invention |
VOICE SIGNAL PROCESSING APPARATUS, VOICE SIGNAL PROCESSING METHOD, AND PROGRAM |
Abstract |
PROBLEM TO BE SOLVED: To provide an apparatus that accurately separates a command from a specified sound source out of a voice signal in which a plurality of sounds are mixed. SOLUTION: From a learning voice signal containing sounds from a plurality of sound sources, learning data is generated that consists of the base frequencies B corresponding to each of the sound sources and the all base frequencies B_all, which combine the base frequencies B of the respective sound sources. Time-frequency analysis is applied to an input voice signal to produce a time-frequency analysis result. Base decomposition using the all base frequencies B_all is then applied to this result to produce a base activity H for the input voice signal. Finally, identification processing is executed on the resulting base activity H to identify the command. Sound source separation based on the learning data thus achieves highly accurate command identification. COPYRIGHT: (C)2012,JPO&INPIT |
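The decomposition step described in the abstract can be illustrated with a minimal sketch. It assumes that "base decomposition" means non-negative matrix factorization with the learned bases held fixed and a Euclidean cost, which the abstract does not state. All names (`matmul`, `estimate_activity`, `B_speech`, `B_noise`) and the toy values are hypothetical, and the time-frequency analysis and command identification stages are left out.

```python
# Sketch only: assumes "base decomposition" is NMF with the bases held fixed.
# All function and variable names here are hypothetical, not from the patent.

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def estimate_activity(V, B_all, n_iter=200, eps=1e-9):
    """Estimate a non-negative activity H such that V is close to B_all * H.

    B_all stays fixed; only H is updated, using the multiplicative update
    rule for the Euclidean NMF cost (an assumption of this sketch).
    """
    n_bases, n_frames = len(B_all[0]), len(V[0])
    Bt = [list(col) for col in zip(*B_all)]   # transpose of B_all
    BtV = matmul(Bt, V)                       # constant numerator term
    BtB = matmul(Bt, B_all)                   # constant Gram matrix
    H = [[1.0] * n_frames for _ in range(n_bases)]
    for _ in range(n_iter):
        BtBH = matmul(BtB, H)
        H = [[H[k][t] * BtV[k][t] / (BtBH[k][t] + eps)
              for t in range(n_frames)]
             for k in range(n_bases)]
    return H


# Toy learned bases: one row per frequency bin, one column per basis.
B_speech = [[1.0], [0.5], [0.0], [0.0]]   # hypothetical command-voice basis
B_noise = [[0.0], [0.0], [1.0], [1.0]]    # hypothetical interfering-source basis

# B_all combines the per-source bases by concatenating their columns.
B_all = [s + n for s, n in zip(B_speech, B_noise)]

# Toy magnitude spectrogram of the input signal (4 bins x 3 frames).
V = [[2.0, 0.0, 1.0],
     [1.0, 0.0, 0.5],
     [0.0, 3.0, 1.0],
     [0.0, 3.0, 1.0]]

H = estimate_activity(V, B_all)

# Rows of H that belong to the command-voice bases; an identifier
# (not shown) would take these activities as its input features.
H_speech = H[:len(B_speech[0])]
```

In this toy case the two bases do not overlap in frequency, so the activity rows separate cleanly. Real learned bases generally overlap, and the choice of cost function and iteration count would need tuning.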
Application publication number |
JP2012163918(A) |
Application publication date |
2012.08.30 |
Application number |
JP20110026240 |
Filing date |
2011.02.09 |
Applicant |
SONY CORP |
Inventors |
MITSUFUJI YUKI;NISHIGUCHI MASAYUKI |
Classification numbers |
G10L15/02;G06N3/00;G10L11/00;G10L15/20 |
Main classification number |
G10L15/02 |
Patent agency |
|
Agent |
|
Principal claim |
|
Address |
|