Title of Invention |
SPEECH PROCESSING DEVICE, METHOD, AND PROGRAM |
Abstract |
PROBLEM TO BE SOLVED: To identify arbitrary content with higher accuracy. SOLUTION: An input signal processing unit extracts, from an input signal of the content to be identified, a sound feature quantity IA1 indicating the likelihood that the signal in the time-frequency domain is a sine wave, and a sound feature quantity IA2 indicating the individuality of the signal. A reference signal processing unit extracts, from a reference signal of the original content, sound feature quantities RA1 and RA2 corresponding to IA1 and IA2, respectively. A matching processing unit calculates a mask pattern from the sound feature quantities IA1 and RA1 and calculates the similarity between these sound feature quantities. The matching processing unit then calculates the similarity between the input signal and the reference signal on the basis of the sound feature quantities IA2 and RA2, thereby identifying the content. This technique is applicable to a speech processing device. |
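The matching flow outlined in the abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say how the features are computed, how the mask is derived, or which similarity measure is used. The sketch assumes the four feature quantities are already available as time-by-frequency grids (nested lists), assumes the mask keeps cells where both IA1 and RA1 exceed a hypothetical `threshold`, and assumes a masked cosine similarity over IA2 and RA2. The function names `mask_pattern`, `masked_similarity`, and `identify` are illustrative only.

```python
import math


def mask_pattern(ia1, ra1, threshold=0.5):
    """Binary mask: 1 where both sine-likelihood features reach the threshold.

    Assumption: the abstract only states that a mask is derived from IA1 and
    RA1; the thresholding rule used here is hypothetical.
    """
    return [
        [1 if a >= threshold and r >= threshold else 0
         for a, r in zip(row_a, row_r)]
        for row_a, row_r in zip(ia1, ra1)
    ]


def masked_similarity(ia2, ra2, mask):
    """Cosine similarity of IA2 and RA2 over the cells kept by the mask.

    Assumption: cosine similarity stands in for the unspecified measure.
    """
    dot = norm_i = norm_r = 0.0
    for row_i, row_r, row_m in zip(ia2, ra2, mask):
        for x, y, m in zip(row_i, row_r, row_m):
            if m:
                dot += x * y
                norm_i += x * x
                norm_r += y * y
    if norm_i == 0.0 or norm_r == 0.0:
        return 0.0  # nothing comparable survived the mask
    return dot / math.sqrt(norm_i * norm_r)


def identify(ia1, ia2, references, threshold=0.5):
    """Return (name, score) of the reference most similar to the input.

    `references` maps a content name to its (RA1, RA2) feature grids.
    """
    best_name, best_score = None, float("-inf")
    for name, (ra1, ra2) in references.items():
        mask = mask_pattern(ia1, ra1, threshold)
        score = masked_similarity(ia2, ra2, mask)
        if score > best_score:
            best_name, best_score = name, score
    return best_name, best_score


if __name__ == "__main__":
    # Toy 2x2 grids; real features would come from a time-frequency analysis.
    ia1 = [[0.9, 0.1], [0.8, 0.9]]
    ia2 = [[1.0, 5.0], [2.0, 3.0]]
    refs = {
        "song_a": ([[0.9, 0.9], [0.9, 0.9]], [[1.0, -5.0], [2.0, 3.0]]),
        "song_b": ([[0.9, 0.9], [0.9, 0.9]], [[-1.0, 5.0], [2.0, -3.0]]),
    }
    print(identify(ia1, ia2, refs))
```

In this toy input, the cell where IA1 is low is excluded from the comparison, so a disagreement between IA2 and RA2 in that cell does not lower the score for `song_a`. That is the intended effect of masking in this sketch: only regions judged reliable in both signals contribute to the match.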
Publication Number |
JP2014115605(A) |
Publication Date |
2014.06.26 |
Application Number |
JP20130037542 |
Filing Date |
2013.02.27 |
Applicant |
SONY CORP |
Inventors |
SHIBUYA TAKASHI;ABE MOTOTSUGU;NISHIGUCHI MASAYUKI |
Classification Numbers |
G10L15/10;G06F17/30;G10L25/51 |
Main Classification Number |
G10L15/10 |
Agency |
|
Agent |
|
Principal Claim |
|
Address |
|