发明名称 SPEECH PROCESSING DEVICE, METHOD, AND PROGRAM
摘要 PROBLEM TO BE SOLVED: To allow an arbitrary content to be identified with higher accuracy.SOLUTION: An input signal processing unit extracts, from an input signal of a content as an identification object, a sound feature quantity IA1 indicative of a likelihood that the signal in a time-frequency domain is a sine wave, and a sound feature quantity IA2 indicative of individuality of the signal. A reference signal processing unit extracts a sound feature quantity RA1 and a sound feature quantity RA2 corresponding to the sound feature quantity IA1 and the sound feature quantity IA2 from a reference signal of an original content. A matching processing unit calculates a mask pattern from the sound feature quantity IA1 and the sound feature quantity RA1 and calculates similarity between these sound feature quantities. The matching processing unit calculates similarity between the input signal and the reference signal on the basis of the sound feature quantity IA2 and the sound feature quantity RA2, thereby identifying the content. This technique is applicable to a speech processing device.
申请公布号 JP2014115605(A) 申请公布日期 2014.06.26
申请号 JP20130037542 申请日期 2013.02.27
申请人 SONY CORP 发明人 SHIBUYA TAKASHI;ABE MOTOTSUGU;NISHIGUCHI MASAYUKI
分类号 G10L15/10;G06F17/30;G10L25/51 主分类号 G10L15/10
代理机构 代理人
主权项
地址