发明名称 Correction of energy as input feature for speech processing
摘要 The invention provides a method for processing speech comprising the steps of receiving a speech input (SI) of a speaker, generating speech parameters (SP) from said speech input (SI), determining parameters describing an absolute loudness (L) of said speech input (SI), and evaluating (EV) said speech input (SI) and/or said speech parameters (SP) using said parameters describing the absolute loudness (L). In particular, the step of evaluation (EV) comprises a step of emotion recognition and/or speaker identification. Further, a microphone array comprising a plurality of microphones is used for determining said parameters describing the absolute loudness. With a microphone array the distance of the speaker from the microphone array can be determined and the loudness can be normalized by the distance. Thus, the absolute loudness becomes independent from the distance of the speaker to the microphone, and absolute loudness can now be used as an input parameter for emotion recognition and/or speaker identification. <IMAGE>
申请公布号 EP1429314(A1) 申请公布日期 2004.06.16
申请号 EP20020027964 申请日期 2002.12.13
申请人 SONY INTERNATIONAL (EUROPE) GMBH 发明人 KOMPE, RALF;KEMP, THOMAS;TATO, RAQUEL
分类号 G10L15/02;G10L15/10;G10L17/02;G10L17/26;H04R1/40;(IPC1-7):G10L17/00 主分类号 G10L15/02
代理机构 代理人
主权项
地址