发明名称 Apparatus and method for automatic classification of audio signals
摘要 The present invention relates to an apparatus and a method for automatic classification of audio signals. <??>Such an apparatus comprises: signal input means (3) for supplying audio signals; audio signal fragmenting means (4) for partitioning audio signals supplied by the signal input means (3) into audio fragments of a predetermined length; feature extracting means (5) for analysing acoustic characteristics of the audio signals comprised in the audio fragments; and classifying means (6) for discriminating the audio fragments provided by the audio signal fragmenting means (4) into a predetermined audio class based on predetermined audio class classsifying models (71,72,73) by using acoustic characteristics of the audio signals comprised in the audio fragments, wherein a predetermined audio class classifying model (71,72,73) is provided for each audio class and each audio class represents a respective kind of audio signals comprised in the corresponding audio fragment. <??>It is a disadvantage that singing voice included in the audio signal frequently is misclassified as speech, particularly when the singing voice is the dominant signal component. The reason is that singing voice is more similar to speech than to music. <??>To solve this problem, according to the present invention an individual predetermined audio class classifying model (71,72,73) is provided for at least each audio class "speech", "music" and "singing voice". <??>Furthermore, the above disadvantage is overcome by the inventive method and the inventive software product. <IMAGE>
申请公布号 EP1542206(A1) 申请公布日期 2005.06.15
申请号 EP20030028573 申请日期 2003.12.11
申请人 SONY INTERNATIONAL (EUROPE) GMBH 发明人 LAM, YIN HAY;MARASEK, KRZYSZTOF;SCHAAF, THOMAS;SCHIMANOWSKI, JUERGEN;KEMP, THOMAS
分类号 G10L25/48;(IPC1-7):G10L11/00 主分类号 G10L25/48
代理机构 代理人
主权项
地址