AUTOMATIC EXTRACTION OF MUSICAL PORTIONS OF AN AUDIO STREAM
摘要
Music and non-music portions in an audio stream are identified. The audio stream is digitized and segmented into frames. Selected frames are passed through a filter bank which includes filters having bandwidths approximately proportional to their center frequencies. The spectral flux for each selected frame is calculated and smoothed. Frames having a smoothed spectral flux below a threshold value are associated with music, and frames having a smoothed spectral flux above a threshold value are associated with non-music.
申请公布号
WO2005060337(A2)
申请公布日期
2005.07.07
申请号
WO2004IB04085
申请日期
2004.12.08
申请人
NOKIA CORPORATION;NOKIA, INC.;KIRKEBY, OLE;HUOPANIEMI, JYRI;SORSA, TIMO