发明名称 |
Method and system for distinguishing speech from music in a digital audio signal in real time |
摘要 |
The present invention relates to method and system for distinguishing speech from music in a digital audio signal in real time. A method for distinguishing speech from music in a digital audio signal in real time for the sound segments that have been segmented from an input signal of the digital sound processing systems by means of a segmentation unit on the base of homogeneity of their properties, comprises the steps of: (a) framing an input signal into sequence of overlapped frames by a windowing function; (b) calculating frame spectrum for every frame by FFT transform; (c) calculating segment harmony measure on base of frame spectrum sequence; (d) calculating segment noise measure on base of the frame spectrum sequence; (e) calculating segment tail measure on base of the frame spectrum sequence; (f) calculating segment drag out measure on base of the frame spectrum sequence; (g) calculating segment rhythm measure on base of the frame spectrum sequence; and (h) making the distinguishing decision based on characteristics calculated.
|
申请公布号 |
US7191128(B2) |
申请公布日期 |
2007.03.13 |
申请号 |
US20030370063 |
申请日期 |
2003.02.21 |
申请人 |
LG ELECTRONICS INC. |
发明人 |
SALL MIKHAEL A.;GRAMNITSKIY SERGEI N.;MAIBORODA ALEXANDR L.;REDKOV VICTOR V.;TIKHOTSKY ANATOLI I.;VIKTOROV ANDREI B. |
分类号 |
G10L11/00;G10L19/02;G10L11/02 |
主分类号 |
G10L11/00 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|