Title of invention |
Complexity scalable perceptual tempo estimation |
Abstract |
The present document relates to methods and systems for estimating the tempo of a media signal, such as an audio or combined video/audio signal. In particular, the document relates to the estimation of tempo as perceived by human listeners, as well as to methods and systems for tempo estimation at scalable computational complexity. A method and system for extracting tempo information of an audio signal from an encoded bit-stream of the audio signal comprising spectral band replication data is described. The method comprises the steps of determining a payload quantity associated with the amount of spectral band replication data comprised in the encoded bit-stream for a time interval of the audio signal; repeating the determining step for successive time intervals of the encoded bit-stream of the audio signal, thereby determining a sequence of payload quantities; identifying a periodicity in the sequence of payload quantities; and extracting tempo information of the audio signal from the identified periodicity. |
Publication number |
US9466275(B2) |
Publication date |
2016.10.11 |
Application number |
US201013503136 |
Filing date |
2010.10.26 |
Applicant |
Dolby International AB |
Inventors |
Biswas Arijit;Hollosi Danilo;Schug Michael |
Classification |
G10H7/00;G10H1/40 |
Main classification |
G10H7/00 |
Agency |
|
Agent |
|
Main claim |
1. A method for extracting tempo information of an audio signal, the method comprising:
providing a compressed, spectral band replication (SBR) encoded bitstream of the audio signal, wherein the encoded bitstream comprises spectral band replication data; determining an amount of data comprised in one or more fill-element fields of the encoded bit-stream for a time-interval of the audio signal; determining a size of SBR payload data comprised in the encoded bit-stream for the time interval of the audio signal based on the amount of data comprised in the one or more fill-element fields of the encoded bit-stream for the time-interval of the audio signal; repeating the determining steps for successive time intervals of the encoded bit-stream of the audio signal, thereby determining a sequence of sizes of SBR payload data; identifying a periodicity in the sequence of sizes of SBR payload data; and extracting tempo information of the audio signal from the identified periodicity, wherein the method is implemented by an audio signal processing device comprising one or more hardware elements. |
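The last two claimed steps (identifying a periodicity in the sequence of SBR payload sizes and deriving a tempo from it) can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the per-frame SBR payload sizes have already been read from the fill-element fields, and it uses plain autocorrelation as the periodicity detector, which the claim does not prescribe. The function name `estimate_tempo_bpm`, the `frames_per_second` parameter, and the 60 to 200 BPM search range are hypothetical choices made for illustration.

```python
import math


def estimate_tempo_bpm(payload_sizes, frames_per_second,
                       min_bpm=60.0, max_bpm=200.0):
    """Estimate a tempo in BPM from per-frame SBR payload sizes.

    Assumption: payload_sizes[i] is the SBR payload size of frame i,
    already extracted from the bitstream by some other component.
    """
    n = len(payload_sizes)
    if n < 2:
        raise ValueError("need at least two payload sizes")

    # Remove the mean so the autocorrelation reflects fluctuations only.
    mean = sum(payload_sizes) / n
    centered = [size - mean for size in payload_sizes]

    # Convert the tempo search range into a lag range measured in frames.
    min_lag = max(1, math.ceil(60.0 * frames_per_second / max_bpm))
    max_lag = min(n - 1, math.floor(60.0 * frames_per_second / min_bpm))
    if min_lag > max_lag:
        raise ValueError("sequence too short for the requested tempo range")

    # Unnormalized (biased) autocorrelation: longer lags sum fewer terms,
    # which favors the shortest period over its integer multiples.
    best_lag = min_lag
    best_score = -math.inf
    for lag in range(min_lag, max_lag + 1):
        score = sum(centered[i] * centered[i + lag] for i in range(n - lag))
        if score > best_score:
            best_lag, best_score = lag, score

    # One period of best_lag frames corresponds to one beat.
    return 60.0 * frames_per_second / best_lag
```

With a synthetic sequence in which the payload size peaks once every 20 frames and an assumed rate of 40 frames per second, the strongest lag is 20 frames, that is, one beat every 0.5 s. A constant sequence has no periodicity, so the value returned for it is not meaningful.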
Address |
Amsterdam NL |