发明名称 Complexity scalable perceptual tempo estimation
摘要 The present document relates to methods and systems for estimating the tempo of a media signal, such as audio or combined video/audio signal. In particular, the document relates to the estimation of tempo perceived by human listeners, as well as to methods and systems for tempo estimation at scalable computational complexity. A method and system for extracting tempo information of an audio signal from an encoded bit-stream of the audio signal comprising spectral band replication data is described. The method comprises the steps of determining a payload quantity associated with the amount of spectral band replication data comprised in the encoded bit-stream for a time interval of the audio signal; repeating the determining step for successive time intervals of the encoded bit-stream of the audio signal, thereby determining a sequence of payload quantities; identifying a periodicity in the sequence of payload quantities; and extracting tempo information of the audio signal from the identified periodicity.
申请公布号 US9466275(B2) 申请公布日期 2016.10.11
申请号 US201013503136 申请日期 2010.10.26
申请人 Dolby International AB 发明人 Biswas Arijit;Hollosi Danilo;Schug Michael
分类号 G10H7/00;G10H1/40 主分类号 G10H7/00
代理机构 代理人
主权项 1. A method for extracting tempo information of an audio signal, the method comprising: providing a compressed, spectral band replication (SBR) encoded bitstream of the audio signal, wherein the encoded bitstream comprises spectral band replication data; determining an amount of data comprised in one or more fill-element fields of the encoded bit-stream for a time-interval of the audio signal; determining a size of SBR payload data comprised in the encoded bit-stream for the time interval of the audio signal based on the amount of data comprised in the one or more fill-element fields of the encoded bit-stream for the time-interval of the audio signal; repeating the determining steps for successive time intervals of the encoded bit-stream of the audio signal, thereby determining a sequence of sizes of SBR payload data; identifying a periodicity in the sequence of sizes of SBR payload data; and extracting tempo information of the audio signal from the identified periodicity, wherein the method is implemented by an audio signal processing device comprising one or more hardware elements.
地址 Amsterdam NL