摘要 |
A server system 500 is provided for receiving video clips having an associated audio/musical track for processing at the server system. The system comprises a first beat tracking module for generating a first beat time sequence from the audio signal using an estimation of the signal's tempo and chroma accent information. A ceiling and floor function is applied to the tempo estimation to provide integer versions which are subsequently applied separately to a further accent signal derived from a lower-frequency sub-band of the audio signal to generate second and third beat time sequences. A selection module then compares each of the beat time sequences with the further accent signal to identify a best match. |