Abstract |
According to an exemplary embodiment, an audio signal processing device is provided, the device comprising: a tempo estimator configured to estimate a tempo of an audio signal from a sequence of samples representing the audio signal in the time domain; an energy level calculator configured to derive a plurality of partially overlapping sub-sequences from the sequence and to calculate an energy level for each of the sub-sequences; and a highlight extractor configured to extract, on the basis of the energy levels, one of the plurality of sub-sequences as a highlight portion of the audio signal, wherein each sub-sequence has a time duration defined according to the estimated tempo, and each energy level indicates the energy of the samples in its corresponding sub-sequence. |
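The processing chain described in the abstract can be illustrated with a minimal sketch. It is not the claimed implementation, and it rests on several assumptions that do not appear in the abstract: the tempo is taken as an already estimated input in beats per minute (the tempo estimator itself is not shown), the sub-sequence duration is a fixed number of beats (`beats_per_window`), the energy is the sum of squared samples, consecutive sub-sequences overlap by a fixed fraction (`overlap`), and the highlight is simply the sub-sequence with the highest energy. All function and parameter names are hypothetical.

```python
from typing import List, Sequence, Tuple


def window_length_from_tempo(tempo_bpm: float, sample_rate: int,
                             beats_per_window: int = 16) -> int:
    """Number of samples spanning `beats_per_window` beats at the given tempo.

    Tying the duration to a whole number of beats is an assumption; the
    abstract only says the duration is defined according to the tempo.
    """
    if tempo_bpm <= 0:
        raise ValueError("tempo must be positive")
    seconds_per_beat = 60.0 / tempo_bpm
    return max(1, round(beats_per_window * seconds_per_beat * sample_rate))


def energy_levels(samples: Sequence[float], window: int,
                  hop: int) -> List[Tuple[int, float]]:
    """Return (start index, energy) for each partially overlapping sub-sequence.

    Energy is taken here as the sum of squared samples (an assumption).
    """
    levels = []
    for start in range(0, len(samples) - window + 1, hop):
        energy = sum(x * x for x in samples[start:start + window])
        levels.append((start, energy))
    return levels


def extract_highlight(samples: Sequence[float], tempo_bpm: float,
                      sample_rate: int, beats_per_window: int = 16,
                      overlap: float = 0.5) -> Tuple[int, int]:
    """Return (start, end) sample indices of the highest-energy sub-sequence."""
    window = window_length_from_tempo(tempo_bpm, sample_rate, beats_per_window)
    # A hop smaller than the window makes consecutive sub-sequences overlap.
    hop = max(1, int(window * (1.0 - overlap)))
    levels = energy_levels(samples, window, hop)
    if not levels:
        raise ValueError("signal is shorter than one sub-sequence")
    # Selection rule (assumption): pick the sub-sequence with maximum energy.
    start, _ = max(levels, key=lambda pair: pair[1])
    return start, start + window
```

For example, with a toy signal sampled at 2 Hz, an assumed tempo of 60 BPM, and two beats per sub-sequence, each sub-sequence spans four samples, and `extract_highlight([0.0] * 8 + [1.0] * 4 + [0.0] * 8, 60, 2, 2)` selects the four-sample burst in the middle of the signal.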