主权项 |
1. A method comprising:
accessing, by one or more processors, a memory to retrieve audio data that represents audio content received in an audio signal; determining, via audio processing by the one or more processors, a rhythm in the audio content, the rhythm being represented by a rhythm value; selecting, by the one or more processors, a category of the audio content among a set of categories that includes a music category and a talk category, the selecting being based on the rhythm in the audio content and a set of threshold values that each correspond to a different category in the set of categories; detecting, by the one or more processors, a transition between the music category and the talk category by comparing the rhythm value to a threshold value among the set of threshold values; modifying, by the one or more processors, at least one threshold value among the set of threshold values based on the detecting of the transition between the music category and the talk category; and controlling, by the one or more processors, a device based on the detected transition between the music category and the talk category. |