Title of Invention |
Audio Classification Based on Perceptual Quality for Low or Medium Bit Rates |
Abstract |
The quality of encoded signals can be improved by re-classifying AUDIO signals carrying non-speech data as VOICED signals when periodicity parameters of the signal satisfy one or more criteria. In some embodiments, only low or medium bit rate signals are considered for re-classification. The periodicity parameters can include any characteristic or set of characteristics indicative of periodicity. For example, the periodicity parameters may include pitch differences between subframes in the audio signal, a normalized pitch correlation for one or more subframes, an average normalized pitch correlation for the audio signal, or combinations thereof. Audio signals that are re-classified as VOICED signals may be encoded in the time-domain, while audio signals that remain classified as AUDIO signals may be encoded in the frequency-domain. |
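The abstract names normalized pitch correlation as one periodicity parameter but does not define it. As a minimal sketch, assuming the common textbook definition (the cross-correlation of a subframe with the segment one pitch lag earlier, divided by the geometric mean of their energies), it could be computed as follows. The function name, argument names, and the formula itself are assumptions for illustration, not taken from the record.

```python
import math


def normalized_pitch_correlation(samples, start, length, pitch_lag):
    """Correlate a subframe with the segment one pitch lag earlier.

    Assumed definition (not stated in the abstract):
        sum(s[n] * s[n - P]) / sqrt(sum(s[n]^2) * sum(s[n - P]^2))
    where P is the pitch lag in samples and n runs over the subframe.
    """
    if start - pitch_lag < 0:
        raise ValueError("not enough history before the subframe for this lag")
    cross = energy_now = energy_past = 0.0
    for n in range(start, start + length):
        cross += samples[n] * samples[n - pitch_lag]
        energy_now += samples[n] ** 2
        energy_past += samples[n - pitch_lag] ** 2
    denom = math.sqrt(energy_now * energy_past)
    # A silent segment has no defined correlation; report 0.0 in that case.
    return cross / denom if denom > 0.0 else 0.0
```

Under this definition a strongly periodic subframe evaluated at its true pitch lag yields a value close to 1, while noise-like content yields a value close to 0.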
Publication Number |
US2017116999(A1) |
Publication Date |
2017.04.27 |
Application Number |
US201715398321 |
Filing Date |
2017.01.04 |
Applicant |
HUAWEI TECHNOLOGIES CO.,LTD. |
Inventor |
Gao Yang |
Classification |
G10L19/24;G10L25/93 |
Main Classification |
G10L19/24 |
Agency |
|
Agent |
|
Independent Claim |
1. A method for encoding signals, the method, which is performed by an audio coder, comprising:
receiving a digital signal comprising audio data;
classifying the digital signal as an AUDIO signal;
re-classifying the digital signal as a VOICED signal when classifying conditions are satisfied, wherein, the classifying conditions include: pitch differences between sub-frames in the digital signal are less than a first threshold, an average normalized pitch correlation value for the sub-frames in the digital signal is greater than a second threshold, and a smoothed pitch correlation obtained according to the average normalized pitch correlation value is greater than a third threshold; wherein each of the pitch differences is an absolute value of the difference between two pitch values corresponding to two sub-frames respectively; and
encoding the re-classified VOICED signal in the time-domain when one or more encoding conditions are satisfied, wherein the one or more encoding conditions include: a coding rate of the digital signal is below a fourth threshold. |
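The decision logic recited in the claim can be sketched roughly as below. This is an illustration under stated assumptions, not the patented implementation: the claim gives no numeric thresholds, so every value in `Thresholds` is a placeholder. Comparing only adjacent sub-frames, the exponential smoothing rule with its `alpha` factor, and the fall-back to frequency-domain coding for a VOICED frame at a higher rate are likewise assumptions. All names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Thresholds:
    # Placeholder values only; the claim does not state any numbers.
    max_pitch_diff: float = 3.0       # "first threshold"
    min_avg_corr: float = 0.8         # "second threshold"
    min_smoothed_corr: float = 0.8    # "third threshold"
    max_bit_rate: float = 24000.0     # "fourth threshold", bits per second


def reclassify(pitches, correlations, prev_smoothed, thresholds, alpha=0.75):
    """Return (label, smoothed_corr) for a frame first classified as AUDIO.

    pitches: one pitch value per sub-frame.
    correlations: one normalized pitch correlation per sub-frame.
    prev_smoothed: smoothed correlation carried over from the previous frame.
    """
    if len(pitches) < 2 or not correlations:
        raise ValueError("need at least two pitches and one correlation")
    # Absolute pitch difference between adjacent sub-frames (assumed pairing).
    diffs = [abs(a - b) for a, b in zip(pitches, pitches[1:])]
    avg_corr = sum(correlations) / len(correlations)
    # Assumed smoothing: exponential average across frames.
    smoothed = alpha * prev_smoothed + (1.0 - alpha) * avg_corr
    voiced = (
        all(d < thresholds.max_pitch_diff for d in diffs)
        and avg_corr > thresholds.min_avg_corr
        and smoothed > thresholds.min_smoothed_corr
    )
    return ("VOICED" if voiced else "AUDIO"), smoothed


def choose_domain(label, bit_rate, thresholds):
    """Pick time-domain coding for VOICED frames below the rate threshold."""
    if label == "VOICED" and bit_rate < thresholds.max_bit_rate:
        return "time"
    # Assumption: everything else is coded in the frequency domain.
    return "frequency"
```

For example, a frame with sub-frame pitches `[50, 51, 50, 51]` and correlations around 0.9 would be re-labelled VOICED under these placeholder thresholds and, at 12000 bits per second, routed to time-domain coding. A frame whose pitch jumps from 50 to 70 between sub-frames would stay AUDIO.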
Address |
Shenzhen CN |