Title: Packet loss concealment for speech coding
Abstract: A speech coding method reduces error propagation due to voice packet loss by limiting or reducing a pitch gain only for the first subframe or the first two subframes within a speech frame. The method is used for a voiced speech class. A pitch cycle length is compared to a subframe size to decide whether to reduce the pitch gain for the first subframe or for the first two subframes within the frame. A frame is classified as strongly voiced by checking whether the pitch lags are stable and the pitch gains are high enough within the frame; for a strongly voiced frame, the pitch lags and the pitch gains can be encoded more efficiently than for other speech classes.
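The strongly voiced decision described in the abstract can be illustrated with a minimal sketch. The abstract gives no numeric criteria, so the function name `is_strongly_voiced` and the thresholds `max_lag_spread` and `min_gain` are hypothetical values chosen for illustration only; they are not taken from the patent.

```python
def is_strongly_voiced(pitch_lags, pitch_gains, max_lag_spread=2, min_gain=0.8):
    """Return True when a frame looks strongly voiced.

    pitch_lags  : per-subframe pitch lags (in samples) for one frame
    pitch_gains : per-subframe pitch gains for the same frame
    Both thresholds are illustrative assumptions, not values from the patent.
    """
    if not pitch_lags or len(pitch_lags) != len(pitch_gains):
        raise ValueError("need one lag and one gain per subframe")
    # "Stable" is interpreted here as a small spread of lags within the frame.
    lags_stable = max(pitch_lags) - min(pitch_lags) <= max_lag_spread
    # "High enough" is interpreted here as every subframe gain reaching min_gain.
    gains_high = min(pitch_gains) >= min_gain
    return lags_stable and gains_high
```

For example, `is_strongly_voiced([50, 51, 50, 52], [0.9, 0.85, 0.95, 0.9])` passes both checks, while a frame whose lags jump from 50 to 70 fails the stability check.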
Publication number: US9336790(B2)  Publication date: 2016.05.10
Application number: US201414175195  Filing date: 2014.02.07
Applicant: HUAWEI TECHNOLOGIES CO., LTD  Inventor: Gao Yang
Classification: G10L19/09; G10L19/005  Main classification: G10L19/09
Agency: Huawei Technologies Co., Ltd  Agent: Huawei Technologies Co., Ltd
Main claim: 1. A method of improving packet loss concealment for speech coding while still profiting from a pitch prediction or Long-Term Prediction (LTP), the method comprising: classifying a plurality of speech frames into a plurality of classes, and wherein at least for one of the classes, the following steps are included: comparing a pitch cycle length with a subframe size within a speech frame when the subframe size is fixed or deciding a first subframe size based on a pitch cycle length within a speech frame when the first subframe size is variable; having an LTP excitation component; having a second excitation component; determining an initial energy of the LTP excitation component for every subframe within a frame of speech signal by using a regular method of minimizing a coding error or a weighted coding error at an encoder; reducing or limiting the energy of the LTP excitation component to be smaller than the initial energy of the LTP excitation component for the first subframe or the first two subframes within the frame based at least in part on the pitch cycle length compared to the subframe size; keeping the energy of the LTP excitation component to be equal to the initial energy of the LTP excitation component for any other subframe rather than the first subframe or the first two subframes within the frame; encoding the energy of the LTP excitation component for every subframe of the frame at the encoder; and forming an excitation by including the LTP excitation component and the second excitation component.
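The gain-limiting steps of the claim, for the fixed-subframe-size case, can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the claim does not say which comparison outcome selects one subframe versus two, so the rule used here (limit one subframe when the pitch lag fits within a subframe, otherwise two) is an assumption, as are the function names and the `scale` factor.

```python
def limit_ltp_gains(initial_gains, pitch_lag, subframe_size, scale=0.7):
    """Reduce the LTP (pitch) gain for the first one or two subframes.

    initial_gains : per-subframe gains from the usual error-minimizing search
    pitch_lag     : pitch cycle length in samples
    subframe_size : fixed subframe length in samples
    scale         : hypothetical reduction factor, 0 < scale < 1
    """
    if not 0.0 < scale < 1.0:
        raise ValueError("scale must lie strictly between 0 and 1")
    # Assumed rule: a pitch cycle longer than one subframe means the second
    # subframe also reaches back into the previous (possibly lost) frame.
    n_limited = 1 if pitch_lag <= subframe_size else 2
    n_limited = min(n_limited, len(initial_gains))
    # Subframes after the limited ones keep their initial gains unchanged.
    return [g * scale if i < n_limited else g
            for i, g in enumerate(initial_gains)]


def form_excitation(ltp_component, second_component, ltp_gain, second_gain):
    """Combine the LTP component and the second component sample by sample."""
    return [ltp_gain * a + second_gain * b
            for a, b in zip(ltp_component, second_component)]
```

With four subframes of 64 samples, a pitch lag of 40 scales only the first gain, whereas a pitch lag of 100 scales the first two. Scaling the gain by a factor below one reduces the energy of the LTP component relative to its initial value in those subframes.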
Address: Shenzhen CN