摘要 |
A method of encoding signal segments which represent unvoiced plosives. The signal segments to be encoded are contained within a speech signal divided into m=1, . . . , N frames. Each frame is subdivided into l=1, . . . , L subframes. The speech signal has a gain gm(l) within each subframe. An energy measure em(l) representative of the signal segments' energy content is defined. An energy threshold eth(l) representative of a sudden energy change characteristic of an unvoiced plosive is also defined. For each frame, the energy measure em(l) and the energy threshold eth(l) are derived for each subframe within that frame. If em(l)<=eth(l) for each subframe within a particular frame, then a plosive locator lpl=0 and a plosive index ipl=0 are assigned to that frame to indicate absence of a plosive within that frame. If em(l)>eth(l) for any subframe within the frame, then that frame's plosive locator lpl is assigned a non-zero value, with the plosive locator's value indicating location of the plosive at a transition point immediately following that one of the subframes within the frame for which em(l)-eth(l) is greatest; and, that frame's plosive index ipl is assigned a non-zero value representing presence of a plosive within that frame. |