摘要 |
PURPOSE:To detect even a voice of a frictional sound by detecting a start point and an end point of a voiced section of an input voice by a dynamic feature of a voice spectral dispersion and energy. CONSTITUTION:The device is provided with an energy extracting part 20 for extracting the energy setting digital voice data in a prescribed section as one frame, an energy threshold calculating part 30 for adjusting an average value of background noise energy of a frame as an energy threshold, a spectral dispersion extracting part 61 for calculating a spectral dispersion by deriving an average value of a spectrum of the frame by a frequency of the frame, and a spectral dispersion threshold calculating part 71 for adjusting a spectral dispersion average value of a background noise as a spectral dispersion threshold. In this state, the energy and the spectral dispersion of each frame are compared with the energy threshold and the spectral dispersion threshold and whether it is a start point of a voice section or an end point is checked. In such a way, a voice start point of weak energy such as a frictional sound can be detected. |