摘要 |
PROBLEM TO BE SOLVED: To provide a phoneme segmentation method and a device that extract and merge shape parameters of a speech waveform to specify the shape, i.e. to structure a shape fluctuation function, and perform precise phoneme segmentation. SOLUTION: For speech data read out of a speech data storage part 2, a zero-cross period of its speech waveform is computed as a 1st parameter, a peak level of a 1st maximum value between a zero cross and a zero cross of the speech waveform as a 2nd parameter, a maximum amplitude in each specified period which is updated between the zero cross and zero cross of the audio waveform as a 3rd parameter, the number of maximum values between the zero cross and zero cross point of the audio waveform as a 4th parameter, and the cross angle of the amplitude value at the time of zero crossing of the audio waveform with respect to a zero point, as a 5th parameter; and the 1st to 5th parameters are weighted respectively and results obtained by multiplying or adding the 1st to 5th parameters are compared with a threshold to decide phoneme borders, thereby performing speech segmentation. COPYRIGHT: (C)2007,JPO&INPIT
|