发明名称 PHONEME SEGMENTATION METHOD AND DEVICE
摘要 PROBLEM TO BE SOLVED: To provide a phoneme segmentation method and a device that extract and merge shape parameters of a speech waveform to specify the shape, i.e. to structure a shape fluctuation function, and perform precise phoneme segmentation. SOLUTION: For speech data read out of a speech data storage part 2, a zero-cross period of its speech waveform is computed as a 1st parameter, a peak level of a 1st maximum value between a zero cross and a zero cross of the speech waveform as a 2nd parameter, a maximum amplitude in each specified period which is updated between the zero cross and zero cross of the audio waveform as a 3rd parameter, the number of maximum values between the zero cross and zero cross point of the audio waveform as a 4th parameter, and the cross angle of the amplitude value at the time of zero crossing of the audio waveform with respect to a zero point, as a 5th parameter; and the 1st to 5th parameters are weighted respectively and results obtained by multiplying or adding the 1st to 5th parameters are compared with a threshold to decide phoneme borders, thereby performing speech segmentation. COPYRIGHT: (C)2007,JPO&INPIT
申请公布号 JP2006284907(A) 申请公布日期 2006.10.19
申请号 JP20050104513 申请日期 2005.03.31
申请人 HOKKAIDO UNIV;CRYPTON FUTURE MEDIA INC 发明人 AOKI TADASHI;ITO HIROYUKI
分类号 G10L15/04;G10L15/02 主分类号 G10L15/04
代理机构 代理人
主权项
地址