Title of invention: VOICE PROCESSING APPARATUS AND PROGRAM
Abstract: PROBLEM TO BE SOLVED: To segment a voice signal containing the voices of a plurality of speakers into sections for each speaker. SOLUTION: A voice segmentation section 12 specifies an envelope E of the waveform of a voice signal S containing the voices of a plurality of speakers, and detects a plurality of valleys D in the envelope E. Each valley D is the boundary between a section in which the level of the envelope E decreases continuously for a prescribed time period and a section in which the level of the envelope E increases continuously for a prescribed time period. The voice segmentation section 12 segments the voice signal S into a plurality of sections B, using each valley D as a boundary. Moreover, the voice segmentation section 12 specifies a peak value Lp for each of a plurality of peaks P of the envelope E, and determines that any section B whose peak P has a peak value Lp lower than a threshold TH is a silent section. COPYRIGHT: (C) 2009, JPO & INPIT
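The segmentation described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the abstract does not say how the envelope E is computed, so the moving average of the rectified signal used here is an assumption, as are the function names and the parameters `window` (smoothing length) and `run` (the "prescribed time period", expressed in samples).

```python
def envelope(signal, window):
    """Assumed envelope estimator: trailing moving average of |signal|."""
    rectified = [abs(x) for x in signal]
    env = []
    for i in range(len(rectified)):
        chunk = rectified[max(0, i - window + 1): i + 1]
        env.append(sum(chunk) / len(chunk))
    return env


def find_valleys(env, run):
    """Return indices D where the envelope falls for `run` consecutive
    steps up to the index and rises for `run` consecutive steps after it."""
    valleys = []
    for i in range(run, len(env) - run):
        falling = all(env[k] > env[k + 1] for k in range(i - run, i))
        rising = all(env[k] < env[k + 1] for k in range(i, i + run))
        if falling and rising:
            valleys.append(i)
    return valleys


def split_sections(env, valleys):
    """Split the envelope into sections B, using each valley as a boundary."""
    bounds = [0] + list(valleys) + [len(env)]
    return [(a, b) for a, b in zip(bounds, bounds[1:]) if b > a]


def silent_flags(env, sections, threshold):
    """Flag a section as silent when its peak value Lp is below TH."""
    return [max(env[a:b]) < threshold for a, b in sections]


# Toy envelope: a loud section, a quiet section, and another loud section.
env = [1, 3, 5, 3, 1, 0.2, 0.3, 0.4, 0.3, 0.2, 0.1, 2, 4, 6, 4, 2]
valleys = find_valleys(env, run=2)
sections = split_sections(env, valleys)
flags = silent_flags(env, sections, threshold=1.0)
```

On this toy envelope the valleys fall at indices 5 and 10, and only the middle section, whose peak is 0.4, is flagged as silent. In practice `run` and `threshold` would need tuning to the sampling rate and recording level.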
Publication No.: JP2009020457(A) Publication date: 2009.01.29
Application No.: JP20070184871 Filing date: 2007.07.13
Applicant: UNIV WASEDA;YAMAHA CORP Inventor: HIGASHIYAMA MIKIO;KAZAMA MICHIKO;YOSHIOKA YASUO
Classification: G10L21/02;G10L11/00 Main classification: G10L21/02
Agency: Agent:
Main claim:
Address: