发明名称 METHOD, APPARATUS, AND PROGRAM FOR PHONEME DETERMINATION
摘要 PROBLEM TO BE SOLVED: To provide a method, an apparatus, and a program for phoneme determination that can accurately discriminate a vocal sound and accurately specify the section of the vocal section, i.e. vocal sound borders. SOLUTION: In the phoneme determining method, an input speech signal is divided into a plurality of bands, frame by frame, through mel frequency division (S1); power of each band is found and an acoustic feature quantity vector of each frame is generated (S2); and acoustic feature quantity vectors of those kinds are used to generate an HMM (Hidden Markov Model) as to respective vocal sounds or vocal sound boundaries. Computation is so carried out (S3) that a series of HMMs corresponding to previously know vocal sounds or vocal sound boundaries of the input speech signal, and a previously found feature quantity vector sequence and likelihood become maximum; and information (label) representing the vocal sounds or vocal sound boundaries is imparted to respective frames of the speech signal at this time (S4). COPYRIGHT: (C)2004,JPO
申请公布号 JP2004077901(A) 申请公布日期 2004.03.11
申请号 JP20020239448 申请日期 2002.08.20
申请人 NIPPON TELEGR & TELEPH CORP <NTT> 发明人 YONEZAWA TOMOKO;MIZUNO HIDEYUKI;ABE MASANOBU
分类号 G10L15/14;G10L13/06;G10L15/02;G10L15/04;G10L15/10;(IPC1-7):G10L13/06 主分类号 G10L15/14
代理机构 代理人
主权项
地址