发明名称 Voice converter for assimilation by frame synthesis with temporal alignment
摘要 A voice converting apparatus is constructed for converting an input voice into an output voice according to a target voice. In the apparatus, a storage section provisionally stores source data, which is associated to and extracted from the target voice. An analyzing section analyzes the input voice to extract therefrom a series of input data frames representing the input voice. A producing section produces a series of target data frames representing the target voice based on the source data, while aligning the target data frames with the input data frames to secure synchronization between the target data frames and the input data frames. A synthesizing section synthesizes the output voice according to the target data frames and the input data frames. In the recognizing feature analysis, a characteristic analyzer extracts from the input voice a characteristic vector. A memory memorizes target behavior data representing a behavior of the target voice. An alignment processor determines a temporal relation between the input data frames and the target data frames according to the characteristic vector and the target behavior data so as to output alignment data. A target decoder produces the target data frames according to the alignment data, the input data frames and the source data containing phoneme of the target voice.
申请公布号 US2005049875(A1) 申请公布日期 2005.03.03
申请号 US20040951328 申请日期 2004.09.27
申请人 YAMAHA CORPORATION 发明人 KAWASHIMA TAKAHIRO;YOSHIOKA YASUO;CANO PEDRO;LOSCOS ALEX;SERRA XAVIER;SCHIEMENTZ MARK;BONADA JORDI
分类号 G10L13/02;G10L21/00;(IPC1-7):G10L13/00 主分类号 G10L13/02
代理机构 代理人
主权项
地址