发明名称 METHOD AND APPARATUS FOR TRAINING A PROSODY STATISTIC MODEL AND PROSODY PARSING, METHOD AND SYSTEM FOR TEXT TO SPEECH SYNTHESIS
摘要 The present invention provides a method and apparatus for training a prosody statistic model and prosody parsing, a method and system for text to speech synthesis. Said method for training a prosody statistic model with a raw corpus that includes a plurality of sentences with punctuation, comprising: transforming said plurality of sentences in said raw corpus into a plurality of token sequences respectively; counting a frequency for each adjacent token pair occurring in said plurality of token sequences and frequencies of punctuation that represents a pause occurring at associated positions of said each token pair; calculating pause probabilities at said associated positions of said each token pair; and constructing said prosody statistic model based on said token pairs and said pause probabilities at associated positions thereof. With the present invention a prosody statistic model can be trained from a raw corpus without manually prosody parsing tags. And the prosody statistic model can be used in the prosody parsing and further voice synthesis.
申请公布号 US2007129938(A1) 申请公布日期 2007.06.07
申请号 US20060539434 申请日期 2006.10.06
申请人 KABUSHIKI KAISHA TOSHIBA 发明人 WANG HAIFENG;LI GUOHUA
分类号 G06F17/21 主分类号 G06F17/21
代理机构 代理人
主权项
地址