Title of invention: Speech samples library for text-to-speech and methods and apparatus for generating and using same
Abstract: A method for converting text into speech with a speech sample library is provided. The method comprises converting an input text to a sequence of triphones; determining musical parameters of each phoneme in the sequence of triphones; detecting, in the speech sample library, speech segments having at least the determined musical parameters; and concatenating the detected speech segments.
Publication number: US8775185(B2) Publication date: 2014.07.08
Application number: US201213686140 Filing date: 2012.11.27
Applicant: Inventors: Silbert Gershon; Hakim Andres
Classification: G10L13/06 Main classification: G10L13/06
Agency: M&B IP Analysts, LLC Agent: M&B IP Analysts, LLC
Independent claim: 1. A method for converting text into speech with a speech sample library, comprising: providing an input text; converting the input text to a sequence of triphones; retrieving phonemic contexts of the sequence of triphones; determining musical parameters characterizing each phoneme in the sequence of triphones; predicting a set of numerical targets for the determined musical parameters, wherein the set of numerical targets is provided for each of the musical parameters; detecting, in the speech sample library, pre-stored speech segments having at least the determined musical parameters of each phoneme in the sequence of triphones based on the phonemic contexts and the predicted set of numerical targets for the determined musical parameters which lie within a range of musical parameters of the pre-stored speech segments, wherein the detection of the pre-stored speech segments further includes searching the speech sample library for at least one of a central phoneme, phonemic context, and a musical index indicating at least one range of at least one of the musical parameters within which at least one of the numerical targets lies; and concatenating the detected speech segments.
Address:
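
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation; it only shows one plausible reading of the claimed flow: build triphones, look up pre-stored segments whose phonemic context matches and whose musical index ranges contain the predicted numerical targets, then concatenate. Everything named here is hypothetical and does not come from the record: the `Segment` class, the functions `to_triphones`, `find_segment` and `synthesize`, the `#` boundary symbol, and the use of `"pitch"` as an example musical parameter. The sketch also assumes the input text has already been converted to a phoneme list and that a target predictor is supplied by the caller.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Sequence, Tuple

Triphone = Tuple[str, str, str]   # (left context, central phoneme, right context)
Range = Tuple[float, float]       # inclusive (low, high) bounds of one musical parameter


@dataclass
class Segment:
    """A pre-stored speech segment (hypothetical layout, not from the patent)."""
    triphone: Triphone
    musical_index: Dict[str, Range]   # parameter name -> range covered by this segment
    samples: List[float]              # placeholder for the audio data


def to_triphones(phonemes: Sequence[str], boundary: str = "#") -> List[Triphone]:
    """Pair every phoneme with its left and right neighbours."""
    padded = [boundary, *phonemes, boundary]
    return [(padded[i - 1], padded[i], padded[i + 1])
            for i in range(1, len(padded) - 1)]


def find_segment(library: Sequence[Segment],
                 triphone: Triphone,
                 targets: Dict[str, float]) -> Optional[Segment]:
    """Return the first segment with a matching phonemic context whose
    musical index ranges contain every numerical target, else None."""
    for seg in library:
        if seg.triphone != triphone:
            continue
        if all(name in seg.musical_index
               and seg.musical_index[name][0] <= value <= seg.musical_index[name][1]
               for name, value in targets.items()):
            return seg
    return None


def synthesize(library: Sequence[Segment],
               phonemes: Sequence[str],
               predict_targets: Callable[[Triphone], Dict[str, float]]) -> List[float]:
    """Concatenate the samples of the segments detected for each triphone."""
    output: List[float] = []
    for tri in to_triphones(phonemes):
        seg = find_segment(library, tri, predict_targets(tri))
        if seg is None:
            raise LookupError(f"no stored segment matches {tri}")
        output.extend(seg.samples)
    return output
```

A linear scan keeps the sketch short; a real library would more likely index segments by central phoneme and context so that only the musical ranges need to be compared at lookup time.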