摘要 |
There is provided amethod of synthesizing speech from a given text, said method comprising: -determining a sequence of phonetic components from the t ext; -determining a sequence of target phonetic elements from the sequence o f phonetic components; -determining a sequence of target event types from th e sequence of phonetic components; -determining a sequence of speech units f rom a plurality of stored speech unit candidates by use of a cost function, wherein the cost function comprises a unit cost,a concatenation cost, and an event type cost for each speech unit of the sequence of speech units, where in the unit cost of a speech unit is determined with respect to the correspo nding target phonetic element, wherein the concatenation cost of a speech un it is determined with respect to its adjacent speech units, and wherein the event type cost of each speech unit is determined with respect to the corres ponding target event type.
|