摘要 |
<p>This system for synthesizing speech by concatenating acoustic units comprises: phonetic transcription means (6) capable of generating a series of target acoustic units representative of the text to synthesize; means (7) for storing candidate acoustic units, each candidate acoustic unit comprising a pre-recorded fragment of speech; preselecting means (8) capable of producing a number of flows of candidate acoustic units, each flow being preselected based on a minimization of its overall cost, said overall call being the sum of cost functions that determine the cost between each target acoustic unit and the candidate acoustic units and functions of costs of the transitions between two candidate acoustic units, and; interface means (9) that enable an operator to compare the auditory quality of each preselected flow of candidate acoustic units for selecting the flow whose auditory quality seems the best to him.</p> |