摘要 |
PROBLEM TO BE SOLVED: To provide a method, a computer program and a processor for text speech synthesis that enables an unskilled operator to generate a very excellent spoken sound by minimum evaluation and correcting operation. SOLUTION: The method comprises: deriving at least one target unit sequence corresponding to the linguistic description; selecting from a waveform unit database for the target unit sequences a plurality of alternative unit sequences approximating the target unit sequences; concatenating the alternative unit sequences to alternative speech waveforms; and choosing one of the alternative speech waveforms by an operating person. There are no iterative cycles of manual modification and automatic selection, which enables a fast way of working. The operator does not need knowledge of units, targets, and costs, but chooses from a set of given alternatives. The fine-tuning of TTS prompts therefore becomes accessible to non-experts. COPYRIGHT: (C)2007,JPO&INPIT
|