摘要 |
PROBLEM TO BE SOLVED: To provide a method and an apparatus for improving the quality of speech synthesis in a speech synthesis apparatus for inputting a text and a speech pattern and obtaining a speech synthesis output corresponding to these input contents. SOLUTION: A pronunciation information generation means 10 inputs a text α and speech pattern information β which is a factor for changing a speech other than contents expressed by the text α and outputs one or more pronunciation information and a speech pattern score expressing a degree to which speech patterns corresponding to respective pronunciation information are reflected. A rhythm information generation means 12 inputs the speech pattern score and the pronunciation information and outputs one or more rhythm information in each pronunciation information. A synthetic speech selection means 16 selects and outputs synthetic speech having a quality score exceeding a threshold and the highest pattern score based on the pronunciation pattern score and the rhythm pattern score, and when there is no synthetic speech having a quality score exceeding the threshold, selects and outputs the synthetic speech having the highest quality score. COPYRIGHT: (C)2008,JPO&INPIT |