Abstract |
<p>A device (100) stores voice element information (voice element information storage section (115)). This information indicates voice elements that, when used to synthesize a voice having a reference rhythm, yield a voice whose authenticity, i.e. its degree of similarity to a human voice, exceeds a predetermined reference value. The device receives requested rhythm information indicating a requested rhythm, i.e. the rhythm requested by a user (requested rhythm information receiving section (113)). The device generates intermediate rhythm information indicating an intermediate rhythm that lies between the reference rhythm and the requested rhythm (intermediate rhythm information generating section (114)). The device then synthesizes a voice according to the generated intermediate rhythm information and the stored voice element information (voice synthesis section (116)).</p> |
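The intermediate rhythm generation step can be illustrated with a minimal sketch. The abstract does not say how the intermediate rhythm is computed or how a rhythm is represented, so everything here is an assumption made for illustration: the `Rhythm` type, its per-element `pitch_hz` and `duration_ms` fields, the `intermediate_rhythm` function, and the use of linear interpolation with a `weight` parameter.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Rhythm:
    """Hypothetical rhythm description: one pitch and one duration per voice element."""
    pitch_hz: Tuple[float, ...]
    duration_ms: Tuple[float, ...]


def intermediate_rhythm(reference: Rhythm, requested: Rhythm, weight: float = 0.5) -> Rhythm:
    """Return a rhythm between the reference and the requested rhythm.

    Assumption: linear interpolation. weight=0.0 gives the reference rhythm,
    weight=1.0 gives the requested rhythm.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    if (len(reference.pitch_hz) != len(requested.pitch_hz)
            or len(reference.duration_ms) != len(requested.duration_ms)):
        raise ValueError("rhythms must describe the same number of voice elements")

    def blend(a: Tuple[float, ...], b: Tuple[float, ...]) -> Tuple[float, ...]:
        # Element-wise weighted average of the two rhythm tracks.
        return tuple((1.0 - weight) * x + weight * y for x, y in zip(a, b))

    return Rhythm(
        pitch_hz=blend(reference.pitch_hz, requested.pitch_hz),
        duration_ms=blend(reference.duration_ms, requested.duration_ms),
    )


# Example values are made up for illustration only.
reference = Rhythm(pitch_hz=(100.0, 200.0), duration_ms=(50.0, 100.0))
requested = Rhythm(pitch_hz=(200.0, 400.0), duration_ms=(100.0, 200.0))
halfway = intermediate_rhythm(reference, requested, weight=0.5)
```

Under this assumption, a smaller `weight` keeps the synthesized voice closer to the reference rhythm, for which the stored voice elements are known to exceed the authenticity threshold, while a larger `weight` moves it closer to what the user requested.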