摘要 |
<p>The pitch of synthesised speech signals is varied by separating the speech s ignals into a spectral component and an excitation component. The latter is multiplied by a series of overlapping window funct ions synchronous, in the case of voiced speech, with pit ch timing mark information corresponding at least approximately to instants of vocal excitation, to separate it into windowed speech se gments which are added together again after the application of a controllable time- shift. The spectral and excitation components are then r ecombined. The multiplication employs at least two windows per pitch period, each havin g a duration of less than one pitch period. Alternativel y each window has a duration of less than twice the pitch period between timing mar ks and is asymmetric about the timing mark.</p> |