摘要 |
<p>An analyzing unit (1) converts an input speech into a feature vector time series. A reference pattern storing unit (3) stores the feature vector time series obtained by the same manner as in the analyzing unit. A matching unit (2) correlates for time axis the input speech feature vector time series and the reference patterns to one another. An environmental adapting unit (4) performs the environmental adaptation between the input speech feature vector time series and the reference patterns according to the result of matching in the matching unit (2). A speaker adapting unit (6) performs the adaptation concerning the speaker between the environmentally adapted reference patterns from the environmental adapting unit (4) and the input speech feature vector time series. <IMAGE></p> |