摘要 |
<p>Speech recognition is performed by receiving isolated speech training data (step 98) indicative of a plurality of discretely spoken training words, and receiving continuous speech training data (step 86) indicative of a plurality of continuously spoken training words. A plurality of speech unit models is trained based on the isolated speech training data and the continuous speech training data. Speech is recognized based on the speech unit models trained.</p> |