摘要 |
<p>A system (60) for recognizing speech based on an input data stream indicative of the speech provides possible words represented by the input data stream as a prefix tree (88) including a plurality of phoneme branches connected at nodes. The plurality of phoneme branches is bracketed by at least one input silence branch (92) corresponding to a silence phone on an input side of the prefix tree and at least one output silence branch (94, 96, 98) corresponding to a silence phone on an output side of the prefix tree (60). The prefix tree (60) is traversed to obtain a word that is likely represented by the input data stream. The silence phones provided in the prefix tree can vary based on context.</p> |