摘要 |
<p>The word-spotting apparatus is provided with: a feature parameter generator (5) which extracts a speech segment from an input utterance, divides it into frames, and generates feature parameters of the utterance; an acoustic model storage (6) which stores feature parameters of speech at a subword level; keyword model generator (8) which generates a keyword model using pronunciation data of a keyword outputted from a keyword storage (7) and feature parameters outputted from the acoustic model storage (6); a keyword likelihood calculator (11, 21) which calculates keyword similarity between the feature parameters of the utterance and feature parameters of the keyword model; and the Viterbi processor (14, 24, 32, 42) which calculates cumulative similarity of the keyword model.</p> |