摘要 |
A search word acquiring unit acquires a search word. A converting unit converts the search word into a phoneme sequence. An output probability acquiring unit acquires, for each frame, an output probability of a feature quantity of a target voice signal being output from each phoneme included in the phoneme sequence. A relative calculating unit executes relative calculation of the output probability acquired from each phoneme by the output probability acquirer, based on an output probability acquired from another phoneme included in the phoneme sequence. A zone designating unit successively designates a likelihood acquisition zones. A likelihood calculating unit acquires a likelihood indicating how likely a likelihood acquisition zone designated by the zone designator is a zone in which voice corresponding to the search word is spoken. An identifying unit identifies from the target voice signal an estimated zone for which the voice corresponding to the search word is estimated to be spoken, based on the likelihood acquired by the likelihood acquiring unit. |