摘要 |
PROBLEM TO BE SOLVED: To provide a speech data retrieval system capable of retrieving a part where a user-specified keyword is uttered, with high accuracy and high speed, even when a speech data becomes a large amount. SOLUTION: Candidate periods are narrowed down, in advance, on the basis of a sequence of subwords generated from a keyword, and then the count values of the candidate periods containing the subwords are each calculated by adding up certain values. Through such a simple process, the candidate periods are prioritized and then selected as retrieved results. In addition, the sequence of subwords generated from the keyword is complemented, assuming that speech recognition errors occur, and then, candidate period generation and selection are performed, on the basis of the complemented sequence of subwords. COPYRIGHT: (C)2009,JPO&INPIT |