发明名称 Voice retrieval device and voice retrieval method for detecting retrieval word from voice data
摘要 A voice retrieval device includes a processor; and a memory which stores a plurality of instructions, which when executed by the processor, cause the processor to execute: setting detection criteria for a retrieval word, based on a characteristic of the retrieval word, such that the higher the detection accuracy of the retrieval word or the lower the pronunciation difficulty of the retrieval word or the lower the appearance probability of the retrieval word, the stricter the detection criteria; performing first voice retrieval processing on voice data according to the detection criteria and detecting a section that possibly includes the retrieval word as a candidate section from the voice data; and performing second voice retrieval processing different from the first voice retrieval processing on each candidate section and determining whether or not the retrieval word is included in each candidate section.
申请公布号 US9466291(B2) 申请公布日期 2016.10.11
申请号 US201414515882 申请日期 2014.10.16
申请人 FUJITSU LIMITED 发明人 Tanaka Masakiyo;Iwamida Hitoshi;Washio Nobuyuki
分类号 G10L15/00;G10L15/08;G10L15/32 主分类号 G10L15/00
代理机构 Staas & Halsey LLP 代理人 Staas & Halsey LLP
主权项 1. A voice retrieval device comprising: a memory; and a processor coupled to the memory and configured to: set detection criteria to detect a retrieval word, based on a characteristic of the retrieval word, such that the higher the detection accuracy of the retrieval word or the lower the pronunciation difficulty of the retrieval word or the lower the appearance probability of the retrieval word, the less number of sections to be selected, as candidate sections, from voice data including a plurality of sections obtained by dividing the voice data into a plurality of frames, the voice data being recorded using a microphone; select part of the plurality of sections as the candidate sections which possibly include the retrieval word by performing first voice retrieval processing on the voice data according to the detection criteria, the first voice retrieval processing including calculating a matching score using the detection criteria for each of the plurality of sections included in the voice data, the matching score indicating a possibility of the retrieval word being included in each of the plurality of sections, according to the first voice retrieval processing, and detecting sections having the matching score that satisfies the detection criteria as the candidate sections; detect a section including the retrieval word by performing second voice retrieval processing using the detection criteria on each of the selected candidate sections, the second voice retrieval processing being different from the first voice retrieval processing; and output the detected section which includes the retrieval word.
地址 Kawasaki JP