发明名称 Searching in audio speech
摘要 A computerized method of detecting a target word in a speech signal. A speech recognition engine and a previously constructed phoneme model is provided. The speech signal is input into the speech recognition engine. Based on the phoneme model, the input speech signal is indexed. A time-ordered list is stored representing n-best phoneme candidates of the input speech signal and phonemes of the input speech signal in multiple phoneme frames. The target word is transcribed into a transcription of target phonemes. The time-ordered list of n-best phoneme candidates is searched for a locus of said target phonemes. While searching, scoring is based on the ranking of the phoneme candidates among the n-best phoneme candidates and based on the number of the target phonemes found. A composite score of the probability of an occurrence of the target word is produced. When the composite score is higher than a threshold, start and finish times are output which bound the locus. The start and finish times are input into an algorithm adapted for sequence alignment based on dynamic programming for aligning a portion of the phoneme frames with the target phonemes.
申请公布号 US8321218(B2) 申请公布日期 2012.11.27
申请号 US20090488097 申请日期 2009.06.19
申请人 FAIFKOV RONEN;COHEN-TOV RABIN;SIMONE ADAM;L.N.T.S. LINGUISTECH SOLUTIONS LTD 发明人 FAIFKOV RONEN;COHEN-TOV RABIN;SIMONE ADAM
分类号 G10L15/00 主分类号 G10L15/00
代理机构 代理人
主权项
地址