Title of invention |
Using word confidence score, insertion and substitution thresholds for selected words in speech recognition |
Abstract |
A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP. |
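The last step of the abstract can be illustrated with a minimal sketch. The abstract does not state the exact decision rule, so the following is only one plausible reading: it assumes that a lower WCS means lower confidence, that the insertion threshold lies at or below the substitution threshold, and that a replacement word is supplied by the caller. The names `modify_hyp`, `ins_threshold`, `sub_threshold`, and `substitute` are hypothetical and do not come from the patent.

```python
from typing import Optional


def modify_hyp(hyp: str, wcs: float, ins_threshold: float,
               sub_threshold: float, substitute: str) -> Optional[str]:
    """Return a modified hypothesis (mHYP) for one decoded word.

    Assumed rule (not taken from the patent text):
      - WCS below the insertion threshold: treat the word as a likely
        insertion error and drop it (mHYP is null, here None).
      - WCS below the substitution threshold: treat the word as a likely
        substitution error and replace it with `substitute`.
      - Otherwise keep the decoder output unchanged.
    """
    if ins_threshold > sub_threshold:
        raise ValueError("this sketch assumes ins_threshold <= sub_threshold")
    if wcs < ins_threshold:
        return None
    if wcs < sub_threshold:
        return substitute
    return hyp
```

Under these assumptions, a word with a very low score is removed, a word with a middling score is swapped, and a word with a high score passes through untouched.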
Publication number |
US9478218(B2) |
Publication date |
2016.10.25 |
Application number |
US200812258093 |
Filing date |
2008.10.24 |
Applicant |
Adacel, Inc. |
Inventor |
Shu Chang-Qing |
Classification number |
G10L15/00;G10L15/187 |
Main classification number |
G10L15/00 |
Agency |
Allen Dyer Doppelt Milbrath & Gilchrist |
Agent |
Allen Dyer Doppelt Milbrath & Gilchrist |
Main claim |
1. A method for recognizing speech in acoustic data, comprising:
performing a tuning phase, the tuning phase further comprising: generating a series of hypothetical words (HYP) from a tuning audio data set in a decoder; and setting values of tunable parameters in the decoder to minimize a weighted total error rate (Wt Etotal); wherein the weighted total error is calculated according to an algorithm:
Wt Etotal=(λsub*num_error_sub_word+λins*num_error_ins_word+λdel*num_error_del_word)/total_num_RefWord, where λsub, λins, and λdel are weighting factors, at least two of which are different, num_error_sub_word is a number of substitution word errors, num_error_ins_word is a number of insertion word errors, num_error_del_word is a number of deletion word errors, and total_num_RefWord is the number of words in the transcription. |
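The weighted total error rate in the claim can be computed directly from the three error counts. The sketch below is illustrative only: the function name and the default weights are assumptions, chosen so that deletions count more than substitutions and insertions, as the abstract describes. The claim itself only requires that at least two of the three weights differ.

```python
def weighted_total_error_rate(num_error_sub_word: int,
                              num_error_ins_word: int,
                              num_error_del_word: int,
                              total_num_ref_word: int,
                              lam_sub: float = 1.0,
                              lam_ins: float = 1.0,
                              lam_del: float = 2.0) -> float:
    """Compute Wt Etotal as defined in claim 1.

    The default weights are hypothetical example values, not values
    given in the patent.
    """
    if total_num_ref_word <= 0:
        raise ValueError("total_num_ref_word must be positive")
    weighted_errors = (lam_sub * num_error_sub_word
                       + lam_ins * num_error_ins_word
                       + lam_del * num_error_del_word)
    return weighted_errors / total_num_ref_word


# Example: 3 substitutions, 2 insertions, 5 deletions in 100 reference words.
# With the assumed weights: (1*3 + 1*2 + 2*5) / 100 = 0.15
rate = weighted_total_error_rate(3, 2, 5, 100)
```

In a tuning phase, a function like this would serve as the objective to minimize while the decoder's tunable parameters are varied over the tuning audio data set.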
Address |
Quebec CA |