Title of invention: Method and apparatus for utterance verification
Abstract: A method and an apparatus for utterance verification are provided for verifying a recognized vocabulary output by speech recognition. The apparatus includes a reference score accumulator, a verification score generator, and a decision device. A log-likelihood score obtained from speech recognition is processed by taking the logarithm of the probability of one of the feature vectors of an input speech conditioned on one of the states of each model vocabulary. A verification score is generated from the processed result and compared with a predetermined threshold value so as to reject or accept the recognized vocabulary.
Publication number: US8972264(B2); Publication date: 2015.03.03
Application number: US201213717645; Filing date: 2012.12.17
Applicant: Industrial Technology Research Institute; Inventor: Chien Shih-Chieh
Classification: G10L15/00; Main classification: G10L15/00
Agency: Jianq Chyun IP Office; Agent: Jianq Chyun IP Office
Principal claim: 1. A method for utterance verification adapted to verify a recognized vocabulary, wherein the recognized vocabulary is obtained by performing speech recognition on a feature vector sequence according to an acoustic model and model vocabulary database, wherein the feature vector sequence comprises feature vectors of a plurality of frames, wherein the acoustic model and model vocabulary database comprises a plurality of model vocabularies, wherein each of the model vocabularies comprises a plurality of states, and wherein the method for utterance verification comprises: calculating a maximum reference score for each of the model vocabularies according to a log-likelihood score obtained from speech recognition, wherein the log-likelihood score obtained from speech recognition is calculated by taking a logarithm on a value of a probability of one of the feature vectors of the frames conditioned on one of the states of each model vocabulary, and wherein the maximum reference score is a summation of the maximum value of log-likelihood scores of the feature vector of each frame conditioned on each state of a certain model vocabulary; calculating a first verification score according to an optimal path score output during the speech recognition and the maximum reference score; and comparing the first verification score with a first predetermined threshold value, so as to reject or accept the recognized vocabulary.
Address: Hsinchu TW
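Illustrative sketch (not part of the patent record): the three steps recited in claim 1 can be outlined in Python as below. The claim only states that the first verification score is calculated "according to" the optimal path score and the maximum reference score; the per-frame difference used here is an assumption made for illustration, as are all function names, the table layout log_likelihoods[t][s], and the sample numbers.

```python
from typing import Sequence


def max_reference_score(log_likelihoods: Sequence[Sequence[float]]) -> float:
    """Sum, over frames, of the largest per-state log-likelihood.

    log_likelihoods[t][s] is assumed to hold log P(o_t | state s) for
    frame t and state s of one model vocabulary (hypothetical layout).
    """
    return sum(max(frame_scores) for frame_scores in log_likelihoods)


def verification_score(optimal_path_score: float,
                       max_ref_score: float,
                       num_frames: int) -> float:
    """Combine the recognizer's optimal path score with the reference score.

    ASSUMPTION: a per-frame difference. The claim does not fix the formula.
    """
    return (optimal_path_score - max_ref_score) / num_frames


def accept(optimal_path_score: float,
           log_likelihoods: Sequence[Sequence[float]],
           threshold: float) -> bool:
    """Return True to accept the recognized vocabulary, False to reject it."""
    ref = max_reference_score(log_likelihoods)
    score = verification_score(optimal_path_score, ref, len(log_likelihoods))
    return score >= threshold


# Made-up example: 3 frames, 2 states.
example_ll = [[-1.0, -2.0], [-3.0, -0.5], [-2.0, -2.5]]
example_ref = max_reference_score(example_ll)  # (-1.0) + (-0.5) + (-2.0)
example_score = verification_score(-5.0, example_ref, len(example_ll))
```

Under this assumed formula the score is never positive when the optimal path score is a pure sum of per-frame log-likelihoods, and a value closer to zero means the decoded path stays close to the best state in every frame. The threshold would have to be tuned on held-out data.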