Method and apparatus for utterance verification, Application No. US201213717645 - 传众 Patent Search

Title of invention	Method and apparatus for utterance verification
Abstract	A method and apparatus for utterance verification are provided for verifying a recognized vocabulary output from speech recognition. The apparatus for utterance verification includes a reference score accumulator, a verification score generator and a decision device. A log-likelihood score obtained from speech recognition is processed by taking a logarithm of the value of the probability of one of feature vectors of an input speech conditioned on one of states of each model vocabulary. A verification score is generated based on the processed result. The verification score is compared with a predetermined threshold value so as to reject or accept the recognized vocabulary.
Publication No.	US8972264(B2)	Publication date	2015.03.03
Application No.	US201213717645	Filing date	2012.12.17
Applicant	Industrial Technology Research Institute	Inventor	Chien Shih-Chieh
Classification	G10L15/00	Main classification	G10L15/00
Agency	Jianq Chyun IP Office	Agent	Jianq Chyun IP Office
Independent claim	1. A method for utterance verification adapted to verify a recognized vocabulary, wherein the recognized vocabulary is obtained by performing speech recognition on a feature vector sequence according to an acoustic model and model vocabulary database, wherein the feature vector sequence comprises feature vectors of a plurality of frames, wherein the acoustic model and model vocabulary database comprises a plurality of model vocabularies, wherein each of the model vocabularies comprises a plurality of states, and wherein the method for utterance verification comprises: calculating a maximum reference score for each of the model vocabularies according to a log-likelihood score obtained from speech recognition, wherein the log-likelihood score obtained from speech recognition is calculated by taking a logarithm on a value of a probability of one of the feature vectors of the frames conditioned on one of the states of each model vocabulary, and wherein the maximum reference score is a summation of the maximum value of log-likelihood scores of the feature vector of each frame conditioned on each state of a certain model vocabulary; calculating a first verification score according to an optimal path score output during the speech recognition and the maximum reference score; and comparing the first verification score with a first predetermined threshold value, so as to reject or accept the recognized vocabulary.
Address	Hsinchu TW
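
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The claim says only that the first verification score is calculated "according to" the optimal path score and the maximum reference score, so the frame-normalized difference used here is an assumption, as are all function names, the example numbers, and the threshold. The input `log_likelihoods[t][s]` is assumed to hold log p(feature vector of frame t | state s) for one model vocabulary.

```python
from typing import List, Sequence, Tuple


def max_reference_score(log_likelihoods: Sequence[Sequence[float]]) -> float:
    """Sum, over frames, of the largest per-state log-likelihood.

    log_likelihoods[t][s] is log p(o_t | state s) for one model vocabulary.
    """
    if not log_likelihoods or any(len(frame) == 0 for frame in log_likelihoods):
        raise ValueError("need at least one frame and one state per frame")
    return sum(max(frame) for frame in log_likelihoods)


def first_verification_score(optimal_path_score: float,
                             max_ref: float,
                             num_frames: int) -> float:
    """Hypothetical combination: per-frame gap between the decoder's
    optimal path score and the maximum reference score (assumption)."""
    return (optimal_path_score - max_ref) / num_frames


def verify(optimal_path_score: float,
           log_likelihoods: Sequence[Sequence[float]],
           threshold: float) -> Tuple[bool, float]:
    """Accept the recognized vocabulary when the score reaches the threshold."""
    max_ref = max_reference_score(log_likelihoods)
    score = first_verification_score(optimal_path_score, max_ref,
                                     len(log_likelihoods))
    return score >= threshold, score


# Illustrative numbers only: 3 frames, 2 states.
example: List[List[float]] = [
    [-1.0, -2.0],
    [-3.0, -0.5],
    [-0.25, -4.0],
]
# Assume the decoder's best path stayed in state 0 for every frame.
path_score = example[0][0] + example[1][0] + example[2][0]

accepted, score = verify(path_score, example, threshold=-1.0)
print("accepted" if accepted else "rejected", round(score, 4))
```

Because the optimal path must follow the model's state topology, its score can never exceed the maximum reference score, so under this assumed formula the verification score is at most zero, and values closer to zero indicate a better match.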