Abstract |
A method and system are disclosed for recognizing speech errors, such as errors in spoken short messages. An audio input device receives an utterance of a short message, and an automated speech recognition module generates a text sentence corresponding to the utterance. A linear-chain conditional random field (CRF) module generates an N-best list of predicted error sequences for the text sentence, where each word of the text sentence is assigned a label in each of the predicted error sequences, and each label is assigned a probability score. The predicted error sequence labels are rescored using a metacost matrix module, the best rescored error sequence from the N-best list of predicted error sequences is selected using a Recognition Output Voting Error Reduction (ROVER) module, and a dialog action is executed by a dialog action module based on the best rescored error sequence and a dialog action policy. |
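The rescoring, voting, and dialog-action stages described above can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the disclosure: the two-label set (`ok`, `err`), the cost values in `COST`, the three action names, and the helper names `rescore`, `vote`, and `choose_action`. The voting step is a simplified position-wise weighted vote standing in for ROVER, which in its usual form first aligns hypotheses into a word transition network; here the hypotheses already share one text sentence, so no alignment is needed. The CRF that would produce the N-best list is replaced by hand-written sample data.

```python
from collections import defaultdict

LABELS = ("ok", "err")  # hypothetical label set: word correct / word misrecognized

# Hypothetical metacost matrix, COST[true_label][predicted_label].
# Missing a real error is assumed to cost more than a false alarm.
COST = {
    "ok": {"ok": 0.0, "err": 1.0},
    "err": {"ok": 3.0, "err": 0.0},
}


def rescore(label, prob):
    """Pick the label with the lowest expected cost for one word.

    `prob` is the score of `label`; the other label gets 1 - prob.
    Returns the chosen label and its probability as a vote weight.
    """
    other = "err" if label == "ok" else "ok"
    posterior = {label: prob, other: 1.0 - prob}
    risk = {
        pred: sum(posterior[true] * COST[true][pred] for true in LABELS)
        for pred in LABELS
    }
    best = min(LABELS, key=lambda pred: risk[pred])
    return best, posterior[best]


def vote(nbest):
    """Position-wise weighted vote over the rescored N-best sequences."""
    result = []
    for position in range(len(nbest[0])):
        tally = defaultdict(float)
        for hypothesis in nbest:
            label, weight = rescore(*hypothesis[position])
            tally[label] += weight
        result.append(max(LABELS, key=lambda lab: tally[lab]))
    return result


def choose_action(labels):
    """Hypothetical dialog action policy keyed on the number of error labels."""
    errors = labels.count("err")
    if errors == 0:
        return "confirm"
    if errors == 1:
        return "ask_repeat_word"
    return "ask_rephrase"


# Sample 2-best list for a three-word sentence: (label, probability) per word.
nbest = [
    [("ok", 0.9), ("ok", 0.6), ("ok", 0.8)],
    [("ok", 0.9), ("err", 0.7), ("ok", 0.8)],
]
best_sequence = vote(nbest)
action = choose_action(best_sequence)
```

With these assumed costs, a word labeled `ok` at probability 0.6 is relabeled `err` during rescoring, because the expected cost of missing an error outweighs that of a false alarm. Making the matrix asymmetric is the design choice that lets the system trade recall of errors against unnecessary clarification requests.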