发明名称 NEURAL NETWORK-BASED SPEECH TOKEN RECOGNITION SYSTEM AND METHOD
摘要 Improved system and method for speaker-independent speech token recognition are described. The system is neural network-based and involves processing a sequence of spoken utterance, e.g. separately articulated letters of a name, to identify the same based upon a highest probability match of each utterance with learned speech tokens, e.g. the letters of the English language alphabet, and based upon a highest probability match of the uttered sequence with a defined utterance library, e.g. a list of names. First, the spoken utterance is digitized or captured and processed into a spectral representation. Second, discrete time frames of the spectral representation are classified phonetically using the spectral coefficients. Third, the time-frame outputs are used by a modified Viterbi search to locate segment boundaries, near which such segment boundaries lies the information that is needed to discriminate letters. Fourth, the segmented or bounded representation is reclassified using such information into individual hypothesized letters. Fifth, successive, hypothesized letter scores are analyzed to obtain a high probability match with a spelled word within the utterance library. The system and method comprehend finer distinctions near points of interest used to discriminate difficult-to-recognize letter pair differences such as M/N, B/D, etc.. The system is described in the context of phone line reception of names spelled by remote users. <IMAGE>
申请公布号 AU3024892(A) 申请公布日期 1993.06.24
申请号 AU19920030248 申请日期 1992.12.18
申请人 OREGON GRADUATE INSTITUTE OF SCIENCE AND TECHNOLOGY 发明人 RONALD A. COLE;MARK A. FANTY
分类号 G10L15/16 主分类号 G10L15/16
代理机构 代理人
主权项
地址