发明名称 Grapheme-to-phoneme conversion using acoustic data
摘要 Described is the use of acoustic data to improve grapheme-to-phoneme conversion for speech recognition, such as to more accurately recognize spoken names in a voice-dialing system. A joint model of acoustics and graphonemes (acoustic data, phonemes sequences, grapheme sequences and an alignment between phoneme sequences and grapheme sequences) is described, as is retraining by maximum likelihood training and discriminative training in adapting graphoneme model parameters using acoustic data. Also described is the unsupervised collection of grapheme labels for received acoustic data, thereby automatically obtaining a substantial number of actual samples that may be used in retraining. Speech input that does not meet a confidence threshold may be filtered out so as to not be used by the retrained model.
申请公布号 US7991615(B2) 申请公布日期 2011.08.02
申请号 US20070952267 申请日期 2007.12.07
申请人 MICROSOFT CORPORATION 发明人 LI XIAO;GUNAWARDANA ASELA J. R.;ACERO ALEJANDRO
分类号 G10L15/04 主分类号 G10L15/04
代理机构 代理人
主权项
地址