Title of invention Method, system and computer program for generating a language model
Abstract A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.
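The revision step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the spoken-form table, the function names `revise_transcript` and `make_toy_scorer`, and the word-overlap scorer are all hypothetical. A real system would score each candidate spoken form against the audio stream itself; here a rough text rendering of what was heard stands in for the audio.

```python
from typing import Callable, Dict, List

# Hypothetical table mapping a written form to its possible spoken forms.
SPOKEN_FORMS: Dict[str, List[str]] = {
    "10/1/1993": ["october first nineteen ninety three", "ten one ninety three"],
    "Dr.": ["doctor"],
}


def make_toy_scorer(heard_text: str) -> Callable[[str], float]:
    """Return a scorer giving the fraction of candidate words found in heard_text.

    This is a stand-in (assumption) for scoring a candidate against the audio.
    """
    heard = set(heard_text.split())

    def score(candidate: str) -> float:
        words = candidate.split()
        if not words:
            return 0.0
        return sum(w in heard for w in words) / len(words)

    return score


def revise_transcript(
    tokens: List[str],
    spoken_forms: Dict[str, List[str]],
    score: Callable[[str], float],
) -> str:
    """Replace each multi-form concept with its best-scoring spoken form."""
    revised = []
    for token in tokens:
        # Tokens without an entry are treated as having one spoken form.
        candidates = spoken_forms.get(token, [token.lower()])
        revised.append(max(candidates, key=score))
    return " ".join(revised)


non_literal = ["Dr.", "Smith", "seen", "10/1/1993"]
scorer = make_toy_scorer("doctor smith seen ten one ninety three")
revised = revise_transcript(non_literal, SPOKEN_FORMS, scorer)
print(revised)
```

In this sketch the revised transcript, rather than the original non-literal one, would be the text paired with the audio for acoustic model training.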
Publication number EP1787287(B1) Publication date 2016.10.05
Application number EP20050786418 Filing date 2005.08.18
Applicant MULTIMODAL TECHNOLOGIES, LLC Inventors YEGNANARAYANAN, GIRIJA;FINKE, MICHAEL;FRITSCH, JUERGEN;KOLL, DETLEF;WOSZCZYNA, MONIKA
Classification G10L15/06;G10L15/193;G10L15/26 Main classification G10L15/06
Agency Agent
Principal claim
Address