发明名称 AUTOMATED EXTRACTION OF SEMANTIC CONTENT AND GENERATION OF A STRUCTURED DOCUMENT FROM SPEECH
摘要 Techniques are disclosed for automatically generating structured documents (310) based on speech (302), including identification of relevant concepts and their interpretation. In one embodiment, a structured document generator (308) uses an integrated process to generate a structured textual document (such as a structured textual medical report) based on a spoken audio stream (302). The spoken audio stream may be recognized using a language model (304), which includes a plurality of sub-models (306a, 306b, 306c, 306d, 306e) arranged in a hierarchical structure. Each of the sub-models may correspond to a concept that is expected to appear in the spoken audio stream. Different portions of the spoken audio stream may be recognized using different sub-models. The resulting structured textual document may have a hierarchical structure that corresponds to the hierarchical structure of the language sub-models that were used to generate the structured textual document.
申请公布号 WO2006023622(A3) 申请公布日期 2007.04.12
申请号 WO2005US29354 申请日期 2005.08.18
申请人 MULTIMODAL TECHNOLOGIES, INC.;FRITSCH, JUERGEN;FINKE, MICHAEL;KOLL, DETLEF;WOSZCZYNA, MONIKA;YEGNANARAYANAN, GIRIJA 发明人 FRITSCH, JUERGEN;FINKE, MICHAEL;KOLL, DETLEF;WOSZCZYNA, MONIKA;YEGNANARAYANAN, GIRIJA
分类号 G10L15/00;G06F17/27;G10L15/26 主分类号 G10L15/00
代理机构 代理人
主权项
地址