发明名称 TRANSCRIPTION CORRECTION USING MULTI-TOKEN STRUCTURES
摘要 Examples of the present disclosure describe generation of a multi-arc confusion network to improve, for example, an ability to return alternatives to output generated. A confusion network comprising token representations of lexicalized hypotheses and normalized hypotheses is generated. Each arc of the confusion network represents a token of a lexicalized hypothesis or a normalized hypothesis. The confusion network is transformed into a multi-arc confusion network, wherein the transforming comprising realigning at least one token of the confusion network to span multiple arcs of the confusion network. Other examples are also described.
申请公布号 WO2016122967(A1) 申请公布日期 2016.08.04
申请号 WO2016US14411 申请日期 2016.01.22
申请人 MICROSOFT TECHNOLOGY LICENSING, LLC 发明人 LEVIT, MICHAEL;OZERTEM, UMUT;PARTHASARATHY, SARANGARAJAN;VARADHARAJAN, PADMA;RAGHUNATHAN, KARTHIK;ALPHONSO, ISSAC
分类号 G10L15/08;G10L15/187;G10L15/197;G10L15/22 主分类号 G10L15/08
代理机构 代理人
主权项
地址