发明名称 |
TRANSCRIPTION CORRECTION USING MULTI-TOKEN STRUCTURES |
摘要 |
Examples of the present disclosure describe generation of a multi-arc confusion network to improve, for example, an ability to return alternatives to output generated. A confusion network comprising token representations of lexicalized hypotheses and normalized hypotheses is generated. Each arc of the confusion network represents a token of a lexicalized hypothesis or a normalized hypothesis. The confusion network is transformed into a multi-arc confusion network, wherein the transforming comprising realigning at least one token of the confusion network to span multiple arcs of the confusion network. Other examples are also described. |
申请公布号 |
WO2016122967(A1) |
申请公布日期 |
2016.08.04 |
申请号 |
WO2016US14411 |
申请日期 |
2016.01.22 |
申请人 |
MICROSOFT TECHNOLOGY LICENSING, LLC |
发明人 |
LEVIT, MICHAEL;OZERTEM, UMUT;PARTHASARATHY, SARANGARAJAN;VARADHARAJAN, PADMA;RAGHUNATHAN, KARTHIK;ALPHONSO, ISSAC |
分类号 |
G10L15/08;G10L15/187;G10L15/197;G10L15/22 |
主分类号 |
G10L15/08 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|