发明名称 Parse information encoding in a finite state transducer
摘要 In automatic speech recognition, certain parsing information, such as rules and tags, may be embedded into a finite state transducer (FST) to produce FST output that includes speech recognition results along with codes indicating parsing results of the recognized speech. The codes in the FST output may be formatted using a markup language, such as XML or JSON, for processing by a later application. The FST may be constructed according to a grammar defining the parsing information. The codes for inclusion in the FST output may be embedded into arcs of the FST and then included in the FST output when the speech recognition engine traverses the arcs of the FST.
申请公布号 US8972243(B1) 申请公布日期 2015.03.03
申请号 US201213681503 申请日期 2012.11.20
申请人 Amazon Technologies, Inc. 发明人 Strom Nikko;Ramakrishnan Karthik
分类号 G06F17/27;G10L21/00;G10L15/18 主分类号 G06F17/27
代理机构 Seyfarth Shaw LLP 代理人 Seyfarth Shaw LLP ;Barzilay Ilan N.
主权项 1. A method of performing speech recognition, the method comprising: creating a first finite state transducer (FST) using a speech recognition grammar, wherein a first arc of the first FST comprises a first semantic identifier and a second arc of the FST comprises a second semantic identifier; obtaining a second FST, wherein the second FST is for transducing speech recognition feature vectors to words; creating a third FST by composing the first FST and the second FST; receiving audio data comprising speech; performing speech recognition on the received audio data using the third FST to produce speech recognition results, wherein the speech recognition results comprise the first semantic identifier and the second semantic identifier; and processing the speech recognition results with an application, wherein the application processes the first semantic identifier and the second semantic identifier.
地址 Reno NV US