Title: Method and apparatus for voice-enabling an application
Abstract: A method of voice-enabling an application for command and control and content navigation can include the application dynamically generating a markup language fragment specifying a command and control and content navigation grammar for the application, instantiating an interpreter from a voice library, and providing the markup language fragment to the interpreter. The method also can include the interpreter processing a speech input using the command and control and content navigation grammar specified by the markup language fragment and providing an event to the application indicating an instruction representative of the speech input.
Publication number: US8768711(B2) Publication date: 2014.07.01
Application number: US200410870517 Filing date: 2004.06.17
Applicant: Nuance Communications, Inc. Inventors: Ativanichayaphong Soonthorn; Cross, Jr. Charles W.; Muschett Brien H.
Classification: G10L15/00;G10L21/00;G10L25/00;G06F3/00;H04M1/64;H04M11/00 Main classification: G10L15/00
Agency: Wolf, Greenfield & Sacks, P.C. Agent: Wolf, Greenfield & Sacks, P.C.
Principal claim: 1. A method of voice-enabling an application comprising a visual browser that lacks the ability to process audible input and/or output, the method comprising: dynamically identifying, via the application comprising the visual browser and based on a current state of the application, one or more commands and/or controls that are used to interact with the visual browser via graphic input and that are not responsive to speech; generating, via the application comprising the visual browser, at least one markup language fragment specifying a voice grammar corresponding to the identified one or more commands and/or controls that are not responsive to speech; instantiating, by the application and through a voice library of voice markup language functions, an interpreter by calling at least one function in the voice library via a library application programming interface (API) through which the visual browser and the voice library can communicate; providing the at least one markup language fragment from the application that instantiated the interpreter to the interpreter for use in recognizing speech; receiving, via the application, a speech input from a user; receiving, via the application from the interpreter, an event specifying at least one of the identified one or more commands and/or controls, generated as a result of matching, by the interpreter, the speech input with the voice grammar specified by the at least one markup language fragment to resolve the speech input to the at least one of the identified one or more commands and/or controls so that the event includes at least one attribute specifying a semantic interpretation of the speech input thus rendering the at least one of the identified one or more commands and/or controls responsive to speech; and interpreting the event via the application.
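Illustrative sketch (not part of the patent text): the flow recited in the claim can be approximated in a few lines of Python. Everything named here is hypothetical and chosen only for illustration: the `VisualBrowser`, `Interpreter`, and `VoiceEvent` classes, the command identifiers such as `nav.back`, and the simplified SRGS-like XML layout. Speech recognition itself is not modelled; the "speech input" is assumed to arrive as already-recognized text, and grammar matching is reduced to an exact, case-insensitive phrase lookup.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List
import xml.etree.ElementTree as ET


def build_grammar_fragment(commands: Dict[str, str]) -> str:
    """Generate a simplified, SRGS-like XML fragment (hypothetical layout).

    `commands` maps a spoken phrase to a command identifier, which is stored
    in a <tag> child as the phrase's semantic interpretation.
    """
    grammar = ET.Element("grammar", {"root": "command"})
    rule = ET.SubElement(grammar, "rule", {"id": "command"})
    one_of = ET.SubElement(rule, "one-of")
    for phrase, command_id in commands.items():
        item = ET.SubElement(one_of, "item")
        item.text = phrase
        ET.SubElement(item, "tag").text = command_id
    return ET.tostring(grammar, encoding="unicode")


@dataclass
class VoiceEvent:
    command: str    # attribute carrying the semantic interpretation
    utterance: str  # the recognized text that triggered the event


class Interpreter:
    """Stand-in for an interpreter obtained from a voice library."""

    def __init__(self, on_event: Callable[[VoiceEvent], None]) -> None:
        self._on_event = on_event
        self._phrases: Dict[str, str] = {}

    def load_fragment(self, fragment: str) -> None:
        """Parse the fragment and replace the active phrase table."""
        root = ET.fromstring(fragment)
        self._phrases = {}
        for item in root.iter("item"):
            phrase = (item.text or "").strip().lower()
            tag = item.find("tag")
            if phrase and tag is not None and tag.text:
                self._phrases[phrase] = tag.text

    def process(self, recognized_text: str) -> bool:
        """Match input against the grammar; fire an event on a match."""
        command = self._phrases.get(recognized_text.strip().lower())
        if command is None:
            return False
        self._on_event(VoiceEvent(command=command, utterance=recognized_text))
        return True


class VisualBrowser:
    """Toy application that voice-enables its own commands."""

    def __init__(self) -> None:
        self.history: List[str] = []
        # Instantiate the interpreter and register the event callback.
        self.interpreter = Interpreter(self.handle_event)
        self.refresh_grammar()

    def current_commands(self) -> Dict[str, str]:
        # A real application would derive this from its current state.
        return {"go back": "nav.back", "reload page": "nav.reload"}

    def refresh_grammar(self) -> None:
        fragment = build_grammar_fragment(self.current_commands())
        self.interpreter.load_fragment(fragment)

    def handle_event(self, event: VoiceEvent) -> None:
        # Interpret the event; here the command is simply recorded.
        self.history.append(event.command)
```

In this sketch the application, not the interpreter, owns the list of commands, so calling `refresh_grammar()` after a state change is what would keep the active grammar in step with the controls currently shown.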
Address: Burlington, MA, US