Title of invention |
System and method of providing generated speech via a network |
Abstract |
A system and method of operating an automatic speech recognition (ASR) application over an Internet Protocol (IP) network are disclosed. The ASR application communicates over a packet network such as an IP network or a wireless network. A grammar for recognizing speech received from a user over the IP network is selected from a plurality of grammars according to a user-selected application. A server receives information representing speech over the IP network, performs speech recognition using the selected grammar, and returns information based upon the recognized speech. Sub-grammars may be included within the grammar to recognize speech from sub-portions of a dialog with the user. |
Publication number |
US9065914(B2) |
Publication date |
2015.06.23 |
Application number |
US201213527151 |
Filing date |
2012.06.19 |
Applicant |
AT&T Intellectual Property II, L.P. |
Inventors |
Dragosh Pamela Leigh;Roe David Bjorn;Sharp Robert Douglas |
Classification numbers |
G10L21/00;H04M3/493;G10L15/30;G10L15/00;H04M1/64;H04M11/00;G06F13/00;G06F15/18;G06F15/00;G06F17/00;G06F15/16;H04R3/00;H04M7/00 |
Main classification number |
G10L21/00 |
Agency |
|
Agent |
|
Principal claim |
1. A method comprising:
selecting a spoken dialog application from a plurality of spoken dialog applications;
transmitting, over a network, an identification of the selected spoken dialog application, the spoken dialog application having a grammar identifier;
selecting a grammar from a plurality of grammars based on the grammar identifier, wherein the grammar is provided by the selected spoken dialog application and chosen from a predetermined group of grammars based upon information provided by the selected spoken dialog application;
transmitting digitized user speech over the network while receiving user speech which is digitized into the digitized user speech;
receiving partially synthesized speech in response to the digitized user speech, wherein the selected spoken dialog application recognizes the digitized user speech using the grammar; and
receiving final synthesized speech in response to the digitized user speech, wherein the receiving of the final synthesized speech occurs after receiving the partially synthesized speech. |
Address |
Atlanta GA US |
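
The flow recited in the principal claim (select an application, resolve its grammar identifier to one grammar out of a predetermined group, stream speech, then receive partial responses followed by a final one) can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the application names, the grammar identifiers, the `select_grammar` and `recognize` functions, and the use of ASCII text chunks as a stand-in for digitized speech and of plain strings as a stand-in for synthesized speech. It only shows the ordering of steps, not a real recognizer.

```python
from typing import Dict, Iterable, Iterator, List, Tuple

# Hypothetical registry: spoken dialog application -> grammar identifier.
APP_GRAMMAR_IDS: Dict[str, str] = {
    "pizza_order": "g-pizza",
    "flight_info": "g-flight",
}

# Hypothetical predetermined group of grammars: identifier -> allowed phrases.
GRAMMARS: Dict[str, List[str]] = {
    "g-pizza": ["large pepperoni", "small cheese"],
    "g-flight": ["arrivals", "departures"],
}


def select_grammar(app_name: str) -> List[str]:
    """Resolve the selected application's grammar identifier to a grammar."""
    grammar_id = APP_GRAMMAR_IDS[app_name]
    return GRAMMARS[grammar_id]


def recognize(chunks: Iterable[bytes], grammar: List[str]) -> Iterator[Tuple[str, str]]:
    """Toy stand-in for the server side.

    Yields one ("partial", text) response per incoming chunk, then a single
    ("final", text) response once the input is exhausted. ASCII text chunks
    stand in for digitized speech (an assumption of this sketch).
    """
    heard = ""
    for chunk in chunks:
        heard += chunk.decode("ascii")
        yield ("partial", heard)
    # Final result: the utterance must match a phrase in the selected grammar.
    yield ("final", heard if heard in grammar else "no match")


if __name__ == "__main__":
    grammar = select_grammar("pizza_order")
    for kind, text in recognize([b"large ", b"pepperoni"], grammar):
        print(kind, text)
```

Modeling the responses as a generator keeps the claimed ordering explicit: partial responses are produced while chunks are still arriving, and the final response can only be produced afterwards.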