Title of invention: Application text entry in a mobile environment using a speech processing facility
Abstract: In embodiments of the present invention, improved capabilities are described for a mobile environment speech processing facility. The present invention may provide for the entering of text into a software application resident on a mobile communication facility, where recorded speech may be presented by the user using the mobile communication facility's resident capture facility. Transmission of the recording may be provided through a wireless communication facility to a speech recognition facility, and may be accompanied by information related to the software application. Results may be generated utilizing the speech recognition facility that may be independent of structured grammar, and may be based at least in part on the information relating to the software application and the recording. The results may then be transmitted to the mobile communication facility, where they may be loaded into the software application.
Publication number: US8880405(B2) Publication date: 2014.11.04
Application number: US200711865692 Filing date: 2007.10.01
Applicant: Vlingo Corporation Inventors: Cerra Joseph P.; Kishchenko Roman V.; Nguyen John N.; Phillips Michael S.; Shu Han
Classification: G10L25/00; G10L15/30 Main classification: G10L25/00
Agency: Holland & Knight LLP Agent: Holland & Knight LLP; Whittenberger, Esq. Mark H.
Principal claim: 1. A method of entering text into a software application resident on a mobile communication facility comprising: recording speech presented by a user using a mobile communication facility resident capture facility; transmitting the recording through a wireless communication facility to a speech recognition facility; transmitting contextual information relating to the software application to the speech recognition facility, wherein the contextual information relating to the software application includes an identity of the application, an identity of the mobile communication facility, and contextual information within the application; wherein the speech recognition facility transmits the contextual information within the application to a first server, wherein the first server decides to process said contextual information within the application or alternatively, the first server transmits the contextual information within the application to a second server and wherein the speech recognition facility further transmits the identity of the application to the second server; selecting at least one statistical language model from a plurality of language models; generating results utilizing the speech recognition facility using the at least one statistical language model based at least in part on the information relating to the software application and the recording; transmitting the results to the mobile communications facility; and loading the results into the software application.
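The data flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. Every class, function, and model name below (`Context`, `FirstServer`, `SecondServer`, `SpeechRecognitionFacility`, `select_language_model`, `enter_text`, and the entries in `LANGUAGE_MODELS`) is hypothetical, and so are the rule the first server uses to decide whether to process the in-application context and the fallback field name `body`. No actual speech decoding is performed; the recognized text is a placeholder string.

```python
from dataclasses import dataclass

# Hypothetical plurality of statistical language models, keyed by application identity.
LANGUAGE_MODELS = {"sms": "lm_messaging", "search": "lm_web_search", "default": "lm_general"}


@dataclass
class Context:
    app_id: str      # identity of the application
    device_id: str   # identity of the mobile communication facility
    in_app: dict     # contextual information within the application


def select_language_model(app_id: str) -> str:
    """Select one model from the plurality (assumed rule: look up by application identity)."""
    return LANGUAGE_MODELS.get(app_id, LANGUAGE_MODELS["default"])


class SecondServer:
    def __init__(self):
        self.app_ids = []

    def receive_app_identity(self, app_id: str) -> None:
        self.app_ids.append(app_id)

    def process_context(self, in_app: dict) -> dict:
        return dict(in_app, handled_by="second")


class FirstServer:
    def __init__(self, second: SecondServer):
        self.second = second

    def handle_context(self, in_app: dict) -> dict:
        # Assumed decision rule: process locally when the active field is known,
        # otherwise forward the in-application context to the second server.
        if in_app.get("field"):
            return dict(in_app, handled_by="first")
        return self.second.process_context(in_app)


class SpeechRecognitionFacility:
    def __init__(self, first: FirstServer, second: SecondServer):
        self.first, self.second = first, second

    def recognize(self, recording: bytes, ctx: Context) -> dict:
        handled = self.first.handle_context(ctx.in_app)  # in-app context goes to the first server
        self.second.receive_app_identity(ctx.app_id)     # app identity goes to the second server
        model = select_language_model(ctx.app_id)
        text = f"<{len(recording)} bytes decoded with {model}>"  # placeholder, no real decoding
        return {"text": text, "model": model, "context_handled_by": handled["handled_by"]}


def enter_text(app_fields: dict, recording: bytes, ctx: Context,
               facility: SpeechRecognitionFacility) -> dict:
    """Client side: send the recording plus context, then load the result into the application."""
    result = facility.recognize(recording, ctx)
    app_fields[ctx.in_app.get("field", "body")] = result["text"]
    return result
```

A caller would construct one `SecondServer`, pass it to both `FirstServer` and `SpeechRecognitionFacility`, and then call `enter_text` with the recorded bytes and a `Context` describing the active application and field.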
Address: Cambridge MA US