Abstract |
Systems, methods and apparatus for generating, distributing, and using speech recognition models. A shared speech processing facility is used to support speech recognition for a wide variety of devices with limited capabilities, including business computer systems, personal data assistants, etc., which are coupled to the speech processing facility via a communications channel, e.g., the Internet. Devices with audio capture capability record digitized speech, transmit it to the speech processing facility via the Internet, and receive speech processing services in response, e.g., speech recognition model generation and/or speech recognition services. The Internet is used to return speech recognition models and/or information identifying recognized words or phrases. The speech processing facility can be used to provide speech recognition capabilities to devices without such capabilities and/or to augment a device's speech processing capability. Voice dialing, telephone control and/or other services are provided by the speech processing facility in response to speech recognition results. |
Main claim |
1. A computer-implemented method comprising:
receiving, at a server and from a mobile device, a request including a speech data representation of an utterance or feature data extracted from the speech data representation of the utterance;
obtaining, by the server, a transcription of the utterance by applying a speech recognition model to the speech data representation of the utterance or the feature data extracted from the speech data representation of the utterance;
identifying, by the server, a keyword based on the transcription of the utterance; and
initiating a communication between the mobile device and another device based on the identified keyword. |
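
The four steps of claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `Request`, `CONTACTS`, `identify_keyword`, and `handle_request` are hypothetical, the speech recognition model is passed in as a placeholder callable, and the keyword lookup is a simple word match chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Mapping, Optional, Sequence, Union

Payload = Union[bytes, Sequence[float]]


@dataclass
class Request:
    """Hypothetical request from a mobile device.

    Carries either a speech data representation of an utterance
    or feature data extracted from it.
    """
    device_id: str
    speech_data: Optional[bytes] = None
    features: Optional[Sequence[float]] = None


# Hypothetical keyword table: spoken keyword -> address of the other device.
CONTACTS = {"mom": "+1-555-0100", "office": "+1-555-0101"}


def identify_keyword(transcription: str, keywords: Mapping[str, str]) -> Optional[str]:
    """Return the first known keyword found in the transcription, if any."""
    for word in transcription.lower().split():
        token = word.strip(".,!?")
        if token in keywords:
            return token
    return None


def handle_request(
    request: Request,
    recognize: Callable[[Payload], str],
    connect: Callable[[str, str], None],
) -> Optional[str]:
    """Server-side sketch of the claimed steps.

    `recognize` stands in for applying a speech recognition model;
    `connect` stands in for initiating the communication.
    """
    # Step 1: receive the request (speech data or extracted feature data).
    payload = request.speech_data if request.speech_data is not None else request.features
    if payload is None:
        raise ValueError("request carries neither speech data nor feature data")

    # Step 2: obtain a transcription by applying the recognition model.
    transcription = recognize(payload)

    # Step 3: identify a keyword based on the transcription.
    keyword = identify_keyword(transcription, CONTACTS)
    if keyword is None:
        return None

    # Step 4: initiate a communication between the mobile device and another device.
    connect(request.device_id, CONTACTS[keyword])
    return keyword
```

In this sketch, a caller would supply a real recognizer and call-setup function; a stub such as `recognize=lambda payload: "Call mom."` is enough to exercise the control flow.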