摘要 |
A user uses voice commands to modify the contents of a visual display through an audio input device where the audio input device does not necessarily have speech recognition capabilities. The audio input device, such as a telephone, captures audio including spoken voice commands from a user and transmits the audio to a remote system. The remote system is configured to use automated speech recognition to recognize the voice commands. The recognized commands are interpreted by the remote system to respond to the user by transmitting data to be displayed on the visual display. The visual display can be integrated with the audio input device, such as in a web-enabled mobile phone, a video phone or an internet video phone, or the visual display can be separate, such as on a television or a computer display.
|