摘要 |
Certain implementations of the disclosed technology include systems and methods for an enhanced speech recognition interface. According to an example implementation, a method includes outputting a first icon and second icon for presentation on a display device; responsive to receiving an indication of an input object being maintained at a first location of an input device, causing a recording device to record an audio signal; responsive to receiving an indication that the input object has moved across the input device from the first location of the input device to a second location of the input device, causing the recording device to stop recording the audio signal; outputting text, based on the recorded audio signal, for presentation on the display device; and responsive to receiving an indication of the input object being maintained at the second location of the input device, causing a portion of the text to be removed from presentation on the display device. |
主权项 |
1. A method comprising:
outputting, by a computing device, a first icon for presentation at a first location of a display device; outputting, by the computing device, a second icon for presentation at a second location of the display device; responsive to receiving, at the computing device, an indication of an input object being maintained, for at least a threshold amount of time, at a first location of an input device that is associated with the first location of the display device, causing, by the computing device, a recording device to record an audio signal; responsive to receiving, at the computing device, an indication that the input object has moved across the input device from the first location of the input device to a second location of the input device that is associated with the second location of the display device, causing, by the computing device, the recording device to stop recording the audio signal; outputting, by the computing device, text for presentation on the display device, the text being based on the recorded audio signal; responsive to receiving, at the computing device, an indication of the input object being maintained, for at least a threshold amount of time, at the second location of the input device, causing, by the computing device, a portion of the text to be removed from presentation on the display device. |