摘要 |
In a conditional multipass automatic speech recognition system, one or more intent templates may be received from an application. A spoken utterance is received and audio frames are generated from the utterance. The audio frames are compared to a first grammar. Recognized speech results are generated and unrecognized audio frames or low confidence frames are collected. One of one or more intent templates and one or more corresponding intent parameters may be determined based on the recognized speech results. The unrecognized audio frames may be conditionally compared to a second grammar in instances when additional information is requested, relative to the determined intent template or the corresponding intent parameters. |