Title of invention |
Speech recognition with hierarchical networks |
Abstract |
Provided are systems and methods for using hierarchical networks for recognition, such as speech recognition. Conventional automatic recognition systems may not be both efficient and flexible. Recognition systems are disclosed that may achieve efficiency and flexibility by employing hierarchical networks, prefix consolidation of networks, and future consolidation of networks. The disclosed networks may be associated with a network model and the associated network model may be modified during recognition to achieve greater flexibility. |
Publication number |
US9093061(B1) |
Publication date |
2015.07.28 |
Application number |
US201213434315 |
Filing date |
2012.03.29 |
Applicant |
Canyon IP Holdings, LLC. |
Inventors |
Secker-Walker Hugh; Basye Kenneth J.; Krishnamoorthy Mahesh |
Classification numbers |
G10L15/14; G10L15/00; G10L15/06 |
Main classification number |
G10L15/14 |
Agency |
Knobbe, Martens, Olson & Bear, LLP |
Agent |
Knobbe, Martens, Olson & Bear, LLP |
Main claim |
1. A computer-implemented method for performing speech recognition, comprising:
selecting, via at least one computer processor configured to execute specific instructions, a first set of word candidates from a plurality of word tokens, wherein the plurality of word tokens are associated with a language model and a word network of a hierarchy of networks;
selecting, via the at least one computer processor, a first set of speech unit candidates from a plurality of speech unit tokens, wherein the plurality of speech unit tokens are associated with a speech unit model and a speech unit network of the hierarchy of networks, wherein a word token of the plurality of word tokens corresponds to one or more speech tokens of the plurality of speech tokens;
receiving, via the at least one computer processor, audio input, wherein the audio input was captured via a microphone;
selecting, via the at least one computer processor, a second set of speech unit candidates from the plurality of speech unit tokens using the audio input and the first set of speech unit candidates;
recognizing, via the at least one computer processor, a word candidate in the first set of word candidates based at least partly on a correspondence of the word candidate to one or more speech unit candidates of the second set of speech unit candidates; and
selecting, via the at least one computer processor, a second set of word candidates from the plurality of word tokens based at least partly on recognition of the word candidate. |
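The flow of the claimed steps can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything in it is an assumption made for illustration: the toy lexicon `LEXICON` (standing in for the word network and its speech unit tokens), the successor table `SUCCESSORS` (standing in for the language model), the function names, and the simplification that the audio input has already been reduced to a sequence of speech unit labels rather than scored acoustically.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical word network: each word token maps to its speech unit tokens.
LEXICON: Dict[str, Tuple[str, ...]] = {
    "call": ("k", "ao", "l"),
    "cat": ("k", "ae", "t"),
    "mom": ("m", "aa", "m"),
    "me": ("m", "iy"),
}

# Hypothetical language model: which word tokens may follow a given word.
# "<s>" marks the start of an utterance.
SUCCESSORS: Dict[str, Set[str]] = {
    "<s>": {"call", "cat"},
    "call": {"mom", "me"},
    "cat": set(),
    "mom": set(),
    "me": set(),
}


def select_word_candidates(previous_word: str) -> Set[str]:
    """Word level: pick the word tokens the language model allows next."""
    return set(SUCCESSORS.get(previous_word, set()))


def select_speech_unit_candidates(words: Set[str], position: int) -> Set[str]:
    """Speech unit level: units the candidate words allow at this position."""
    return {LEXICON[w][position] for w in words if position < len(LEXICON[w])}


def recognize(audio_units: List[str]) -> List[str]:
    """Greedy toy decoder over pre-labelled speech units (an assumption)."""
    recognized: List[str] = []
    alive = select_word_candidates("<s>")  # first set of word candidates
    position = 0
    for unit in audio_units:  # "audio input", simplified to unit labels
        unit_candidates = select_speech_unit_candidates(alive, position)
        if unit not in unit_candidates:
            break  # no surviving hypothesis explains this unit
        # Narrow the hypotheses using the audio and the previous candidates.
        alive = {w for w in alive
                 if position < len(LEXICON[w]) and LEXICON[w][position] == unit}
        position += 1
        finished = sorted(w for w in alive if len(LEXICON[w]) == position)
        if finished:
            word = finished[0]  # recognize a word from its speech units
            recognized.append(word)
            alive = select_word_candidates(word)  # second set of word candidates
            position = 0
    return recognized


print(recognize(["k", "ao", "l", "m", "iy"]))
```

The sketch commits to a word as soon as its speech units are fully matched, which only works here because no word in the toy lexicon is a prefix of another. A real recognizer would keep multiple scored hypotheses alive instead of deciding greedily.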
地址 |
Wilmington DE US |