发明名称 Speech recognition semantic classification training
摘要 An automated method is described for developing an automated speech input semantic classification system such as a call routing system. A set of semantic classifications is defined for classification of input speech utterances, where each semantic classification represents a specific semantic classification of the speech input. The semantic classification system is trained from training data from training data substantially without manually transcribed in-domain training data, and then operated to assign input speech utterances to the defined semantic classifications. Adaptation training data based on input speech utterances is collected with manually assigned semantic labels from at least one source of already collected language data. When the adaptation training data satisfies a pre-determined adaptation criteria, the semantic classification system is automatically retrained based on the adaptation training data.
申请公布号 US9620110(B2) 申请公布日期 2017.04.11
申请号 US201414262919 申请日期 2014.04.28
申请人 Nuance Communications, Inc. 发明人 Duta Nicolae;Tremblay Réal;Mauro Andrew D.;Peters S. Douglas
分类号 G10L15/06;G10L15/18;G10L15/01;G10L15/183 主分类号 G10L15/06
代理机构 Hamilton, Brook, Smith & Reynolds, P.C. 代理人 Hamilton, Brook, Smith & Reynolds, P.C.
主权项 1. A computer based method implemented using at least one hardware implemented processor for developing an automated speech input semantic classification system, the method comprising: using the at least one hardware implemented processor to perform the steps of: defining a set of semantic classifications for classification of input speech utterances, each semantic classification representing a specific semantic classification of the speech input;training the semantic classification system from training data, the training including a) generating a plurality of subset language models, each of the subset language models corresponding to a respective subset of the training data, and b) applying each of the plurality of subset language models to recognize a non-respective subset of the training data;operating the semantic classification system to assign input speech utterances to the defined semantic classifications;obtaining adaptation training data based on input speech utterances with manually assigned semantic labels from at least one source of already collected language data; andwhen the adaptation training data satisfies a pre-determined adaptation criteria, automatically retraining the semantic classification system based on the adaptation training data.
地址 Burlington MA US