发明名称 Apparatus and method for analysis of language model changes
摘要 An apparatus, a method, and a machine-readable medium are provided for characterizing differences between two language models. A group of utterances from each of a group of time domains are examined. One of a significant word change or a significant word class change within the plurality of utterances is determined. A first cluster of utterances including a word or a word class corresponding to the one of the significant word change or the significant word class change is generated from the utterances. A second cluster of utterances not including the word or the word class corresponding to the one of the significant word change or the significant word class change is generated from the utterances.
申请公布号 US8892438(B2) 申请公布日期 2014.11.18
申请号 US201012881665 申请日期 2010.09.14
申请人 AT&T Intellectual Property II, L.P. 发明人 Gorin Allen Louis;Grothendieck John;Wright Jeremy Huntley Greet
分类号 G10L15/18;G10L15/183 主分类号 G10L15/18
代理机构 代理人
主权项 1. A method comprising: selecting a plurality of language models; for each period of a plurality of time periods: identifying a first utterance and a second utterance received during each time period, wherein the first utterance was recognized using a first language model of the plurality of language models and the second utterance was recognized using a second language model of the plurality of language models;identifying distinctions between the first utterance and the second utterance for each of the plurality of time periods;determining when a significant word usage change has occurred within the first language model and the second language model by comparing the distinctions to previously recorded distinctions; andwhen the significant word usage change is detected: identifying a word corresponding to the significant word usage change;generating, from the utterances, a first cluster of utterances comprising the word;generating, from the utterances, a second cluster of utterances not comprising the word; andupdating the plurality of language models using the first cluster of utterances and the second cluster of utterances.
地址 Atlanta GA US