Title of invention Language model adaptation based on filtered data
Abstract A method for adapting a language model for a context of a domain, comprising obtaining textual contents from a large source by a request directed to the context of the domain, discarding at least a part of the textual contents that contain textual terms determined as irrelevant to the context of the domain, thereby retaining, as retained data, at least a part of the textual contents that contain textual terms determined as relevant to the context of the domain, and adapting the language model by incorporating therein at least a part of the textual terms of the retained data, wherein the method is performed on at least one computerized apparatus configured to perform the method and equipped for communication with the large source, and an apparatus for performing the same.
Publication number US9564122(B2) Publication date 2017.02.07
Application number US201414224086 Filing date 2014.03.25
Applicant NICE LTD. Inventors Bretter Ronny;Artzi Shimrit;Nissan Maor
Classification G10L15/00;G06F17/27 Main classification G10L15/00
Agency Agent Nordman Soroker Agmon
Main claim 1. A method for adapting a language model for a context of a domain, comprising: from a source having textual information with a variety of phrases related to the context of the domain, obtaining textual contents as data directed to the context of the domain by querying the source with phrases representative of the subject matter of the domain regardless and irrespective of any language model; responsive to a state of a provided selector, determining in one state semantic relevancy or in another state semantic relevancy and lexical relevancy of the textual contents to the context of the domain; discarding at least a part of the textual contents that contain textual terms determined as irrelevant to the context of the domain, thereby retaining, as retained data, at least a part of the textual contents that contain textual terms determined as relevant to the context of the domain; and adapting the language model by incorporating therein at least a part of the textual terms of the retained data, wherein the method is performed on at least one computerized apparatus configured to perform the method and equipped for communication with the source.
Address Ra'anana IL
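
The steps recited in the main claim (query a source with domain phrases, judge relevancy according to a selector state, discard irrelevant contents, adapt the model with what is retained) can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything concrete in it is an assumption made for illustration: the in-memory list standing in for the source, the function names, the selector values "semantic" and "semantic+lexical", the use of token-overlap fractions as stand-ins for semantic and lexical relevancy, the thresholds, and the unigram count table standing in for the language model.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def query_source(source, phrases):
    """Obtain the documents that contain at least one domain phrase."""
    lowered = [p.lower() for p in phrases]
    return [doc for doc in source if any(p in doc.lower() for p in lowered)]


def fraction_in(tokens, vocabulary):
    """Share of tokens found in the vocabulary (0.0 when there are no tokens)."""
    if not tokens:
        return 0.0
    return sum(t in vocabulary for t in tokens) / len(tokens)


def filter_relevant(docs, domain_terms, lexicon, selector,
                    min_semantic=0.4, min_lexical=0.8):
    """Discard irrelevant documents and return the retained ones.

    Assumed proxies: "semantic relevancy" is the share of tokens that are
    domain terms, "lexical relevancy" is the share of tokens found in a
    known-word lexicon. The selector decides whether the second check runs.
    """
    retained = []
    for doc in docs:
        tokens = tokenize(doc)
        if fraction_in(tokens, domain_terms) < min_semantic:
            continue  # judged semantically irrelevant
        if (selector == "semantic+lexical"
                and fraction_in(tokens, lexicon) < min_lexical):
            continue  # judged lexically irrelevant
        retained.append(doc)
    return retained


def adapt_language_model(base_counts, retained_docs):
    """Return new unigram counts with the retained terms incorporated."""
    adapted = Counter(base_counts)
    for doc in retained_docs:
        adapted.update(tokenize(doc))
    return adapted


# Hypothetical data for a "credit card payments" domain.
SOURCE = [
    "My credit card payment was declined at checkout",
    "credit card lol xqzt asdf payment",
    "The weather is sunny today",
    "I saw a credit card ad during the football game",
]
PHRASES = ["credit card"]
DOMAIN_TERMS = {"credit", "card", "payment", "declined", "checkout", "billing"}
LEXICON = DOMAIN_TERMS | {"my", "was", "at", "i", "saw", "a", "ad",
                          "during", "the", "football", "game"}

fetched = query_source(SOURCE, PHRASES)
retained = filter_relevant(fetched, DOMAIN_TERMS, LEXICON, "semantic+lexical")
adapted = adapt_language_model(Counter({"the": 10, "payment": 1}), retained)
```

With the selector set to "semantic", the second sample document would also be retained, since only the domain-term share is checked. A production system would likely replace the overlap fractions with stronger relevancy measures and the count table with a full n-gram or neural model; the sketch only shows how the selector gates the filtering step ahead of adaptation.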