发明名称 Incremental training for probabilistic categorizer
摘要 A probabilistic document categorizer has an associated vocabulary of words and an associated plurality of probabilistic categorizer parameters derived from a collection of documents. A new document is received. The probabilistic categorizer parameters are updated to reflect addition of the new document to the collection of documents based on vocabulary words contained in the new document, a category of the new document, and a collection size parameter indicative of an effective total number of instances of vocabulary words in the collection of documents.
申请公布号 US2007005340(A1) 申请公布日期 2007.01.04
申请号 US20050170019 申请日期 2005.06.29
申请人 XEROX CORPORATION 发明人 GOUTTE CYRIL;GAUSSIER ERIC
分类号 G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项
地址