发明名称 FEATURE WEIGHTING FOR NAIVE BAYES CLASSIFIERS USING A GENERATIVE MODEL
摘要 A method for classifying a new text document using a collection of training instances with class known and the class is not known, includes: first parameter learning step of estimating the word distribution θz for each class z; second parameter learning step of estimating the background distribution γ, and the degree of interpolation δ between γ and θz, such that the probability of observing the collection of all of the instances with known and unknown classes is maximized; classification step, including calculating for each word of a new instance, the probabilities that the word is generated from the word distribution θz and from the background distribution γ; combining the two probabilities using δ; and combining the probabilities of all words to estimate document probability for the class z that indicates the document generated from the class z; the new instance being classified as a class z* for which the document probability is the highest.
申请公布号 WO2015194052(A1) 申请公布日期 2015.12.23
申请号 WO2014JP67090 申请日期 2014.06.20
申请人 NEC CORPORATION 发明人 ANDRADE SILVA, DANIEL GEORG;MIZUGUCHI, HIRONORI;ISHIKAWA, KAI
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址