发明名称 Technique which utilizes a probabilistic classifier to detect junk e-mail by automatically updating a training set and re-training the classifier based on the updated training set
摘要 A technique, specifically a method and apparatus that implements the method, which through a probabilistic classifier (370) and, for a given recipient, detects electronic mail (e-mail) messages, in an incoming message stream, which that recipient is likely to consider "junk". Specifically, the invention discriminates message content for that recipient, through a probabilistic classifier (e.g., a support vector machine) trained on prior content classifications. Through a resulting quantitative probability measure, i.e., an output confidence level, produced by the classifier for each message and subsequently compared against a predefined threshold, that message is classified as either, e.g., spam or legitimate mail, and, e.g., then stored in a corresponding folder (223, 227) for subsequent retrieval by and display to the recipient. Based on the probability measure, the message can alternatively be classified into one of a number of different folders, depicted in a pre-defined visually distinctive manner or simply discarded in its entirety.
申请公布号 US6161130(A) 申请公布日期 2000.12.12
申请号 US19980102837 申请日期 1998.06.23
申请人 MICROSOFT CORPORATION 发明人 HORVITZ, ERIC;HECKERMAN, DAVID E.;DUMAIS, SUSAN T.;SAHAMI, MEHRAN;PLATT, JOHN C.
分类号 G06F17/30;G06K9/62;G06Q10/00;H04L12/58;(IPC1-7):G06F15/16;G06F15/173 主分类号 G06F17/30
代理机构 代理人
主权项
地址