Title of invention: Method for developing a classifier for classifying communications
Abstract: A computer assisted/implemented method for developing a classifier for classifying communications includes roughly four stages, which are designed to be iterative: (1) a stage defining where and how to harvest messages (i.e., from Internet message boards, newsgroups and the like), which also defines an expected domain of application for the classifier; (2) a guided question/answering stage in which the computerized tool elicits the user's criteria for determining whether a message is relevant or irrelevant; (3) a labeling stage where the user examines carefully selected messages and provides feedback about whether or not each message is relevant and sometimes also which elements of the criteria were used to make the decision; and (4) a performance evaluation stage where parameters of the classifier training are optimized, the best classifier is produced, and known performance bounds are calculated. In the guided question/answering stage, the criteria are parameterized in such a way that (a) they can be operationalized into the text classifier through key words and phrases, and (b) human-readable criteria can be produced, which can be reviewed and edited. The labeling phase is oriented toward an extended Active Learning framework. That is, the exemplary embodiment decides which example messages to show the user based upon what category of messages the system thinks would be most useful to the Active Learning process.
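The labeling stage described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract's "extended Active Learning framework" selects messages by category, whereas this sketch substitutes plain uncertainty sampling (show the user the message whose predicted relevance is closest to 0.5). The class `KeywordClassifier`, the function `pick_most_uncertain`, the `step` parameter, and the sample keywords and messages are all hypothetical names and data introduced only for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (an assumption; real systems do more)."""
    return text.lower().split()


class KeywordClassifier:
    """Toy relevance scorer seeded from user criteria keywords (hypothetical)."""

    def __init__(self, relevant_keywords, irrelevant_keywords):
        # Per-word weights: positive suggests relevant, negative irrelevant.
        self.weights = Counter()
        for word in relevant_keywords:
            self.weights[word.lower()] += 1.0
        for word in irrelevant_keywords:
            self.weights[word.lower()] -= 1.0

    def probability(self, message):
        """Estimated probability that the message is relevant (logistic of score)."""
        score = sum(self.weights[token] for token in tokenize(message))
        return 1.0 / (1.0 + math.exp(-score))

    def update(self, message, is_relevant, step=0.5):
        """Nudge the weight of each distinct word toward the user's label."""
        delta = step if is_relevant else -step
        for token in set(tokenize(message)):
            self.weights[token] += delta


def pick_most_uncertain(classifier, unlabeled):
    """Uncertainty sampling: return the message whose probability is nearest 0.5."""
    return min(unlabeled, key=lambda m: abs(classifier.probability(m) - 0.5))


if __name__ == "__main__":
    clf = KeywordClassifier(relevant_keywords=["camera"], irrelevant_keywords=["spam"])
    pool = ["camera lens review", "buy spam now", "weather today"]
    candidate = pick_most_uncertain(clf, pool)
    print("Ask the user to label:", candidate)
    clf.update(candidate, is_relevant=False)  # simulated user feedback
    print("Updated relevance estimate:", clf.probability(candidate))
```

Seeding the weights from keywords mirrors the abstract's point that the elicited criteria are operationalized through key words and phrases; the update rule here is a deliberately simple stand-in for real classifier training.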
Publication number: US7725414(B2)  Publication date: 2010.05.25
Application number: US20040801758  Filing date: 2004.03.16
Applicant: BUZZMETRICS, LTD AN ISRAEL CORPORATION  Inventors: NIGAM KAMAL P.;STOCKTON ROBERT G.
Classification: G06N5/04;G06F7/00;G06F17/30  Main classification: G06N5/04
Agency:  Agent:
Principal claim:
Address: