发明名称 AUTOMATIC CATEGORIZATION OF DOCUMENTS BASED ON TEXTUAL CONTENT
摘要 An electronic device automatically classifies documents based upon textual content. Documents may be classified into document categories. Statistical characteristics are gathered for each document category and these statistical characteristics are used as a frame of reference in determining how to classify the document. The document categories may be intersecting or non-intersecting. A neutral category is used to represent documents that do not fit fit into many of the other specified categories. The statistical characteristic for an input document are compared with those for the document category and for the neutral category in making a determination on how to categorize the document. This approach is extensible, generalizable and efficient.
申请公布号 WO0213055(A2) 申请公布日期 2002.02.14
申请号 WO2001US41669 申请日期 2001.08.09
申请人 ELRON SOFTWARE, INC. 发明人 SMADJA, FRANK
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址