发明名称 Classification method and apparatus
摘要 <p>A method for building a classification model for classifying unclassified documents based on the classification of a plurality of documents which respectively have been classified as belonging to one of a plurality of classes, said documents being digitally represented in a computer, said documents respectively comprising a plurality of terms which respectively comprise one or more symbols of a finite set of symbols, and said method comprising the following steps: representing each of said plurality of documents by a vector of n dimensions, said n dimensions forming a vector space, whereas the value of each dimension of said vector corresponds to the frequency of occurrence of a certain term in the document corresponding to said vector, so that said n dimensions span up a vector space; representing the classification of said already classified documents into classes by separating said vector space into a plurality of subspaces by one or more hyperplanes, such that each subspace comprises one or more documents as represented by their corresponding vectors in said vector space, so that said each subspace corresponds to a class. <IMAGE></p>
申请公布号 EP1049030(A1) 申请公布日期 2000.11.02
申请号 EP19990108354 申请日期 1999.04.28
申请人 SER SYSTEME AG PRODUKTE UND ANWENDUNGEN DER DATENVERARBEITUNG 发明人 RUJAN, PAL;URBSCHAT, HARRY
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址