Title of invention DOCUMENT CLASSIFYING DEVICE
Abstract PROBLEM TO BE SOLVED: To appropriately classify a large number of documents that are linked in a complex manner, such as hypertext, by generating initial document clusters on the basis of link relations and document distances and then performing cluster analysis based on the document distances. SOLUTION: A document storage part 11 stores electronic documents, and a link relation storage part 12 stores the link relations among the documents stored in the document storage part 11. A distance calculation processing part 13 calculates the document distances from the appearance frequencies of the words contained in each document stored in the document storage part 11. A document classification processing part 14 then generates initial document clusters on the basis of the stored link relations and the calculated document distances, and performs cluster analysis based on the document distances to classify the documents stored in the document storage part 11. Finally, an output processing part 15 outputs the classification result produced by the document classification processing part 14.
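The processing flow described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the distance measure, the rule for combining link relations with distances, or the clustering method, so the choices below are assumptions made for illustration only: cosine distance over word counts, merging of linked documents whose distance falls under a hypothetical `link_threshold`, and average-linkage agglomerative merging down to a hypothetical `target_clusters`. All function and parameter names are likewise hypothetical and do not come from the patent.

```python
import math
from collections import Counter
from itertools import combinations


def word_frequencies(text):
    # Appearance frequency of each word (whitespace tokenization is an assumption).
    return Counter(text.lower().split())


def document_distance(freq_a, freq_b):
    # Assumed measure: cosine distance between word-frequency vectors.
    dot = sum(freq_a[w] * freq_b[w] for w in freq_a.keys() & freq_b.keys())
    norm_a = math.sqrt(sum(c * c for c in freq_a.values()))
    norm_b = math.sqrt(sum(c * c for c in freq_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def initial_clusters(doc_ids, links, dist, link_threshold):
    # Assumed rule: two documents start in the same cluster when they are
    # linked AND their distance is at most link_threshold (union-find).
    parent = {d: d for d in doc_ids}

    def find(d):
        while parent[d] != d:
            parent[d] = parent[parent[d]]
            d = parent[d]
        return d

    for a, b in links:
        if a != b and dist[frozenset((a, b))] <= link_threshold:
            parent[find(a)] = find(b)

    groups = {}
    for d in doc_ids:
        groups.setdefault(find(d), set()).add(d)
    return list(groups.values())


def cluster_distance(c1, c2, dist):
    # Average-linkage distance between two clusters (an assumed choice).
    total = sum(dist[frozenset((a, b))] for a in c1 for b in c2)
    return total / (len(c1) * len(c2))


def classify(documents, links, link_threshold=0.5, target_clusters=2):
    # documents: {doc_id: text}; links: iterable of (doc_id, doc_id) pairs.
    freqs = {d: word_frequencies(t) for d, t in documents.items()}
    dist = {
        frozenset((a, b)): document_distance(freqs[a], freqs[b])
        for a, b in combinations(documents, 2)
    }
    clusters = initial_clusters(list(documents), links, dist, link_threshold)
    # Cluster analysis based on document distances: repeatedly merge the
    # closest pair of clusters until the target count is reached.
    while len(clusters) > target_clusters:
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: cluster_distance(clusters[p[0]], clusters[p[1]], dist),
        )
        clusters[i] |= clusters[j]
        del clusters[j]
    return clusters
```

In this sketch the link relations only seed the starting clusters, while the subsequent merging relies on document distances alone, which mirrors the two stages named in the abstract. A call such as `classify(documents, links)` returns a list of sets of document identifiers.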
Publication number JPH1027125(A) Publication date 1998.01.27
Application number JP19960199543 Filing date 1996.07.11
Applicant FUJI XEROX CO LTD Inventor MASUICHI HIROSHI
Classification G06F12/00;G06F17/21;G06F17/27;G06F17/30 Main classification G06F12/00
Agency Agent
Main claim
Address