发明名称 System And Method For Clustering Unstructured Documents
摘要 A system and method for clustering unstructured documents is provided. Documents having terms with frequencies of occurrence that satisfy upper and lower edge conditions are selected. Concepts are generated for the selected documents. The selected documents are grouped into clusters of the documents. A weight for each of the clusters is evaluated. A similarity value is determined from the frequencies of occurrence for at least one of the terms from the concepts and the cluster weights for each selected document. Each selected document is assigned into one such cluster based on the similarity value of the selected document.
申请公布号 US2008104063(A1) 申请公布日期 2008.05.01
申请号 US20070964000 申请日期 2007.12.24
申请人 GALLIVAN DAN;KAWAI KENJI 发明人 GALLIVAN DAN;KAWAI KENJI
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 代理人
主权项
地址