发明名称 |
Document and pattern clustering method and apparatus |
摘要 |
<p>In document (or pattern) clustering, the correct number of clusters and accurate assignment of each document (or pattern) to the correct cluster are attained. Documents (or patterns) describing the same topic (or object) are grouped, so a document (or pattern) group belonging to the same cluster has some commonality. Each topic (or object) has distinctive terms (or object features) or term (or object feature) pairs. When the closeness of each document (or pattern) to a given cluster is obtained, common information about the given cluster is extracted and used while the influence of terms (or object features) or term (or object feature) pairs not distinctive to the given cluster is excluded.</p> |
申请公布号 |
EP1455285(A3) |
申请公布日期 |
2006.12.20 |
申请号 |
EP20040251279 |
申请日期 |
2004.03.05 |
申请人 |
HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P. |
发明人 |
KAWATANI, TAKAHIKO |
分类号 |
G06F17/30;G06F7/00;G06F17/21;G06K9/62 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|