发明名称 |
APPARATUS FOR PROVIDING DOCUMENT CLUSTERING USING RE-WEIGHTED TERM |
摘要 |
A clustering apparatus and a method thereof are presented to minimize semantic difference between a cluster required by a user and a cluster provided by a system. According to a document clustering apparatus using a terminology weight recalculation, inputted document information is dissembled into each sentence information and invalid words are removed and primitive is extracted. A pre-processor(200) clusters documents on the basis of non-negative number matrix factorization, by generating a terminology-sentence matrix. A document clustering part(300) recalculates weight of the terminology and generates document clusters.
|
申请公布号 |
KR100876319(B1) |
申请公布日期 |
2008.12.31 |
申请号 |
KR20070081006 |
申请日期 |
2007.08.13 |
申请人 |
INHA-INDUSTRY PARTNERSHIP INSTITUTE |
发明人 |
LEE, JU HONG;PARK, SUN;KIM, DEOK HWAN;AHN, CHAN MIN |
分类号 |
G06F17/10 |
主分类号 |
G06F17/10 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|