发明名称 APPARATUS FOR PROVIDING DOCUMENT CLUSTERING USING RE-WEIGHTED TERM
摘要 A clustering apparatus and a method thereof are presented to minimize semantic difference between a cluster required by a user and a cluster provided by a system. According to a document clustering apparatus using a terminology weight recalculation, inputted document information is dissembled into each sentence information and invalid words are removed and primitive is extracted. A pre-processor(200) clusters documents on the basis of non-negative number matrix factorization, by generating a terminology-sentence matrix. A document clustering part(300) recalculates weight of the terminology and generates document clusters.
申请公布号 KR100876319(B1) 申请公布日期 2008.12.31
申请号 KR20070081006 申请日期 2007.08.13
申请人 INHA-INDUSTRY PARTNERSHIP INSTITUTE 发明人 LEE, JU HONG;PARK, SUN;KIM, DEOK HWAN;AHN, CHAN MIN
分类号 G06F17/10 主分类号 G06F17/10
代理机构 代理人
主权项
地址