发明名称 SYSTEM AND METHOD FOR CLASSIFYING DOCUMENT
摘要 A document classification system and a document classification method are provided to classify target documents on the basis of interrelation among keywords by grouping plural keywords while considering interrelation rate of the keywords in accordance with frequency appearing together in the target documents. A document classification system(200) includes a keyword extractor(210), an interrelation rate calculator(220), a cluster generator(230) and a classification learning unit(240). The keyword extractor extracts plural keywords from target documents and identifies sequentially the extracted keywords as index keywords. The interrelation rate calculator calculates interrelation rate among the identified index keywords and the extracted keywords with exception of the index keywords. The cluster generator groups the index keywords whose interrelation rate is within an allowance value into a cluster. The classification learning unit classifies the target documents by using the generated cluster. Meanwhile, a cluster search engine(110) which is located inside or outside the document classification system discriminates classification keywords corresponding to inquiries inputted from the user and searches the object of documents related to the classified keywords.
申请公布号 KR20080041388(A) 申请公布日期 2008.05.13
申请号 KR20060109423 申请日期 2006.11.07
申请人 NHN CORPORATION 发明人 KOO, JONG MAN;DO, GWAN PYO
分类号 G06F17/21;G06F17/30 主分类号 G06F17/21
代理机构 代理人
主权项
地址