摘要 |
PURPOSE: A document classification system and a method thereof are provided to accurately classify documents based on the frequency of morphemes and vector information. CONSTITUTION: A morpheme analysis device(100) analyzes morphemes in a document by adding vector information to the morphemes. A classification unit(200) receives the vector information and the morphemes and extracts the vector information from each morpheme. The classification unit classifies the document by calculating the frequency value of the vector information and the morphemes corresponding to the vector information. A first DB(DataBase) stores the frequency value, and a second DB stores the number of document including the vector information. [Reference numerals] (100) Morpheme analysis device; (200) Document classification unit; (300) New document processor; (AA) Policy; (BB) Shopping; (CC) Education; (DD,HH) Electronic; (EE,LL) Vehicle; (FF) Primary document classification; (GG) Secondary document classification; (II) Cellular phone; (JJ) Appliance; (KK) Hardware; (MM) Abroad; (NN) Domestic; (OO) Performance;
|