发明名称 Document automatic classification system, unnecessary word determination method and document automatic classification method
摘要 It is an object of the present invention to eliminate unnecessary words effectively in document automatic classification. A document automatic classification system comprising a classified document set storage device 21 for storing documents classified according to category, a category table generation unit 31 for generating a table broken down by category including information on a frequency of appearance of a word contained in a document acquired from the classified document set storage device 21, an unnecessary word determination and elimination unit 32 for eliminating an unnecessary word for each category from the table on the basis of a frequency of appearance in each category of a given word acquired from the table broken down by category generated by the category table generation unit 31, a classification catalog storage device 22 for storing the table from which the unnecessary word was eliminated by the unnecessary word determination and elimination unit 32, a classification target document storage device 23 for storing documents to be classified, and a document classification processing unit 33 for classifying the documents to be classified stored in the classification target document storage device 23 by using the table stored in the classification catalog storage device 22.
申请公布号 US2004083224(A1) 申请公布日期 2004.04.29
申请号 US20030688217 申请日期 2003.10.15
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 YOSHIDA ISSEI
分类号 G06F17/30;G06F17/00;(IPC1-7):G06F17/00 主分类号 G06F17/30
代理机构 代理人
主权项
地址