发明名称 METHOD, DEVICE, AND PROGRAM FOR AUTOMATICALLY CLASSIFYING TEXT, AND RECORDING MEDIUM
摘要 PROBLEM TO BE SOLVED: To provide a method, device, and program for automatically classifying texts making it possible to improve classifying accuracy, and to provide a recording medium. SOLUTION: The method for automatically classifying texts uses texts composed of character-string information as the subject of processing, and includes letting a user input a plurality (N) of independent texts, and classifying the texts into a group whose number is smaller than N, based on the similarities among the texts. A process for adapting a cluster analysis processing to the input N texts is carried out, and if two clusters that resemble most are detected as a set of clusters during the process, for a plurality of elements constituting each cluster of the set of clusters, the similarities among the elements are determined among the clusters and compared with a predetermined threshold; when the comparison result does not meet predetermined requirements, the set of clusters is excluded from those whose similarities are to be determined. COPYRIGHT: (C)2004,JPO&NCIPI
申请公布号 JP2004206355(A) 申请公布日期 2004.07.22
申请号 JP20020373868 申请日期 2002.12.25
申请人 NIPPON TELEGR & TELEPH CORP <NTT> 发明人 SUGIZAKI MASAYUKI;MAKINO TOSHIAKI;MIYAMOTO MASARU;IBARAKI HISASHI
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址