发明名称 DOCUMENT CHARACTERISTIC ANALYSIS DEVICE FOR DOCUMENT TO BE SURVEYED.
摘要 <p>input means (1) for inputting a document-to-be-surveyed d and documents-to-be-compared P; index entry word extraction means (120) for extracting an index entry word from the document-to-be-surveyed d; first appearance frequency calculation means (142) for calculating a function value IDF (P) of the appearance frequency of the extracted index entry word in the documents-to-be-compared P; similar documents selecting means (160) for selecting similar documents S similar to the document-to-be-surveyed d in the documents-to-be-compared P according to the data on the document-to-be-surveyed d; second appearance frequency calculation means (171) for calculating the function value IDF (S) of the appearance frequency of the extracted index entry word in the similar documents S; and output means (4) for outputting each index entry and its positioning data according to the combination of the function values of the respective appearance frequencies in the documents-to-be-compared and the similar documents which have been calculated. Thus, it is possible to accurately grasp the feature of the document-to-be-surveyed.</p>
申请公布号 MXPA06004513(A) 申请公布日期 2006.09.04
申请号 MX2006PA04513 申请日期 2004.10.13
申请人 INTELLECTUAL PROPERTY BANK CORP. 发明人 HARU-TADA SATO
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址