发明名称 Apparatus for retrieving similar documents and apparatus for extracting relevant keywords
摘要 After three kinds of data, i.e., a keyword frequency-of-appearance (103), a document length (105), and a keyword weight (107) are produced, a document profile vector (111) and a keyword profile vector (109) are calculated. Then, by independently performing the weighted principal component analysis (112,114) considering the document length and the keyword weight, a document feature vector and a keyword feature vectors are obtained. Then, documents and keywords having higher similarity to the feature vectors calculated with reference to the retrieval and extracting conditions are obtained and displayed. <IMAGE>
申请公布号 EP1168202(A2) 申请公布日期 2002.01.02
申请号 EP20010305637 申请日期 2001.06.28
申请人 MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. 发明人 KANNO, YUJI
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址