Title of invention: METHOD AND SYSTEM FOR RETRIEVING, DETECTING AND IDENTIFYING MAIN CLUSTER AND OUTLIER CLUSTER IN LARGE SCALE DATABASE, RECORDING MEDIUM AND SERVER
Abstract: PROBLEM TO BE SOLVED: To provide a method and a system for detecting, retrieving and identifying main clusters and outlier clusters in a large-scale database, together with a recording medium and a server. SOLUTION: The method includes a step of generating a document matrix from the documents by using at least one attribute; a step of generating a scaled residual matrix from the document matrix by using a prescribed function; a step of performing singular value decomposition to obtain the basis vector corresponding to the largest singular value; a step of reconstructing the residual matrix and dynamically scaling the reconstructed residual matrix to obtain another basis vector; a step of repeating the singular value decomposition step through the reconstruction step to generate a prescribed set of basis vectors; and a step of performing dimensionality reduction of the document matrix to detect, retrieve and identify documents in the database.
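The steps in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: the abstract does not give the "prescribed function", so the `scale_fn` default below is a hypothetical placeholder, and the function names `rescaled_basis` and `reduce_documents`, the row-per-document layout, and the row-norm scaling are likewise assumptions made for the example.

```python
import numpy as np


def rescaled_basis(doc_matrix, k, scale_fn=None):
    """Build up to k basis vectors by repeated SVD of a rescaled residual.

    doc_matrix: documents as rows, attributes (e.g. keywords) as columns.
    scale_fn:   hypothetical stand-in for the abstract's "prescribed
                function"; maps the largest residual row norm to an exponent q.
    """
    A = np.asarray(doc_matrix, dtype=float)
    if scale_fn is None:
        # Assumed placeholder rule, not taken from the patent text.
        scale_fn = lambda t_max: 1.0 / t_max if t_max > 1.0 else 1.0

    R = A.copy()  # the residual starts as the document matrix itself
    basis = []
    for _ in range(k):
        norms = np.linalg.norm(R, axis=1)
        t_max = norms.max()
        if t_max < 1e-12:  # nothing left to explain
            break
        q = scale_fn(t_max)  # "dynamic" scaling: recomputed every pass
        Rs = R * (norms ** q)[:, None]  # weight each row by |r_i|**q
        # The first right singular vector belongs to the largest singular value.
        _, _, vt = np.linalg.svd(Rs, full_matrices=False)
        b = vt[0]
        basis.append(b)
        # Reconstruct the residual: remove the component along b from every row.
        R = R - np.outer(R @ b, b)
    return np.array(basis)


def reduce_documents(doc_matrix, basis):
    """Project documents onto the basis (dimensionality reduction)."""
    return np.asarray(doc_matrix, dtype=float) @ basis.T


# Usage on a small synthetic matrix (6 documents, 4 attributes).
docs = np.random.default_rng(0).random((6, 4))
basis = rescaled_basis(docs, k=3)
reduced = reduce_documents(docs, basis)
```

Rescaling the residual before each decomposition is intended to keep documents that are still poorly represented from being swamped by the dominant clusters, which is how small outlier clusters can remain visible in the reduced space. Retrieval would then compare a projected query against the rows of `reduced`, for example by cosine similarity.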
Publication number: JP2003030222(A)  Publication date: 2003.01.31
Application number: JP20010205183  Filing date: 2001.07.05
Applicant: INTERNATL BUSINESS MACH CORP <IBM>  Inventors: KOBAYASHI MEI; AONO MASAKI; SAGAWA HIKARI; TAKEUCHI HIROYOSHI
Classification: G06F17/16; G06F7/00; G06F17/18; G06F17/30; (IPC1-7): G06F17/30  Main classification: G06F17/16
Agency:  Agent:
Main claim:
Address: