Title of invention: Method and apparatus for digitally shredding similar documents within large document sets in a data processing environment
Abstract: A method and apparatus are disclosed for comparing an input (query) file against a set of files to detect similarities, and for digitally shredding files that match the query file to some degree, directly from within the comparison feature. Using a comparison program, the query file is compared with each non-query file in a data processing system, which may range from a stand-alone computer to an enterprise computing network. A list of non-query files having some degree of similarity to the query file is compiled and presented to the user through a user interface within the comparison program. Some or all of the listed non-query files can then be deleted by marking their names in the list. The comparison program may use clustering, coalescing, or both; known hashing techniques; or other comparison algorithms.
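The workflow described in the abstract (compare a query file with each non-query file, list the similar ones, then shred the marked ones) can be illustrated with a minimal sketch. The record does not specify the comparison algorithm, so the sketch assumes k-character hashing with Jaccard similarity as a stand-in; the function names, the window size `k`, the `threshold`, and the single-pass random overwrite are all hypothetical choices, not the patented method.

```python
import hashlib
import os
from pathlib import Path


def fingerprints(text: str, k: int = 5) -> set[str]:
    """Hash every k-character window of lowercased, whitespace-normalized text."""
    normalized = " ".join(text.lower().split())
    if len(normalized) < k:
        # Text shorter than one window: hash it whole (or return nothing if empty).
        return {hashlib.sha1(normalized.encode()).hexdigest()} if normalized else set()
    return {
        hashlib.sha1(normalized[i:i + k].encode()).hexdigest()
        for i in range(len(normalized) - k + 1)
    }


def similarity(a: set[str], b: set[str]) -> float:
    """Jaccard similarity of two fingerprint sets, from 0.0 to 1.0."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def find_similar(query: Path, candidates: list[Path], threshold: float = 0.5):
    """Return (path, score) pairs for non-query files at or above the threshold."""
    q = fingerprints(query.read_text(errors="ignore"))
    hits = []
    for path in candidates:
        if path == query:
            continue  # never compare the query file with itself
        score = similarity(q, fingerprints(path.read_text(errors="ignore")))
        if score >= threshold:
            hits.append((path, score))
    # Most similar first, as a user-facing list would likely present them.
    return sorted(hits, key=lambda h: h[1], reverse=True)


def shred(path: Path, passes: int = 1) -> None:
    """Overwrite the file contents with random bytes, then remove the file."""
    size = path.stat().st_size
    with open(path, "r+b") as f:
        for _ in range(passes):
            f.seek(0)
            f.write(os.urandom(size))
            f.flush()
            os.fsync(f.fileno())
    path.unlink()
```

In this sketch the user's marking step would correspond to choosing entries from the list returned by `find_similar` and passing each chosen path to `shred`. Overwriting in place does not guarantee that the data is unrecoverable on journaling file systems or solid-state drives, so the `shred` function here only illustrates the idea.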
Publication number: AU5136699(A)  Publication date: 2000.02.21
Application number: AU19990051366  Filing date: 1999.07.30
Applicant: THE REGENTS OF THE UNIVERSITY OF CALIFORNIA  Inventor: ALEXANDER AIKEN
Classification: G06F17/22; G06F17/27; G06F17/30  Main classification: G06F17/22
Agency:  Agent:
Main claim:
Address: