摘要 |
<P>PROBLEM TO BE SOLVED: To provide a method for automatically selecting a subset of documents from a set of a large quantity of documents based on a visual appearance. <P>SOLUTION: A method for automatically selecting sample pages from many documents for proofing comprises: accessing a knowledge aggregate including a plurality of documents therein; characterizing at least part of the knowledge aggregate in a multi-dimensional vector space; grouping the documents into a plurality of groups with a cluster analysis method; and automatically selecting a subset of the knowledge aggregate from at least one group for display including conversion preparation for rendering and proofing. <P>COPYRIGHT: (C)2012,JPO&INPIT |