Abstract
Arbitrarily large document collections are processed by expanding a focus set containing at least one initial metadocument (82) into a plurality of subsequent metadocuments (83, 84, 85, 86), the number of which is approximately equal to a predetermined maximum number. The subsequent metadocuments are then clustered into a predetermined number of new metadocuments, which are summarized and presented to a user. The focus set is then redefined to include only the new metadocuments selected by the user.
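
As an illustration only, one pass of the expand, cluster, summarize, and select loop described above might be sketched in Python as follows. The abstract does not specify how metadocuments are represented, clustered, or summarized, so everything here is an assumption made for the sketch: the names `MetaDoc`, `leaf`, `merge`, `expand`, `cluster`, `summarize`, and `refine_focus` are hypothetical, metadocuments are modeled as tree nodes holding aggregate term counts, clustering uses cosine similarity with farthest-first seeding, and a summary is simply the most frequent terms.

```python
import math
from collections import Counter
from dataclasses import dataclass, field


@dataclass(eq=False)  # identity comparison, so list.remove() targets the exact node
class MetaDoc:
    """Hypothetical metadocument: aggregate term counts plus child nodes."""
    terms: Counter
    children: list = field(default_factory=list)  # empty for a single document


def leaf(text):
    """Wrap one document as a childless metadocument."""
    return MetaDoc(Counter(text.lower().split()))


def merge(nodes):
    """Build a new metadocument whose children are the given nodes."""
    total = Counter()
    for n in nodes:
        total.update(n.terms)
    return MetaDoc(total, list(nodes))


def expand(focus, max_count):
    """Replace metadocuments by their children while staying within max_count."""
    current = list(focus)
    while True:
        fits = [m for m in current
                if m.children and len(current) - 1 + len(m.children) <= max_count]
        if not fits:
            return current
        biggest = max(fits, key=lambda m: sum(m.terms.values()))
        current.remove(biggest)
        current.extend(biggest.children)


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def cluster(metadocs, k):
    """Group metadocs into at most k new metadocuments (assumed method)."""
    if not metadocs:
        return []
    seeds = [metadocs[0]]
    while len(seeds) < min(k, len(metadocs)):
        rest = [m for m in metadocs if all(m is not s for s in seeds)]
        # Farthest-first: pick the node least similar to every existing seed.
        seeds.append(min(rest, key=lambda m: max(cosine(m.terms, s.terms) for s in seeds)))
    groups = [[] for _ in seeds]
    for m in metadocs:
        best = max(range(len(seeds)), key=lambda i: cosine(m.terms, seeds[i].terms))
        groups[best].append(m)
    return [merge(g) for g in groups if g]


def summarize(metadoc, n=3):
    """Summarize a metadocument by its n most frequent terms."""
    return [term for term, _ in metadoc.terms.most_common(n)]


def refine_focus(focus, max_count, k, choose):
    """One pass: expand, cluster, summarize, then keep only the user's picks."""
    new_metadocs = cluster(expand(focus, max_count), k)
    summaries = [summarize(m) for m in new_metadocs]
    picked = choose(summaries)  # stands in for the user; returns chosen indices
    return [new_metadocs[i] for i in picked]


if __name__ == "__main__":
    texts = ["cat cat dog", "cat pet", "stock market", "market price market"]
    leaves = [leaf(t) for t in texts]
    collection = merge([merge(leaves[:2]), merge(leaves[2:])])
    # Simulated user: keep every cluster whose summary mentions "market".
    chosen = refine_focus([collection], max_count=4, k=2,
                          choose=lambda s: [i for i, t in enumerate(s) if "market" in t])
    for m in chosen:
        print(summarize(m))
```

Calling `refine_focus` repeatedly, each time passing the previous result as the new focus set, would correspond to the iterative narrowing the abstract describes. Because `expand` stops as soon as no further expansion fits within `max_count`, the clustering step always works on a bounded number of items regardless of the size of the underlying collection.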