摘要 |
A method for identifying clusters of similar documents from among a set of documents is descpbed A particular document is selected based on rank from among a ranked set of documents (Figure 1, Item 102), wherein the ranked set of documents are included among available documents of the set of documents A probe is generated based on the particular document The probe comprising one or more features Documents that satisf) a similarity condition are found from among the available documents using a search based upon the probe (Figure 1, Item 105, and 106) Some or all documents found are associated with a particular cluster of documents (Figure 1, 108) The process can be repeated to generate fiirther clusters (Figure 1, 110) The method can be implemented with a computer, and associated programming instructions can be contained within a compute readable carrier.
|
申请人 |
JUSTSYSTEMS EVANS RESEARCH, INC.;EVANS, DAVID, A.;SHEFTEL, VICTOR, M.;BENNETT, JEFFREY, K.;HULL, DAVID, A. |
发明人 |
EVANS, DAVID, A.;SHEFTEL, VICTOR, M.;BENNETT, JEFFREY, K.;HULL, DAVID, A. |