<p>Disclosed information exploration system and method embodiments operate on a document set to determine a document cluster hierarchy. An exclusionary phrase index is determined for each cluster, and representative phrases are selected from the indexes. The selection process may enforce pathwise uniqueness and balanced sub-cluster representation. The representative phrases may be used as cluster labels in an interactive information exploration interface.</p>
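<p>The labeling step summarized in the abstract can be illustrated with a minimal sketch. The abstract does not define its terms, so everything here is an assumption: an "exclusionary phrase index" is taken to mean a count of phrases that occur in a cluster's documents but in none of its sibling clusters' documents, and "pathwise uniqueness" is taken to mean that no label repeats along a root-to-leaf path. The names <code>Cluster</code>, <code>exclusionary_index</code>, and <code>label_tree</code> are hypothetical, and balanced sub-cluster representation is not modeled.</p>

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    docs: list                      # each document is a set of phrases
    children: list = field(default_factory=list)
    label: str = ""


def exclusionary_index(cluster, outside_docs):
    """Count phrases in the cluster that never occur in outside_docs.

    Assumed definition; the abstract does not spell one out.
    """
    outside = set().union(*outside_docs)
    counts = {}
    for doc in cluster.docs:
        for phrase in doc:
            if phrase not in outside:
                counts[phrase] = counts.get(phrase, 0) + 1
    return counts


def label_tree(cluster, outside_docs=(), used_on_path=frozenset()):
    """Label each cluster with its most frequent exclusionary phrase,
    skipping phrases already used by an ancestor (pathwise uniqueness)."""
    index = exclusionary_index(cluster, outside_docs)
    candidates = sorted(
        (p for p in index if p not in used_on_path),
        key=lambda p: (-index[p], p),   # most frequent first, ties alphabetical
    )
    cluster.label = candidates[0] if candidates else ""
    path = used_on_path | {cluster.label}
    for child in cluster.children:
        # Documents of sibling clusters are what the child's index excludes.
        sibling_docs = [
            d for other in cluster.children if other is not child
            for d in other.docs
        ]
        label_tree(child, sibling_docs, path)


# Toy document set: four documents, two sub-clusters, one nested sub-cluster.
d1 = {"machine learning", "neural network", "weight decay"}
d2 = {"machine learning", "neural network"}
d3 = {"machine learning", "decision tree"}
d4 = {"machine learning", "decision tree"}

leaf = Cluster(docs=[d1])
left = Cluster(docs=[d1, d2], children=[leaf])
right = Cluster(docs=[d3, d4])
root = Cluster(docs=[d1, d2, d3, d4], children=[left, right])

label_tree(root)
print(root.label, "|", left.label, "|", leaf.label, "|", right.label)
```

<p>In this toy tree the nested cluster has no siblings, so its index still contains the phrases used by its ancestors; the pathwise check is what prevents it from repeating one of their labels.</p>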
Application publication number
WO2007059225(A2)
Application publication date
2007.05.24
Application number
WO2006US44367
Filing date
2006.11.15
Applicant
ENGENIUM CORPORATION;THOMPSON, KEVIN, B.;SOMMER, MATTHEW, S.