Systems and techniques to generate a term taxonomy for a collection of documents and filling the taxonomy with documents from the collection. In general, in one implementation, the technique includes: extracting terms from a plurality of documents; generating term pairs from the terms; ranking terms in each term pair based on a relative specificity of the terms; aggregating the ranks of the terms in each term pair; selecting term pairs based on the aggregate rankings; and generating a term hierarchy from the selected term pairs.
申请公布号
WO03060763(A3)
申请公布日期
2004.04.22
申请号
WO2002IB05793
申请日期
2002.12.27
申请人
SAP AKTIENGESELLSCHAFT;WOEHLER, JOHANNES;FAERBER, FRANZ