发明名称 System and method for performing efficient document scoring and clustering
摘要 A system and method for providing efficient document scoring of concepts within a document set is described. A frequency of occurrence of at least one concept within a document retrieved from the document set is determined. A concept weight is analyzed reflecting a specificity of meaning for the at least one concept within the document. A structural weight is analyzed reflecting a degree of significance based on structural location within the document for the at least one concept. A corpus weight is analyzed inversely weighing a reference count of occurrences for the at least one concept within the document. A score associated with the at least one concept is evaluated as a function of the frequency, concept weight, structural weight, and corpus weight.
申请公布号 US7610313(B2) 申请公布日期 2009.10.27
申请号 US20030626984 申请日期 2003.07.25
申请人 ATTENEX CORPORATION 发明人 KAWAI KENJI;EVANS LYNNE MARIE
分类号 G06F17/30;G06F17/00 主分类号 G06F17/30
代理机构 代理人
主权项
地址