Title of Invention |
Computation and Analysis of Significant Themes |
Abstract |
Systems and computer-implemented processes for computation and analysis of significant themes in a corpus of documents. The computation and analysis of significant themes can be executed on a processor and involves generating a lexical unit document association (LUDA) vector for each lexical unit that has been provided and quantifying similarities between each unique pair of lexical units. The LUDA vector characterizes a measure of association between its corresponding lexical unit and documents in the corpus. The lexical units can then be grouped into clusters such that each cluster contains a set of lexical units that are most similar as determined by the LUDA vectors and a predetermined clustering threshold.
|
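The process summarized in the abstract can be illustrated with a minimal sketch. The abstract does not state which association measure, similarity measure, or clustering method is used, so the following are assumptions made only for illustration: association is the raw count of a lexical unit in each document, similarity is cosine similarity, and clustering is single-link grouping of every pair whose similarity meets the threshold. The function names `luda_vectors`, `cosine`, and `cluster_units`, the sample documents, and the threshold value are hypothetical.

```python
from itertools import combinations
from math import sqrt


def luda_vectors(lexical_units, documents):
    """Map each lexical unit to a vector of per-document association scores.

    Assumption: association is the raw count of the unit in each document.
    """
    tokenized = [doc.lower().split() for doc in documents]
    return {unit: [tokens.count(unit) for tokens in tokenized]
            for unit in lexical_units}


def cosine(u, v):
    """Cosine similarity (an assumed measure); returns 0.0 for zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def cluster_units(vectors, threshold):
    """Group units single-link: merge any pair whose similarity >= threshold."""
    parent = {unit: unit for unit in vectors}

    def find(x):
        # Follow parent links to the cluster representative (with path halving).
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Quantify similarity for each unique pair of lexical units.
    for a, b in combinations(vectors, 2):
        if cosine(vectors[a], vectors[b]) >= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for unit in vectors:
        clusters.setdefault(find(unit), set()).add(unit)
    return list(clusters.values())


# Hypothetical toy corpus and lexical units.
documents = [
    "solar wind power grid",
    "solar power storage grid",
    "protein gene cell",
    "gene cell sequencing",
]
units = ["solar", "power", "gene", "cell"]

vectors = luda_vectors(units, documents)
themes = cluster_units(vectors, threshold=0.8)
```

In this toy corpus, "solar" and "power" occur in the same two documents, as do "gene" and "cell", so each pair ends up in its own cluster. A weighted association measure or a different clustering method could be substituted without changing the overall shape of the pipeline.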
Publication Number |
US2011004465(A1) |
Publication Date |
2011.01.06 |
Application Number |
US20090568365 |
Filing Date |
2009.09.28 |
Applicant |
BATTELLE MEMORIAL INSTITUTE |
Inventors |
ROSE STUART J.;COWLEY WENDY E.;CROW VERNON L. |
Classification Number |
G06F17/27 |
Main Classification Number |
G06F17/27 |
Agency |
|
Agent |
|
Principal Claim |
|
Address |
|