发明名称 |
USING RULE INDUCTION TO IDENTIFY EMERGING TRENDS IN UNSTRUCTURED TEXT STREAMS |
摘要 |
A method for identifying emerging concepts in unstructured text streams comprises: selecting a subset V of documents from a set U of documents; generating at least one Boolean combination of terms that partitions the set U into a plurality of categories that represent a generalized, statistically based model of the selected subset V wherein the categories are disjoint inasmuch as each document of U is included in only one category of the partition; and generating a descriptive label for each of the disjoint categories from the Boolean combination of terms for that category.
|
申请公布号 |
US2009292660(A1) |
申请公布日期 |
2009.11.26 |
申请号 |
US20080126829 |
申请日期 |
2008.05.23 |
申请人 |
BEHAL AMIT;CHEN YING;SPANGLER WILLIAM SCOTT |
发明人 |
BEHAL AMIT;CHEN YING;SPANGLER WILLIAM SCOTT |
分类号 |
G06F15/18 |
主分类号 |
G06F15/18 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|