发明名称 |
Clustering based text classification |
摘要 |
Systems and methods for clustering-based text classification are described. In one aspect text is clustered as a function of labeled data to generate cluster(s). The text includes the labeled data and unlabeled data. Expanded labeled data is then generated as a function of the cluster(s). The expanded label data includes the labeled data and at least a portion of unlabeled data. Discriminative classifier(s) are then trained based on the expanded labeled data and remaining ones of the unlabeled data.
|
申请公布号 |
US2005234955(A1) |
申请公布日期 |
2005.10.20 |
申请号 |
US20040921477 |
申请日期 |
2004.08.16 |
申请人 |
MICROSOFT CORPORATION |
发明人 |
ZENG HUA-JUN;WANG XUANHUI;CHEN ZHENG;ZHANG BENYU;MA WEI-YING |
分类号 |
G06F17/30;(IPC1-7):G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|