Title of invention: Diverse topic phrase extraction
Abstract: Systems and methods for implementing diverse topic phrase extraction are disclosed. According to one implementation, multi-word candidate phrases are extracted from a corpus and weighted. One or more documents are then re-weighted using latent semantic analysis (LSA) to identify less obvious candidate topics. Finally, phrase diversification is used to remove redundancy and select informative, distinct topic phrases.
Publication number: US8280877(B2)
Publication date: 2012.10.02
Application number: US20070859461
Filing date: 2007.09.21
Applicant: ZHANG BENYU; CHEN JILIN; CHEN ZHENG; ZENG HUAJUN; WANG JIAN; MICROSOFT CORPORATION
Inventor: ZHANG BENYU; CHEN JILIN; CHEN ZHENG; ZENG HUAJUN; WANG JIAN
Classification: G06F17/30
Main classification: G06F17/30
Agency:
Agent:
Main claim:
Address: