Title of invention: DOCUMENT KEY PHRASE EXTRACTION METHOD
Abstract: A computer-implemented method of extracting key phrases from a document is disclosed. The method comprises the steps of: accessing a repository of linked subjects, the repository comprising first and second data structures that represent the relationships between said subjects using different representation criteria; pruning the first data structure by removing links between subjects based on a further relationship between said subjects in the second data structure; matching phrases in said document to subjects in the pruned first data structure; further pruning the pruned first data structure by removing unmatched subjects that are not linked to matched subjects; determining a ranking for each matched subject; and selecting key phrases using the determined subject rankings. A computer program that implements the steps of this method when executed on a computer is also disclosed.
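The steps recited in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not specify the data structures, the pruning criterion, the matching rule, or the ranking function. The sketch assumes that the first data structure is a link graph between subjects, that the second is a subject-to-category map, that a link is kept only when both subjects share a category, that matching is a naive case-insensitive substring test, and that the ranking is the number of remaining links. All function names and the sample data are hypothetical.

```python
from typing import Dict, List, Set

Graph = Dict[str, Set[str]]


def prune_by_second_structure(links: Graph, categories: Dict[str, Set[str]]) -> Graph:
    """Keep a link only if both subjects share a category (assumed criterion)."""
    pruned: Graph = {s: set() for s in links}
    for s, targets in links.items():
        for t in targets:
            if categories.get(s, set()) & categories.get(t, set()):
                pruned[s].add(t)
                pruned.setdefault(t, set()).add(s)  # links treated as undirected
    return pruned


def match_subjects(text: str, graph: Graph) -> Set[str]:
    """Naive matching: a subject matches if its name occurs in the text."""
    lowered = text.lower()
    return {s for s in graph if s.lower() in lowered}


def prune_unlinked(graph: Graph, matched: Set[str]) -> Graph:
    """Drop unmatched subjects that have no link to any matched subject."""
    keep = {s for s in graph if s in matched or graph[s] & matched}
    return {s: graph[s] & keep for s in keep}


def rank(graph: Graph, matched: Set[str]) -> Dict[str, int]:
    """Assumed ranking: number of links a matched subject retains."""
    return {s: len(graph[s]) for s in matched}


def extract_key_phrases(text: str, links: Graph,
                        categories: Dict[str, Set[str]], top_k: int = 3) -> List[str]:
    pruned = prune_by_second_structure(links, categories)
    matched = match_subjects(text, pruned)
    final = prune_unlinked(pruned, matched)
    scores = rank(final, matched)
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda s: (-scores[s], s))[:top_k]


# Hypothetical sample repository and document.
links = {
    "search engine": {"information retrieval", "banana"},
    "information retrieval": {"search engine", "index"},
    "index": {"information retrieval"},
    "banana": {"search engine"},
    "ranking": {"information retrieval"},
    "compiler": {"parser"},
    "parser": {"compiler"},
}
categories = {
    "search engine": {"computing"},
    "information retrieval": {"computing"},
    "index": {"computing"},
    "banana": {"fruit"},
    "ranking": {"computing"},
    "compiler": {"computing"},
    "parser": {"computing"},
}
text = ("A search engine builds an index to support information retrieval; "
        "a banana is unrelated.")

print(extract_key_phrases(text, links, categories))
```

In this sample, the link between "search engine" and "banana" is removed in the first pruning because the two subjects share no category, and "compiler" and "parser" are removed in the second pruning because neither is matched nor linked to a matched subject. A real system would likely replace the degree count with a graph-based ranking and the substring test with proper phrase matching.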
Publication number: WO2010130083(A1) Publication date: 2010.11.18
Application number: WO2009CN71744 Filing date: 2009.05.12
Applicant: SHANGHAI HEWLETT-PACKARD CO., LTD; HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P.; ZHOU, BAO-YAO; LUO, PING; YANG, SHENG-WEN; XIONG, YUHONG; LIU, WEI Inventor: ZHOU, BAO-YAO; LUO, PING; YANG, SHENG-WEN; XIONG, YUHONG; LIU, WEI
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Principal claim:
Address: