Presented is a method of extracting keywords. The method includes obtaining a corpus of documents, determining a first set of words that appear as keywords in a document present in the corpus of documents, determining a second set of words that appear in the corpus of documents but not necessarily appear as keywords in the document, and determining a final set of keywords for the document by combining the first set of words with the second set of words.
申请公布号
WO2011127655(A1)
申请公布日期
2011.10.20
申请号
WO2010CN71758
申请日期
2010.04.14
申请人
HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P.;YANG, SHENG-WEN;XIONG, YUHONG;LIU, WEI