发明名称 METHOD AND APPARATUS FOR AUTOMATICALLY IDENTIFYING KEY WORDSWITHIN A DOCUMENT
摘要 A trainable method of extracting keywords of one or more words is disclosed. According to the method, every word within a document that is not a stop word is stemmed and evaluated and receives a score. The scoring is performed based on a plurality of parameters which are adjusted through training prior to use of the method for keyword extraction. Each word having a high score is then replaced by a word phrase that is delimited by punctuation or stop words. The word phrase is selected from word phrases having the stemmed word therein. Repeated keywords are removed. The keywords are expanded and capitalisation is determined. The resulting list forms extracted keywords.
申请公布号 CA2236623(C) 申请公布日期 2006.11.14
申请号 CA19982236623 申请日期 1998.05.04
申请人 TURNEY, PETER D. 发明人 TURNEY, PETER D.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址