Title of invention: Method and product for determining salient features for use in information searching
Abstract: A method and product are provided for generating a word set for use in locating a document having a type similar to a type of document in a document collection. The method includes selecting multiple documents from the document collection, each document selected including multiple words, and stemming the words in each document selected to obtain multiple stem words. The method also includes determining a word count for each stem word in each document, and clustering the stem words based on the word count of each stem word in each document to obtain a word set. The product includes a storage medium having programmed instructions recorded thereon for performing the method steps.
Publication number: US5924105(A) Publication date: 1999.07.13
Application number: US19980013453 Filing date: 1998.01.26
Applicant: MICHIGAN STATE UNIVERSITY Inventors: PUNCH, III, WILLIAM F.; WULFEKUHLER, MARILYN R.; GOODMAN, ERIK D.
Classification: G06F17/27;(IPC1-7):G06F17/21 Main classification: G06F17/27
Agency: Agent:
Principal claim:
Address:
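
The steps named in the abstract (stem the words of each selected document, count each stem per document, cluster the stems by their per-document counts) can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the abstract does not say which stemmer or clustering algorithm is used, so `crude_stem` (a hypothetical suffix stripper) and the small k-means routine in `cluster_stems` are assumptions, as are all function names and the sample documents.

```python
import re
from collections import Counter


def crude_stem(word):
    """Hypothetical suffix stripper; a stand-in for a real stemmer (assumption)."""
    for suffix in ("ing", "ed", "es", "e", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def stem_counts(documents):
    """Map each stem to its list of counts, one entry per document."""
    per_doc = [
        Counter(crude_stem(w) for w in re.findall(r"[a-z]+", doc.lower()))
        for doc in documents
    ]
    stems = sorted(set().union(*per_doc))
    return {stem: [counts[stem] for counts in per_doc] for stem in stems}


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def cluster_stems(vectors, k, iterations=20):
    """Group stems by count vector with a small k-means (assumed algorithm)."""
    stems = sorted(vectors)
    # Deterministic farthest-point initialization of the k centroids.
    centroids = [list(vectors[stems[0]])]
    while len(centroids) < k:
        farthest = max(
            stems,
            key=lambda s: min(squared_distance(vectors[s], c) for c in centroids),
        )
        centroids.append(list(vectors[farthest]))
    groups = [[] for _ in range(k)]
    for _ in range(iterations):
        # Assign every stem to its nearest centroid.
        groups = [[] for _ in range(k)]
        for stem in stems:
            nearest = min(
                range(k),
                key=lambda i: squared_distance(vectors[stem], centroids[i]),
            )
            groups[nearest].append(stem)
        # Move each non-empty cluster's centroid to the mean of its members.
        for i, members in enumerate(groups):
            if members:
                dims = len(centroids[i])
                centroids[i] = [
                    sum(vectors[s][d] for s in members) / len(members)
                    for d in range(dims)
                ]
    return groups


if __name__ == "__main__":
    # Made-up sample documents, for illustration only.
    docs = [
        "Genetic algorithms evolve solutions. Genetic search evolves.",
        "Genetic programming evolved programs.",
        "Cooking recipes: cooking pasta and cooking rice.",
    ]
    for word_set in cluster_stems(stem_counts(docs), k=2):
        print(word_set)
```

Each returned group is a candidate word set in the sense of the abstract: stems whose counts rise and fall together across the selected documents. A production version would likely swap in an established stemmer and a tuned clustering method; both choices here are placeholders.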