发明名称 Use of common words, and common word bound prefixes, infixes, and suffixes for natural language and genre determination; for serving as a student study aid of textbook material; for generating automated indexes, automated keywords of documents, and automated queries; for data-base reduction (compaction); and for sorting documents in heterogenous databases
摘要 The use of "most common words' of a language as token sets has great utility when used properly. The prior State of the Art, as exemplified by the invention described in Martino et al., U.S. Pat. No. 6,216,102, has not recognized the complete theoretical basis of "most common words' that might explain why a small set of words represents a majority of the words used in language when language comprises of an infinite set of potential words. Thus, Martino et al. have introduced an invention that is limited in scope due to misconceptions based upon the prior State of the Art with respect to the true theoretical underpinnings of "most common words'. The prior "State of the Art' excludes consideration of common affixes (bound MCW's) from the class of MCW's. In this application for which claims of invention are made, common affixes are considered as belonging to the class of MCW's.
申请公布号 US2003125930(A1) 申请公布日期 2003.07.03
申请号 US20010166329 申请日期 2001.08.04
申请人 STEPAK ASA MARTIN 发明人 STEPAK ASA MARTIN
分类号 G06F17/27;(IPC1-7):G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项
地址