摘要 |
A method to facilitate accurate document searching and document classification. Documents are processed with a collection of inter-related dictionaries to minimize the variations of words, word capitalization, hyphenation, and other variations that normally occur between different authors. In addition, important words in a document are automatically identified and a different level of importance assigned to each word depending on its importance determined by a combination of factors such as its importance across many documents, within a document, in each document section, or the ratio with respect to other words within the document. The documents to be analyzed, database of documents, processing systems, and user input / output are operate across a range of device capabilities and are flexible in their physical placement. A document list may be created for identifying documents that are related to a source document. |