Abstract |
An Information Filtering (IF) system for retrieving relevant text data from a database collection of documents is disclosed. A user can apply the system to a dynamic data stream, such as e-mail or a wire service, to retrieve relevant data. Alternatively, a user can apply the IF system to a data storage archive, such as electronically stored patents, journals and the like. The invention includes several steps. In the first step, the user puts the information of interest into tangible form, either by manually writing a natural-language user need statement or by inputting the statement electronically into a computer file for storage. The next step is to create a filter window with an adjustable document viewing text length, which is used to electronically scan through the database collection of documents in order to determine a relevancy value for each scanned document. The filter can be created in several ways using synonym and domain lists. Alternatively, the synonym and domain lists for each document can be determined by Entity-Relationship (ER) modelling to generate a search schema. After the documents receive relevancy values, the user is free to view only those documents whose relevancy values exceed a preselected threshold value. Documents can be ranked from most relevant to least relevant. Feedback from viewing the retrieved documents can be used to update the synonym/domain lists of the filter window, improving the relevance of subsequently retrieved documents.
|
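The filtering steps summarized in the abstract can be illustrated with a minimal sketch. It is not the disclosed implementation: the scoring rule (the largest fraction of filter terms found in any single window), the function names `tokenize`, `window_score`, `filter_documents` and `apply_feedback`, and the sample values are all assumptions made for illustration. The abstract does not state how the relevancy value is computed, and ER modelling is not covered here.

```python
import re
from typing import Dict, Iterable, List, Set, Tuple


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def window_score(tokens: List[str], terms: Set[str], window_len: int) -> float:
    """Assumed relevancy value: the highest fraction of filter terms
    that appears inside any window of `window_len` consecutive tokens."""
    if not tokens or not terms:
        return 0.0
    best = 0
    last_start = max(len(tokens) - window_len, 0)
    for start in range(last_start + 1):
        window = set(tokens[start:start + window_len])
        best = max(best, len(window & terms))
    return best / len(terms)


def filter_documents(
    docs: Dict[str, str], terms: Set[str], window_len: int, threshold: float
) -> List[Tuple[str, float]]:
    """Score every document, keep those whose score exceeds the threshold,
    and rank them from most relevant to least relevant."""
    scored = [
        (doc_id, window_score(tokenize(text), terms, window_len))
        for doc_id, text in docs.items()
    ]
    kept = [(doc_id, score) for doc_id, score in scored if score > threshold]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)


def apply_feedback(terms: Set[str], new_terms: Iterable[str]) -> Set[str]:
    """Hypothetical feedback step: extend the synonym/domain list with
    terms the user picked up while viewing retrieved documents."""
    return terms | {t.lower() for t in new_terms}
```

Under these assumptions, a call such as `filter_documents(docs, {"satellite", "launch", "rocket", "orbit"}, window_len=8, threshold=0.25)` returns only the documents scoring above 0.25, best first. Passing the result of `apply_feedback` as the term set on the next call models the feedback loop, and changing `window_len` models the adjustable viewing text length.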