发明名称 Method for detecting and extracting text data using database schemas
摘要 An Information Filtering (IF) system for retrieving relevant text data from a data base document collection is disclosed. A user can use this system to access a dynamic data stream to retrieve relevant data such as accessing e-mail or a wire-service. Alternatively, a user can use the IF system to access an data storage archive such as electronically stored patents, journals and the like. The invention includes several steps. The first step has a user reduce the information they are interested in into a tangible form such as manually writing a natural language user need statement, or alternatively imputing the statement electronically into a computer file for storage. The next step is to create a filter window having an adjustable document viewing text length, that will be used to electronically scan through the database collection of documents in order to determine a relevancy value for each scanned document. The filter can be created several ways using synonym and domain lists. Alternatively, the synonym and lists for each document can be determined by Entity-Relationship (ER) modelling to generate a search schema. After documents receive relevancy values, the user is free to view only those documents having relevancy values that exceed a preselected threshold value. Documents can be ranked from most relevant to least relevant. Feedback information from viewing the retrieved documents can be used to update the synonym/domain lists of the filtering window to enhance the relevance retrieval of subsequent documents.
申请公布号 US5717913(A) 申请公布日期 1998.02.10
申请号 US19950368045 申请日期 1995.01.03
申请人 UNIVERSITY OF CENTRAL FLORIDA 发明人 DRISCOLL, JAMES R.
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址