摘要 |
The present invention relates to systems and methods for describing unstructured or semi-structured documents in a collection to improve the effectiveness of search, the quality of human browsing, and the automation of information handling processes. One embodiment of the invention provides methods for annotating documents and fragments of documents with terms from an Extensible Structured Controlled Vocabulary (ESCV). This vocabulary can be an artificial language whose terms are connected to one another by a fixed variety of relations and which can be used in expanding searches, presenting documents or sets of documents, or making decisions about document disposition. The vocabulary can also be extended with new terms but only by relating those new terms to existing terms in the vocabulary.
|