发明名称 |
METHOD AND SYSTEM OF RANKING AND CLUSTERING FOR DOCUMENT INDEXING AND RETRIEVAL |
摘要 |
A relevancy ranking and clustering method and system that determines the relevance of a document relative to a user's query using a similarity comparison process. Input queries are parsed into one or more query predicate structures using an ontological parser. The ontological parser parses a set of known documents to generate one or more document predicate structures. A comparison of each query predicate structure with each document predicate structure is performed to determine a matching degree, represented by a real number. A multilevel modifier strategy is implemented to assign different relevance values to the different parts of each predicate structure match to calculate the predicate structure's matching degree. The relevance of a document to a user's query is determined by calculating a similarity coefficient, based on the structures of each pair of query predicates and document predicates. Documents are autonomously clustered using a self-organizing neural network that provides a coordinate system that makes judgments in a non-subjective fashion. |
申请公布号 |
WO02057961(A3) |
申请公布日期 |
2003.10.09 |
申请号 |
WO2002US00402 |
申请日期 |
2002.01.10 |
申请人 |
SCIENCE APPLICATIONS INTERNATIONAL CORPORATION |
发明人 |
CAUDILL, MAUREEN;TSENG, JASON, CHUN-MING;WANG, LEI |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|