发明名称 |
INFORMATION RETRIEVAL |
摘要 |
An Internet information agent (16) accepts a reference document, performs an analysis upon it in accordance with metrics defined by its analysis algorithm and obtains respective lists (word, character-level n-gram, word-level n-gram), derives weights corresponding to the metrics, applies the metrics to a candidate document and obtains respective returned values, applies the weights to the returned values and sums the results to obtain a Document Dissimilarity value. This DD is compared with a Dissimilarity Threshold and the candidate document is stored if the DD is less than the DT. A user can apply relevance values to the search results and the agent modifies the weights accordingly. The agent can be used to improve a language model for use in speech recognition applications and the like.
|
申请公布号 |
WO9834180(A1) |
申请公布日期 |
1998.08.06 |
申请号 |
WO1998GB00294 |
申请日期 |
1998.01.30 |
申请人 |
BRITISH TELECOMMUNICATIONS PUBLIC LIMITED COMPANY;WYARD, PETER, JOSEPH;ROSE, TONY, GERARD |
发明人 |
WYARD, PETER, JOSEPH;ROSE, TONY, GERARD |
分类号 |
G06F17/30;(IPC1-7):G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|