Abstract |
A system is provided for comparing an input query with a number of stored annotations to identify information to be retrieved from a database 29. The comparison technique divides the input query into a number of fixed-size fragments and identifies how many times each of the fragments occurs within each annotation. The frequencies of occurrence of the fragments in both the query and the annotation are then compared using an equation derived from a multinomial model to provide a measure of the similarity between the query and the annotation. The information to be retrieved is then determined from the similarity measures obtained for all the annotations. |
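The comparison described in the abstract can be illustrated with a minimal sketch. Several things here are assumptions, not taken from the source: fragments are overlapping character n-grams of length `n` (the abstract only says "fixed-size"), the function names `fragments`, `similarity`, and `retrieve` are hypothetical, and the score is a plain multinomial log-likelihood with add-one smoothing standing in for the equation the abstract refers to, which is not given here.

```python
import math
from collections import Counter


def fragments(seq, n=3):
    """Return all overlapping fixed-size fragments (n-grams) of seq.

    Assumption: overlapping character n-grams; the abstract does not say
    what unit the fragments are built from.
    """
    return [seq[i:i + n] for i in range(len(seq) - n + 1)]


def similarity(query, annotation, n=3):
    """Score a query against one annotation (higher means more similar).

    Stand-in measure: the log-likelihood of the query's fragment counts
    under a multinomial whose probabilities are estimated from the
    annotation's fragment counts, with add-one smoothing. This is not
    the equation from the source.
    """
    q_counts = Counter(fragments(query, n))
    a_counts = Counter(fragments(annotation, n))
    vocab = set(q_counts) | set(a_counts)
    # Add-one smoothing: every fragment in the joint vocabulary gets +1.
    total = sum(a_counts.values()) + len(vocab)
    score = 0.0
    for frag, q in q_counts.items():
        p = (a_counts[frag] + 1) / total
        score += q * math.log(p)
    return score


def retrieve(query, annotations, n=3):
    """Return the key of the annotation with the highest similarity score."""
    return max(annotations, key=lambda name: similarity(query, annotations[name], n))


if __name__ == "__main__":
    # Hypothetical annotations keyed by the item they describe.
    annotations = {
        "doc1": "picture of the taj mahal",
        "doc2": "notes on speech recognition",
    }
    print(retrieve("taj mahal", annotations))
```

Scoring every annotation and taking the maximum mirrors the final step in the abstract, where the information to be retrieved is determined from the similarity measures obtained for all the annotations.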