摘要 |
Method and system for selectively retrieving information contained in a stored document set using a metric-based or "fuzzy" finite-state non-deterministic automaton. An automaton is constructed (501) corresponding to a text string query, text strings are read (502) from storage and corresponding dissimilarity values are generated (505). Those strings resulting in values less than a given threshold are recorded (508) and listed for the user. Dissimilarity values are determined based on penalties associated with missing characters, extra characters, incorrect characters, and other differences between the text string query and a text string read from storage. <IMAGE> |