Title of invention Eliminating redundant patterns in a method using position indices of symbols to discover patterns in sequences of symbols
Abstract The present invention relates to computer-implemented methods for finding patterns in a set of k sequences of symbols (where k>=2) and to a computer readable medium having instructions for controlling a computer system to perform the methods. Patterns of symbols common to each 2-tuple of sequences are identified. Each identified pattern of symbols is represented by a position index numerical array (PINA), which is a set of position indices, each of which denotes the location in a selected reference sequence at which a symbol in the pattern occurs. The PINA representations of the patterns of each tuple at any order "n" may be combined with the PINA pattern representations of all other tuples at that same order "n" or with the pattern representations in any selected m-tuple, where m may have any integer value from 2 to (n-1). The representations of the patterns in an n-tuple are only combined with pattern representations of another tuple that includes in its tuple identifier at least one sequence index greater than the sequence indices included in the tuple identifier of the n-tuple. To avoid redundancies involving pair-wise combinations of representations of patterns, all of the sequence indices of the other tuple (other than the reference sequence index) must be different from those of the n-tuple.
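The PINA representation and the combination rule described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names (`pair_pinas`, `may_combine`, `combine`), the offset-based matching used to find patterns in a 2-tuple, the use of set intersection to combine two PINAs, the `min_len` threshold, and the convention that the reference sequence has index 0 are all assumptions made for illustration.

```python
def pair_pinas(ref, other, min_len=2):
    """Return PINAs for patterns shared by `ref` and `other`.

    Assumption: one candidate pattern per relative offset between the
    two sequences. Each PINA is a frozenset of positions in `ref`.
    """
    pinas = set()
    for shift in range(-(len(other) - 1), len(ref)):
        # Positions i in ref whose symbol equals the aligned symbol in other.
        pina = frozenset(
            i for i in range(len(ref))
            if 0 <= i - shift < len(other) and ref[i] == other[i - shift]
        )
        if len(pina) >= min_len:
            pinas.add(pina)
    return pinas


def may_combine(tuple_a, tuple_b, ref_index=0):
    """Redundancy check on tuple identifiers (tuples of sequence indices).

    tuple_b must contain a sequence index greater than every index in
    tuple_a, and the two may share no index other than the reference.
    """
    a = set(tuple_a) - {ref_index}
    b = set(tuple_b) - {ref_index}
    return max(tuple_b) > max(tuple_a) and a.isdisjoint(b)


def combine(pinas_a, pinas_b, min_len=2):
    """Combine two sets of PINAs by intersection (an assumed operation)."""
    combined = set()
    for p in pinas_a:
        for q in pinas_b:
            common = p & q
            if len(common) >= min_len:
                combined.add(common)
    return combined


# Hypothetical example data: sequence 0 is the reference.
ref = "ABCDE"
pinas_01 = pair_pinas(ref, "XBCDY")   # patterns of the 2-tuple (0, 1)
pinas_02 = pair_pinas(ref, "ZBCQE")   # patterns of the 2-tuple (0, 2)

if may_combine((0, 1), (0, 2)):
    pinas_012 = combine(pinas_01, pinas_02)  # patterns of the 3-tuple (0, 1, 2)
```

Because every PINA is expressed in positions of the same reference sequence, patterns from different tuples can be compared directly, and the identifier check means the tuples (0, 1) and (0, 2) are combined in one order only.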
Publication number US2006235662(A1) Publication date 2006.10.19
Application number US20060402735 Filing date 2006.04.12
Applicant ARGENTAR DAVID R Inventor ARGENTAR DAVID R.
Classification G06F17/10 Main classification G06F17/10
Agency Agent
Main claim
Address