主权项 |
1. A method of sequence mapping, the method comprising:
receiving a data set of sequences corresponding to a plurality of fragments of genetic material, the data set including a first sequence, wherein the first sequence comprises at least one ambiguous position; applying a key pattern to the first sequence to generate an initial key; substituting one or more bases at the ambiguous position in the initial key to generate a set of one or more substituted keys from the initial key, wherein the substituted key is a possible match to a reference key in a reference index, the reference index being generated by respectively applying the key pattern to each of a plurality of locations of a reference sequence to obtain a plurality of reference keys, the application of the key pattern to each of the locations causing a selection of a same number of specified positions of the reference sequence relative to the respective location; and comparing the substituted keys to the reference keys in the reference index to identify one or more reference keys that match to one or more of the substituted keys, thereby determining one or more candidate locations of the first sequence in the reference sequence, wherein the method is performed with a computer. |