主权项 |
1. A method for resolving queries of a biological sequence data source comprising:
allocating memory for a bit array; initializing bit values in the bit array; for a set of key-value mappings between a biological sequence k-mer and a characterization value, inserting each key-value mapping into a bit array, by:
applying a set of hash operators to the biological sequence k-mer to obtain a first set of indexes, each index in the first set of indexes corresponding to a section of the bit array,encoding the characterization value to a binary bit mask of fixed-weight;bitwise ORing the binary bit mask with sections of the bit array corresponding to each index in the first set of indexes; receiving a biological sequence query comprising a set of overlapping biological sequence k-mers; selecting a set of overlapping biological sequence k-mers in the biological sequence query; and extracting a k-mer characterization for each biological sequence k-mer in the set of overlapping biological sequence k-mers, by:
applying a set of hash operators to the biological sequence k-mer to select a second set of indexes,retrieving a set of binary masks, each binary mask in the set of binary masks corresponding to a location corresponding to an index in the second set of indexes,bitwise ANDing the set of binary mask into a combined bit mask,decoding the combined bit mask into a result characterization. |