摘要 |
<p>In one illustrative embodiment, a method may comprise receiving a first text-based computer file including one or more records, each of the one or more records comprising nucleotide sequence data generated by a read of a massively parallel sequencing instrument, determining whether a portion of the nucleotide sequence data of each record represents a short tandem repeat (STR) associated with a locus, placing each portion of the nucleotide sequence data determined to represent an STR associated with a locus into one of a number of locus- specific lists, determining a number of occurrences within each locus-specific list of identical nucleotide sequence data representing a unique STR, and generating a second text-based computer file including one or more records, each of the one or more records corresponding to a unique STR for which the number of occurrences of identical nucleotide sequence data representing the unique STR exceeded an abundance threshold.</p> |
申请人 |
BATTELLE MEMORIAL INSTITUTE |
发明人 |
YOUNG, BRIAN, A.;MINARD-SMITH, ANGELA, T.;HEIZER, ESLEY, M., JR.;BORNMAN, DANIEL, M.;HESTER, MARK, E.;YANG, BOYU |