摘要 |
A sequence data analyzer comprising: a read dictionary preparation unit creating a read sequence dictionary based on a concatenation string, the concatenation string constituted of a pair of a left sequence and a right sequence, which are obtained by sequencing a sample DNA fragment respectively from the left and right ends, and connecting characters connecting these sequences together; and a sample reconstruction unit extracting, as a sample sequence, a string up to a terminal character positioned in the string of a hit position of a query sequence in the read sequence dictionary, and extracting, as a mate sequence, the left sequence or right sequence until the appearance of a terminal character on the side where the hit position doesn't exist in the sample sequence. |