摘要 |
<p>The invention discloses a method and device for assembling a genome sequence. The method comprises the following steps: filtering short segment sequences output after long insertion segment library tail end sequencing, thereby removing unqualified sequences; comparing the short segment sequences after filtering with a reference genome sequence; dividing paired short segment sequences for comparison into soap reads sequences, single reads sequences and unmap reads sequences according to the comparison result, and making statistics on the quantity of each type of sequences; calculating the distance between the paired short segment sequences on the same segment of the reference genome sequence by utilizing the soap reads sequences, and making statistics on the distance distribution of all the paired short segment sequences on the reference genome sequence; and when the distance distribution satisfies the threshold requirement, assembling the genome sequence by utilizing the unique paired single reads sequences with different segments on the reference genome sequence.</p> |
申请人 |
BGI SHENZHEN CO., LIMITED;BGI SHENZHEN;HAN, CHANGLEI;CHEN, WENBIN;ZHANG, XIUQING;YANG, HUANMING |
发明人 |
HAN, CHANGLEI;CHEN, WENBIN;ZHANG, XIUQING;YANG, HUANMING |