主权项 |
1. A method for generating sequence assemblies from short sequencing reads, comprising:
a) fragmenting at least one member of an input library to produce a plurality of linear DNA fragments having a first fragment end and a second fragment end proximal to a fragmentation breakpoint, b) attaching a common nucleic acid adaptor to the first and second linear DNA fragment ends proximal to a fragmentation breakpoint, wherein the common adaptor comprise the same unique sequence tag, c) optionally amplifying the plurality of linear DNA fragments to produce a sequencing library comprising a plurality of amplified DNA fragments, wherein at least one of the plurality of amplified DNA fragments comprises:
i) sequence complementary to at least the unique sequence tag of an adaptor, andii) sequence complementary to at least a portion of a member of the input library, d) sequencing at least a portion of the DNA fragments, wherein the presence of a unique adaptor sequence tag in a plurality of fragment sequences thereby associates the fragment sequences having ends that were proximal to the same fragmentation breakpoint, and e) assembling the plurality of breakpoint tag-associated fragment sequences, or subassembly sequences comprising breakpoint-associated sequences, to generate longer subassembly sequences of the input library. |