摘要 |
The invention relates to a computer implemented method of storing/recovering in/from a storage data structure a multitude of sequences that have been aligned with a reference data structure. The information of the sequences is stored in different sections. Each section comprises data streams comprising specific data of the sequences having a reference position in the reference position range associated with the data stream. In a first section, the length of the sequences is stored. In a second section, the mutations of a sequence with respect to the reference sequence are stored. In a third section, consensus based quality values are linked with positions in the reference sequence. In a fourth section, the sequence identifiers are stored. The storage data structure has a format which is optimized for viewer, re-alignment, variant calling and other post-processing tools. |