Title of Invention |
EFFICIENT ENCODING AND STORAGE AND RETRIEVAL OF GENOMIC DATA |
Abstract |
A method for encoding genomic data that reduces the storage footprint by two orders of magnitude while preserving data of acceptable quality. |
Application Publication Number |
US2015248430(A1) |
Application Publication Date |
2015.09.03 |
Application Number |
US201514625967 |
Filing Date |
2015.02.19 |
Applicant |
Hospodor Andrew;Corderi Ignacio |
Inventor |
Hospodor Andrew;Corderi Ignacio |
Classification Number |
G06F17/30 |
Main Classification Number |
G06F17/30 |
Agency |
|
Agent |
|
Independent Claim |
1. A method for compressing genomic data, the method comprising the steps of:
(i) providing a computer having a memory in functional communication with a processor;
(ii) inputting multiple segments of genomic sequences and their quality scores into the computer memory;
(iii) providing reference genomic data comprising a sequence of genomic data;
(iv) accessing the reference genomic data;
(v) aligning the multiple segments with the reference genomic data;
(vi) comparing individual nucleotides in the aligned multiple segments using the processor;
(vii) creating a de-duplicated sequence of encoded data aligned with the reference genomic data;
wherein the encoded data contains nucleotide labels for agreed upon nucleotides at a particular nucleotide location. |
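Steps (v) through (vii) of the claim can be illustrated with a minimal Python sketch. It is not the encoding defined in the patent. It assumes that alignment has already been done, so each segment arrives with a 0-based start position on the reference, and that bases are compared by simple per-position voting. The function name `consensus_encode`, the `min_quality` threshold, and the `-` (no coverage) and `N` (segments disagree) markers are hypothetical choices made for illustration.

```python
from collections import Counter, defaultdict


def consensus_encode(reference, aligned_reads, min_quality=20):
    """Collapse overlapping aligned segments into one encoded sequence.

    aligned_reads: iterable of (start, sequence, qualities) tuples, where
    start is the 0-based reference position of the segment's first base
    (assumed to come from a separate alignment step).
    """
    # Tally the bases observed at each reference position.
    votes = defaultdict(Counter)
    for start, seq, quals in aligned_reads:
        for offset, (base, quality) in enumerate(zip(seq, quals)):
            pos = start + offset
            # Ignore bases outside the reference or below the quality cutoff.
            if 0 <= pos < len(reference) and quality >= min_quality:
                votes[pos][base] += 1

    # Emit one label per reference position (the de-duplicated sequence).
    encoded = []
    for pos in range(len(reference)):
        counts = votes.get(pos)
        if not counts:
            encoded.append("-")  # hypothetical marker: no usable coverage
        elif len(counts) == 1:
            encoded.append(next(iter(counts)))  # agreed-upon nucleotide
        else:
            encoded.append("N")  # hypothetical marker: segments disagree
    return "".join(encoded)


reads = [
    (0, "ACGT", [30, 30, 30, 30]),
    (2, "GTAC", [30, 30, 30, 30]),
    (2, "GTTC", [30, 30, 30, 30]),
]
print(consensus_encode("ACGTACGT", reads))
```

In this sketch, three overlapping segments collapse into a single string with one label per reference position, which is where the storage reduction would come from. A fuller implementation would also need a rule for resolving disagreements, for example by weighting votes by quality score.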
Address |
Santa Cruz CA US |