发明名称 EFFICIENT ENCODING AND STORAGE AND RETRIEVAL OF GENOMIC DATA
摘要 A new method for encoding genomic data that reduces storage footprint by two orders of magnitude while preserving acceptable quality data.
申请公布号 US2015248430(A1) 申请公布日期 2015.09.03
申请号 US201514625967 申请日期 2015.02.19
申请人 Hospodor Andrew;Corderi Ignacio 发明人 Hospodor Andrew;Corderi Ignacio
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method for compressing genomic data, the method comprising the steps of: (i) providing a computer having a memory in functional communication with a processor; (ii) inputting multiple segments of genomic sequences and their quality scores into the computer memory; (iii) providing reference genomic data comprising a sequence of genomic data; (iv) accessing the reference genomic data; (v) aligning the multiple segments with the reference genomic data; (vi) comparing individual nucleotides in the aligned multiple segments using the processor; (vii) creating a de-duplicated sequence of encoded data aligned with the reference genomic data; wherein the encoded data contains nucleotide labels for agreed upon nucleotides at a particular nucleotide location.
地址 Santa Cruz CA US