发明名称 |
SYSTEM AND METHOD FOR TRANSFORMING AND COMPRESSING GENOMICS DATA |
摘要 |
This invention relates to the quality scores of bases produced from high throughput genomic sequencing, in particular to transforming the quality scores for improved compressibility. A method for transforming these quality scores is described whereby a quality score is modified by utilising a Bayesian model based on Coding Theory combined with search results from a genomic corpus. A related method is described for efficient searching for a Read in a genomic corpus so as to find all matching symbols up to a given Hamming-distance or Edit-distance. |
申请公布号 |
US2016357812(A1) |
申请公布日期 |
2016.12.08 |
申请号 |
US201615155902 |
申请日期 |
2016.05.16 |
申请人 |
Greenfield Daniel;Rrustemi Alban |
发明人 |
Greenfield Daniel;Rrustemi Alban |
分类号 |
G06F17/30;G06N7/00 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method for adjusting a quality score of a genomic sequence base in a genomic sequence read, the method comprising:
determining a distance to search in a genomic corpus for a genomic sequence read; performing a search using said read on said genomic corpus with said distance; and based on results of said search performing calculations to adjust said quality score of said genomic sequence base in said read. |
地址 |
Cambridge GB |