Title of invention: SYSTEM AND METHOD FOR TRANSFORMING AND COMPRESSING GENOMICS DATA
Abstract: This invention relates to the quality scores of bases produced by high-throughput genomic sequencing, in particular to transforming the quality scores for improved compressibility. A method for transforming these quality scores is described whereby a quality score is modified by utilising a Bayesian model based on Coding Theory combined with search results from a genomic corpus. A related method is described for efficiently searching for a Read in a genomic corpus so as to find all matching symbols up to a given Hamming distance or Edit distance.
Publication number: US2016357812(A1) Publication date: 2016.12.08
Application number: US201615155902 Filing date: 2016.05.16
Applicant: Greenfield Daniel;Rrustemi Alban Inventor: Greenfield Daniel;Rrustemi Alban
Classification: G06F17/30;G06N7/00 Main classification: G06F17/30
Agency: Agent:
Main claim: 1. A method for adjusting a quality score of a genomic sequence base in a genomic sequence read, the method comprising: determining a distance to search in a genomic corpus for a genomic sequence read; performing a search using said read on said genomic corpus with said distance; and based on results of said search performing calculations to adjust said quality score of said genomic sequence base in said read.
Address: Cambridge GB
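
The three steps of the main claim (determine a search distance, search the corpus within that distance, adjust the quality score from the results) can be illustrated with a minimal Python sketch. This is not the patented method: the record does not disclose the actual Bayesian model or the efficient search structure, so everything below is an assumption made for illustration. The function names (choose_distance, search_corpus, adjust_quality), the 10% mismatch rate, the brute-force Hamming scan, the pseudocount smoothing, and the Phred cap of 40 are all hypothetical.

```python
import math

MAX_PHRED = 40  # hypothetical cap on the adjusted score


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def choose_distance(read_len, mismatch_rate=0.1):
    """Step 1 (assumed rule): allow roughly 10% mismatches, at least one."""
    return max(1, int(read_len * mismatch_rate))


def search_corpus(read, corpus, max_distance):
    """Step 2: brute-force scan returning every corpus window whose
    Hamming distance to the read is at most max_distance."""
    n = len(read)
    return [
        corpus[i:i + n]
        for i in range(len(corpus) - n + 1)
        if hamming(read, corpus[i:i + n]) <= max_distance
    ]


def adjust_quality(read, pos, phred, hits):
    """Step 3 (assumed model): combine the sequencer's error probability
    with the fraction of hits that agree with the read at `pos`."""
    if not hits:
        return phred  # no evidence from the corpus, leave the score as-is
    p_err = 10 ** (-phred / 10)  # standard Phred-to-probability conversion
    agree = sum(h[pos] == read[pos] for h in hits)
    support = (agree + 1) / (len(hits) + 2)  # pseudocounts avoid 0 and 1
    post_err = p_err * (1 - support) / (
        p_err * (1 - support) + (1 - p_err) * support
    )
    return min(MAX_PHRED, round(-10 * math.log10(post_err)))


corpus = "ACGTACGTTTGACCA"  # toy corpus, invented for this example
read = "ACGA"
distance = choose_distance(len(read))
hits = search_corpus(read, corpus, distance)
new_q_first = adjust_quality(read, 0, 20, hits)  # base supported by all hits
new_q_last = adjust_quality(read, 3, 20, hits)   # base contradicted by most hits
```

In this toy run the first base agrees with every hit, so its score rises above the original 20, while the last base disagrees with most hits, so its score falls. Scores adjusted this way tend to cluster on fewer distinct values, which is the kind of effect the abstract associates with improved compressibility.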