Abstract |
A method for adjusting the quality score of a genomic sequence comprises determining a distance with which to search a genomic corpus for a genomic sequence read, the distance being in particular a Hamming distance or an edit distance. The results of searching the genomic corpus for the read within that distance are used to adjust the quality score of a base of the genomic sequence. The calculations to adjust the quality score preferably utilise a Bayesian estimation of the likelihood that a base in the read has a sequencing error. Also claimed is a method for searching a genomic corpus for a genomic sequence read, in which the sequence read is partitioned into a multiplicity of slots, a look-up operation is performed on each slot against said genomic corpus, and the candidate results from each slot are combined. |
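
The slot-based search described above can be illustrated with a minimal Python sketch. It is not the claimed implementation: it assumes a corpus of equal-length sequences, a Hamming distance, and a number of slots equal to the distance plus one, so that any corpus entry within the distance must match the read exactly in at least one slot (the pigeonhole principle). The names `split_slots`, `build_index`, `search` and `hamming` are hypothetical helpers introduced only for this illustration.

```python
from collections import defaultdict


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def split_slots(length, n_slots):
    """Return (start, end) bounds that cut range(length) into n_slots near-equal slots."""
    base, extra = divmod(length, n_slots)
    bounds, start = [], 0
    for i in range(n_slots):
        end = start + base + (1 if i < extra else 0)
        bounds.append((start, end))
        start = end
    return bounds


def build_index(corpus, read_len, n_slots):
    """Map (slot number, slot substring) to the set of corpus entry ids containing it."""
    bounds = split_slots(read_len, n_slots)
    index = defaultdict(set)
    for entry_id, seq in enumerate(corpus):
        for slot, (lo, hi) in enumerate(bounds):
            index[(slot, seq[lo:hi])].add(entry_id)
    return bounds, index


def search(read, corpus, bounds, index, max_dist):
    """Look up each slot of the read, combine the candidates, then verify each one."""
    candidates = set()
    for slot, (lo, hi) in enumerate(bounds):
        candidates |= index.get((slot, read[lo:hi]), set())
    # Assumption: candidates are confirmed by a full Hamming-distance check.
    return sorted(i for i in candidates if hamming(read, corpus[i]) <= max_dist)


# Toy corpus (made-up sequences, for illustration only).
corpus = ["ACGTACGT", "ACGTACGA", "TTTTACGT", "GGGGCCCC"]
max_dist = 1
bounds, index = build_index(corpus, read_len=8, n_slots=max_dist + 1)
print(search("ACGTACGT", corpus, bounds, index, max_dist))  # → [0, 1]
```

Using one more slot than the permitted distance is a design choice of this sketch: it guarantees that no corpus entry within the distance is missed, at the cost of verifying some candidates that turn out to be too far away (entry 2 in the toy corpus above). An edit distance would need a different partitioning and verification step, which is not shown here.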