Title of invention METHOD FOR ASSEMBLY OF NUCLEIC ACID SEQUENCE DATA
Abstract The present invention relates to a method for assembly of nucleic acid sequence data comprising nucleic acid fragment reads into (a) contiguous nucleotide sequence segment(s), comprising the steps of: (a) obtaining a plurality of nucleic acid sequence data from a plurality of nucleic acid fragment reads; (b) aligning said plurality of nucleic acid sequence data to a reference sequence; (c) detecting one or more gaps or regions of non-assembly, or of non-matching with the reference sequence, in the alignment output of step (b); (d) performing de novo sequence assembly of nucleic acid sequence data mapping to said gaps or regions of non-assembly; and (e) combining the alignment output of step (b) and the assembly output of step (d) in order to obtain (a) contiguous nucleotide sequence segment(s). The present invention further relates to a method wherein the detection of gaps or regions of non-assembly is performed by implementing a filter or threshold on base quality, coverage, complexity of the surrounding region, or length of mismatch. Also envisaged is the masking out of nucleic acid sequence data relating to known polymorphisms, disease-related mutations or modifications, repeats, low-mappability regions, CpG islands, or regions with certain biophysical features. In addition, a corresponding program element or computer program for assembly of nucleic acid sequence data, and a sequence assembly system for transforming nucleic acid sequence data comprising nucleic acid fragment reads into (a) contiguous nucleotide sequence segment(s), are provided.
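Steps (c) and (e) of the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: it assumes per-position read depth from step (b) is already available as a list, uses only a coverage threshold (the abstract also names base quality, complexity, and mismatch length), and assumes the de novo contigs from step (d) are already computed and keyed by the gap interval they fill. The names `find_gaps`, `combine`, `min_coverage`, and `min_length` are hypothetical.

```python
from typing import Dict, List, Tuple

Interval = Tuple[int, int]  # half-open [start, end) on the reference


def find_gaps(coverage: List[int], min_coverage: int = 1,
              min_length: int = 1) -> List[Interval]:
    """Step (c), simplified: report runs where depth falls below a threshold."""
    gaps: List[Interval] = []
    start = None
    for pos, depth in enumerate(coverage):
        if depth < min_coverage:
            if start is None:
                start = pos  # a low-coverage run begins here
        elif start is not None:
            if pos - start >= min_length:
                gaps.append((start, pos))
            start = None
    # Close a run that extends to the end of the reference.
    if start is not None and len(coverage) - start >= min_length:
        gaps.append((start, len(coverage)))
    return gaps


def combine(consensus: str, gap_contigs: Dict[Interval, str]) -> str:
    """Step (e), simplified: splice de novo contigs into the aligned consensus."""
    pieces: List[str] = []
    cursor = 0
    for start, end in sorted(gap_contigs):
        pieces.append(consensus[cursor:start])       # reference-guided part
        pieces.append(gap_contigs[(start, end)])     # de novo part for the gap
        cursor = end
    pieces.append(consensus[cursor:])
    return "".join(pieces)


# Toy data: depth per reference position and a consensus with an unresolved gap.
coverage = [5, 5, 0, 0, 0, 4, 4, 0, 6]
gaps = find_gaps(coverage, min_coverage=1, min_length=2)
merged = combine("ACNNNGTNA", {gaps[0]: "TTTT"})
```

In this toy run the single uncovered position at index 7 is ignored because it is shorter than `min_length`, and the de novo contig may differ in length from the gap it replaces, as would be expected for an insertion relative to the reference.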
Publication number WO2012168815(A2) Publication date 2012.12.13
Application number WO2012IB52613 Filing date 2012.05.24
Applicant KONINKLIJKE PHILIPS ELECTRONICS N.V.;KUMAR, SUNIL;SINGH, RANDEEP;DIMITROVA, NEVENKA Inventor KUMAR, SUNIL;SINGH, RANDEEP;DIMITROVA, NEVENKA
Classification G06F19/00 Main classification G06F19/00
Agency Agent
Principal claim
Address