摘要 |
The present disclosure presents a unified system to phase a personal genome for downstream clinical interpretation. In an embodiment, an initial phasing is generated using public datasets, such as haplotypes from the 1000 Genomes Project, and a phasing toolkit. A local perturbation algorithm is applied to improve long range phasing. If available, a Mendelian inheritance pipeline is applied to identify phasing of novel and rare variants. These datasets are merged, followed by correction by any experimental data. This allows for full clinical interpretation of the role of a group of variants in a gene, whether inherited or de novo variants.
|