发明名称 METHOD AND SYSTEM FOR BUILDING A PHYLOGENY FROM GENETIC SEQUENCES AND USING THE SAME FOR RECOMMENDATION OF VACCINE STRAIN CANDIDATES FOR THE INFLUENZA VIRUS
摘要 <p>A computer-implemented method and a computer system for identifying a phylogenetic tree from a plurality of biological sequences is provided. Each biological sequence is associated with a sampling date. First, the plurality of biological sequences is aligned and a distance matrix is obtained. Then, a subset of these sequences without any duplicated sequences is selected and a directed graph representation of the subset of biological sequences is generated based on the associated sampling dates. Then, a minimum spanning tree is computed from the weighted directed graph representation. Then, in an iterative procedure, the sequences of unsampled evolutionary intermediates are inferred from mutation patterns that reflect the difference in sequence between the nodes in the minimum spanning tree. The new sequences are added with associated time stamps to the sequence set. Then, sets of identical sequences are removed. Then, an optimum branching is recomputed. This step is repeated until no new intermediates are found. In the final step, the sequences that have been set aside in the inititalizing step are added to the plurality of sequences derived in the update step. From this plurality of sequences an optimum branching is computed and identified as the phylogenetic tree. Amino acid changes repeatedly occurring on the internal branches of the obtained tree can be used to identify sequences and associated viral isolates suitable as vaccine strains for the following influenza season.</p>
申请公布号 EP2362959(A1) 申请公布日期 2011.09.07
申请号 EP20090760761 申请日期 2009.11.25
申请人 MAX-PLANCK-GESELLSCHAFT ZUR FOERDERUNG DER WISSENSCHAFTEN E.V. 发明人 MCHARDY, ALICE, CAROLYN;STEINBRUECK, LARS
分类号 G06F19/00;A61K39/145;G06F19/14;G06F19/26 主分类号 G06F19/00
代理机构 代理人
主权项
地址