Title of invention ALIGNING AND CLUSTERING SEQUENCE PATTERNS TO REVEAL CLASSIFICATORY FUNCTIONALITY OF SEQUENCES
Abstract A system and method for discovering sequence patterns with variations are provided. The method includes accessing or acquiring a data set that includes a family of sequences or related families of sequences, and then: a) applying a pattern discovery process to the sequences; b) grouping and aligning similar patterns, which may have different lengths, into one or more Aligned Pattern Clusters; c) discovering the co-occurrence relations between Aligned Patterns and/or Aligned Pattern Clusters to reveal the distal function between segments represented by the Aligned Pattern Clusters; and d) breaking down an Aligned Pattern Cluster into sub-clusters with a stable cluster configuration, revealing sub-clusters with distinct and shared characteristics among sub-families of the sequences.
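Steps a) to c) of the abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes fixed-length k-mers as a stand-in for the pattern discovery process, a Hamming-distance threshold as a stand-in for pattern alignment, and a plain count of shared sequences as the co-occurrence measure. All function names and parameters (`frequent_kmers`, `cluster_patterns`, `cooccurrence`, `min_support`, `max_mismatches`) are hypothetical.

```python
from collections import Counter
from itertools import combinations


def frequent_kmers(sequences, k, min_support):
    """Step a), simplified: k-mers present in at least min_support sequences."""
    support = Counter()
    for seq in sequences:
        # A set counts each k-mer at most once per sequence.
        support.update({seq[i:i + k] for i in range(len(seq) - k + 1)})
    return sorted(p for p, n in support.items() if n >= min_support)


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def cluster_patterns(patterns, max_mismatches):
    """Step b), simplified: greedy grouping of equal-length patterns.

    A pattern joins the first cluster holding a member within
    max_mismatches of it; otherwise it starts a new cluster.
    """
    clusters = []
    for p in patterns:
        for cluster in clusters:
            if any(hamming(p, q) <= max_mismatches for q in cluster):
                cluster.append(p)
                break
        else:
            clusters.append([p])
    return clusters


def cooccurrence(sequences, clusters):
    """Step c), simplified: sequences containing a pattern from both clusters."""
    hits = [
        {i for i, s in enumerate(sequences) if any(p in s for p in c)}
        for c in clusters
    ]
    return {
        (a, b): len(hits[a] & hits[b])
        for a, b in combinations(range(len(clusters)), 2)
    }
```

A high count for a cluster pair means the two groups of patterns tend to appear in the same sequences, which is the kind of signal step c) looks for between distant segments. Step d), splitting a cluster into stable sub-clusters, is not covered by this sketch.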
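Publication number CA2942106(A1) Publication date 2014.10.23
Application number CA20142942106 Filing date 2014.04.17
Applicant WONG, ANDREW KA-CHING Inventors WONG, ANDREW KA-CHING;LEE, EN-SHIUN ANNIE
Classification G06F19/22;C12Q1/68 Main classification G06F19/22
Agency Agent
Principal claim
Address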