Title of Invention |
ALIGNING AND CLUSTERING SEQUENCE PATTERNS TO REVEAL CLASSIFICATORY FUNCTIONALITY OF SEQUENCES |
Abstract |
A system and method for discovering sequence patterns with variations are provided. The method includes accessing or acquiring a data set that includes a family of sequences or related families of sequences, and then: a) applying a pattern discovery process to the sequences; b) grouping and aligning similar patterns, which may have different lengths, into one or more Aligned Pattern Clusters; c) discovering the co-occurrence relations between Aligned Patterns and/or Aligned Pattern Clusters to reveal the distal function between segments represented by the Aligned Pattern Clusters; and d) breaking down an Aligned Pattern Cluster into sub-clusters with a stable cluster configuration, revealing sub-clusters with distinct and shared characteristics among sub-families of the sequences. |
Publication Number |
CA2942106(A1) |
Publication Date |
2014.10.23 |
Application Number |
CA20142942106 |
Filing Date |
2014.04.17 |
Applicant |
WONG, ANDREW KA-CHING |
Inventors |
WONG, ANDREW KA-CHING;LEE, EN-SHIUN ANNIE |
Classification |
G06F19/22;C12Q1/68 |
Main Classification |
G06F19/22 |
Agency |
|
Agent |
|
Main Claim |
|
Address |
|
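
Steps b) and c) of the abstract above can be illustrated with a small Python sketch. This is not the patented method: the similarity measure (`difflib.SequenceMatcher`), the greedy seed-based grouping, the `threshold` value, the function names, and the toy patterns and sequences are all assumptions chosen for illustration. Step a) is assumed to have already produced the pattern list, and step d) (splitting a cluster into stable sub-clusters) is not covered.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similarity(a: str, b: str) -> float:
    """Similarity ratio in [0, 1]; works for patterns of different lengths."""
    return SequenceMatcher(None, a, b).ratio()


def cluster_patterns(patterns, threshold=0.6):
    """Greedy grouping: a pattern joins the first cluster whose seed
    (first member) is similar enough, otherwise it starts a new cluster."""
    clusters = []
    for p in patterns:
        for c in clusters:
            if similarity(p, c[0]) >= threshold:
                c.append(p)
                break
        else:
            clusters.append([p])
    return clusters


def cooccurrence(sequences, clusters):
    """For each pair of clusters, count the sequences that contain
    at least one pattern from both clusters."""
    counts = {}
    for s in sequences:
        present = sorted(
            i for i, c in enumerate(clusters) if any(p in s for p in c)
        )
        for pair in combinations(present, 2):
            counts[pair] = counts.get(pair, 0) + 1
    return counts


# Hypothetical toy data: patterns as if returned by a pattern discovery step.
patterns = ["CHCGG", "CHCG", "CHSGG", "WKLLY", "WKLY"]
sequences = ["AACHCGGTTWKLLYAA", "CHSGGAAWKLY", "TTCHCGTT", "GGGWKLYGG"]

clusters = cluster_patterns(patterns)
counts = cooccurrence(sequences, clusters)
print(clusters)
print(counts)
```

A cluster pair with a high count relative to the number of sequences suggests that the two segments tend to appear together, which is the kind of relation step c) looks for. A real implementation would more likely use a proper multiple alignment and a statistical test of co-occurrence rather than raw counts.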