Abstract |
<p>PROBLEM TO BE SOLVED: To provide a clustering program that can significantly reduce clustering processing time.</p><p>SOLUTION: The calculation of homologies between function-unknown cDNA terminal sequences and known sequences is separated from the classification based on the calculation results. When the function-unknown cDNA terminal sequences are classified according to their degree of homology with the known sequences, a unique sequence number is assigned to each identifier (sequence), each cDNA terminal sequence is grouped with the known sequences of high homology to it, and a unique cluster number is assigned to each group. All identifiers belonging to cluster numbers that share the same sequence number are treated as members of the same cluster. If the identifiers belonging to a specific cluster number do not belong to any other cluster number (no sharing), those identifiers alone form the cluster.</p><p>COPYRIGHT: (C) 2005, JPO&NCIPI</p> |
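The merging rule in the SOLUTION paragraph (groups that share a sequence are treated as one cluster; a group that shares nothing stays a cluster on its own) can be sketched as a union-find pass over precomputed groups. This is only an illustrative sketch, not the patented program: the function name `merge_clusters`, the input shape (a mapping from cluster number to the identifiers in that group), and the sample identifiers are all assumptions, and the homology calculation is taken as already done.

```python
from collections import defaultdict


def merge_clusters(groups):
    """Merge groups that share at least one identifier.

    groups: dict mapping a cluster number to an iterable of sequence
    identifiers (hypothetical input shape; homology search is assumed
    to have produced these groups beforehand).
    Returns a sorted list of clusters, each a sorted list of identifiers.
    """
    parent = {}

    def find(x):
        # Follow parent links to the root, halving the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    for members in groups.values():
        members = list(members)
        for ident in members:
            parent.setdefault(ident, ident)
        # Link every member of the group to the group's first member.
        for ident in members[1:]:
            union(members[0], ident)

    clusters = defaultdict(set)
    for ident in parent:
        clusters[find(ident)].add(ident)
    return sorted(sorted(c) for c in clusters.values())


# Hypothetical example: groups 1 and 2 share "known_X", group 3 shares nothing.
example = {
    1: ["cDNA_A", "known_X"],
    2: ["cDNA_B", "known_X"],
    3: ["cDNA_C", "known_Y"],
}
print(merge_clusters(example))
```

Because the grouping is computed first and merged afterwards, the merge step touches each identifier only a few times, which fits the abstract's point about separating homology calculation from classification.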