摘要 |
<p><P>PROBLEM TO BE SOLVED: To provide a means for classifying relative living organism not compared by analysis based on rRNA, and a means for comparison-analyzing the similarity between long genome base sequences, for example, genome base sequences including several millions or more of bases. <P>SOLUTION: A method of analyzing the similarity between the base sequences includes: (a) a procedure for dividing the base sequence to prepare a segment group; (b) a procedure for counting the appearance number of each nucleic acid constitutive base; (c) a procedure for allocating a maldistributive display base; (d) a procedure for calculating a maldistributive score; (e) a procedure for preparing a maldistributive display sequence; a procedure for determining an objective area for calculating a similarity score expressing the similarity between the first maldistributive display sequence obtained by carrying out the procedures (a)-(e) in the first base sequence, and the second maldistributive display sequence obtained by carrying out the procedures (a)-(e) in the second base sequence; and a procedure for calculating the similarity score expressing the similarity between the first maldistributive display sequence and the second maldistributive display sequence, using the first maldistributive score and the second maldistributive score. <P>COPYRIGHT: (C)2010,JPO&INPIT</p> |