摘要 |
<p>Provided is a device for easily determining the similarities between sequence information items. This candidate selection device (10) is provided with an input means (11), a sequence storage section (121), a degree of similarity storage section (122), a candidate sequence storage section (123), a degree of similarity calculation means (131), a candidate sequence selection means (132), and an output means (14). The input means (11) inputs information on sequence groups and virtual sequence groups. The degree of similarity calculation means (131) selects from the sequence groups a comparison source and a comparison destination, and calculates the difference in frequency of the virtual sequences with respect to a comparison source sequence and a comparison destination sequence, as a degree of similarity of the comparison destination sequence with respect to the comparison source sequence. In a case where the degree of similarity of the comparison destination sequence with respect to the comparison source sequence fulfills the allowable conditions of the degree of similarity set to the virtual sequence group, the candidate sequence selection means (132) selects the comparison source sequence and the comparison destination sequence as a candidate sequence group for determining the similarities between sequences. For the candidate sequence group, by determining the similarities between sequences, one sequence and a sequence similar thereto are selected as a similar sequence information group.</p> |