Title of invention SELECTION DEVICE FOR CANDIDATE SEQUENCE INFORMATION FOR SIMILARITY DETERMINATION, SELECTION METHOD, AND USE FOR SUCH DEVICE AND METHOD
Abstract The present invention provides a device that makes it easy to determine the similarities between sequence information pieces. The candidate selection device 10 of the present invention includes an input unit 11, a sequence storage section 121, a similarity degree storage section 122, a candidate sequence storage section 123, a similarity degree calculation unit 131, a candidate sequence selection unit 132, and an output unit 14. The input unit 11 is used to input information on a sequence group and a virtual sequence group. The similarity degree calculation unit 131 selects a comparison source and a comparison target from the sequence group, and calculates the difference in the frequency of each virtual sequence between the comparison source sequence and the comparison target sequence, as the similarity degree of the comparison target sequence with respect to the comparison source sequence. When the similarity degree of the comparison target sequence with respect to the comparison source sequence satisfies the allowable similarity degree condition set for the virtual sequence group, the candidate sequence selection unit 132 selects the comparison source sequence and the comparison target sequence as a candidate sequence group for determination of similarity between the sequences. By determining the similarities between sequences in the candidate sequence group, a given sequence and one or more sequences similar to it can be selected as a similar sequence information group.
Publication number US2015379197(A1) Publication date 2015.12.31
Application number US201414768030 Filing date 2014.02.14
Applicant NEC SOLUTION INNOVATORS, LTD. Inventors AKITOMI Jou; HORII Katsunori
Classification G06F19/22; G06F17/30 Main classification G06F19/22
Agency Agent
Principal claim 1. A candidate selection device for selecting, from a sequence information group comprising sequence information pieces, a candidate sequence information group comprising candidate sequence information pieces that serve as candidates for determination of similarity between the sequence information pieces, the candidate selection device comprising the following units (a), (b), (c), and (d): (a) a unit that performs the step of counting the frequency of each virtual sequence information piece included in a virtual sequence information group in each sequence information piece in the sequence information group; (b) a unit that performs the step of selecting, from the sequence information group, a sequence information piece that serves as a comparison source and a sequence information piece that serves as a comparison target; (c) a unit that performs the step of calculating the difference between the frequency of each virtual sequence information piece in the comparison source sequence information piece and the frequency of each virtual sequence information piece in the comparison target sequence information piece as the similarity degree of the comparison target sequence information piece with respect to the comparison source sequence information piece; and (d) a unit that performs the step of selecting, when the similarity degree of the comparison target sequence information piece with respect to the comparison source sequence information piece satisfies an allowable similarity degree condition set for the virtual sequence information group, the comparison source sequence information piece and the comparison target sequence information piece as the candidate sequence information group for determination of similarity between the sequence information pieces.
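Steps (a) to (d) of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation, and several details are assumptions made only for illustration: the virtual sequence group is taken to be all 2-base strings over A, C, G, T; occurrences are counted with overlap; each unordered pair of sequences is treated as one comparison source and target; and the allowable similarity degree condition is taken to be "the sum of absolute frequency differences does not exceed a threshold". The function names, the sample sequences, and the `max_total_difference` parameter are hypothetical.

```python
from itertools import combinations, product


def count_occurrences(sequence, pattern):
    """Count occurrences of pattern in sequence, including overlapping ones."""
    return sum(
        1
        for i in range(len(sequence) - len(pattern) + 1)
        if sequence.startswith(pattern, i)
    )


def frequency_profile(sequence, virtual_sequences):
    """Step (a): frequency of each virtual sequence in one sequence."""
    return [count_occurrences(sequence, v) for v in virtual_sequences]


def similarity_degree(source_profile, target_profile):
    """Step (c): per-virtual-sequence frequency differences (absolute values)."""
    return [abs(s - t) for s, t in zip(source_profile, target_profile)]


def select_candidates(sequences, virtual_sequences, max_total_difference):
    """Steps (b) and (d): compare each pair and keep those within the threshold.

    The threshold on the summed differences is an assumed form of the
    'allowable similarity degree condition'; the claim does not fix its form.
    """
    profiles = {
        name: frequency_profile(seq, virtual_sequences)
        for name, seq in sequences.items()
    }
    candidates = []
    for source, target in combinations(sequences, 2):  # step (b)
        diffs = similarity_degree(profiles[source], profiles[target])
        if sum(diffs) <= max_total_difference:  # step (d), assumed condition
            candidates.append((source, target))
    return candidates


# Hypothetical inputs: all 16 two-base virtual sequences and three short sequences.
virtual_sequences = ["".join(p) for p in product("ACGT", repeat=2)]
sequences = {
    "seq1": "ACGTACGT",
    "seq2": "ACGTACGA",
    "seq3": "TTTTTTTT",
}
candidates = select_candidates(sequences, virtual_sequences, max_total_difference=2)
print(candidates)  # [('seq1', 'seq2')]
```

In this toy input, seq1 and seq2 differ only in the counts of GT and GA, so their summed difference is 2 and the pair is kept as a candidate; both pairs involving seq3 differ by 14 and are dropped. Only the surviving pairs would then go on to the more expensive similarity determination.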
Address Tokyo JP