Abstract |
<p>Methods for extracting a longest common subsequence (LCS) from two sequences are used to find partial data shared across bulk data, for example in protein sequence analysis. Because a token, the smallest unit that makes up a sequence, generally appears several times in a sequence, an existing LCS extraction method has to trace every possible pairing of matching tokens. When no token appears more than once in a sequence, however, each token has at most one possible match, so this tracing is unnecessary. The present invention extracts the LCS from sequences in which tokens are not duplicated more rapidly than the existing method.</p> |
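The abstract does not disclose the procedure itself, so the following is only an illustrative sketch of one well-known way to exploit the no-duplicate condition, not the claimed method. When every token is unique within each sequence, each token of the first sequence matches at most one position in the second, and the LCS reduces to a longest increasing subsequence over those positions, which can be found in O(n log n) time rather than the O(nm) of the usual dynamic-programming table. The function name `lcs_unique` and its variable names are hypothetical.

```python
from bisect import bisect_left


def lcs_unique(a, b):
    """Return one LCS of a and b, assuming no token repeats within a or within b."""
    # Each token of b has exactly one position because tokens are unique.
    pos_in_b = {tok: j for j, tok in enumerate(b)}

    # Positions in b of the tokens of a that also occur in b, in a's order.
    idx = [pos_in_b[tok] for tok in a if tok in pos_in_b]

    # Longest strictly increasing subsequence of idx (patience sorting).
    tail_vals = []           # tail_vals[k]: smallest tail value of a run of length k + 1
    tails = []               # tails[k]: index into idx of that tail
    prev = [-1] * len(idx)   # back-pointers for reconstruction
    for i, v in enumerate(idx):
        k = bisect_left(tail_vals, v)
        if k == len(tail_vals):
            tail_vals.append(v)
            tails.append(i)
        else:
            tail_vals[k] = v
            tails[k] = i
        prev[i] = tails[k - 1] if k > 0 else -1

    # Walk the back-pointers from the end of the longest run.
    out = []
    i = tails[-1] if tails else -1
    while i != -1:
        out.append(b[idx[i]])
        i = prev[i]
    out.reverse()
    return out


if __name__ == "__main__":
    print(lcs_unique("ABCDE", "AEBDC"))  # ['A', 'B', 'D']
```

The sketch returns one LCS when several of equal length exist, and it gives incorrect results if a token is duplicated in either input, since the position map then keeps only the last occurrence.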