摘要 |
PROBLEM TO BE SOLVED: To extract a suitable representative notation from a given group of character strings.SOLUTION: A plurality of given character strings are divided into tokens respectively. On all token arrays in which one or more tokens are directly connected to constitute all or a part of the character strings, the number of character strings including the token array out of the plurality of character strings is counted. The token array in which the counted number of character strings is a predetermined threshold or more, that is, the token array excluding the token array included in the other long token array out of the token array, is selected. The selected token array is output as the extracted result. |