Abstract |
PROBLEM TO BE SOLVED: For document data written in a language that separates words with spaces, but in which no blank character codes are inserted between words, to suppress excessive insertion (a blank character code being inserted at a position where it should not be), compared with the case where blank character codes are inserted by detecting positions between words with a discriminant analysis method or the like. SOLUTION: A character data acceptance part 301 accepts document data. A character string acquisition part 302 acquires a character string on the basis of the character codes included in the document data. A character interval list creation part 305 creates a character interval list in which the character intervals of the acquired character string are arranged in order of size. A primary differentiation list creation part 306 creates a primary differentiation list indicating the amount of change before and after each character interval in the character interval list. A threshold determination part 307 determines, as a threshold, the character interval in the character interval list that corresponds to the maximum value in the created primary differentiation list. A blank insertion part 308 inserts a blank character code between characters whose character interval is greater than or equal to the determined threshold. SELECTED DRAWING: Figure 3 |
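
The threshold determination described above can be illustrated with a minimal Python sketch. It is not the claimed implementation. The function names `determine_threshold` and `insert_blanks`, the ascending sort order, and the choice of the interval on the upper side of the largest jump as the threshold are all assumptions made for illustration; the abstract states only that the intervals are arranged in order of size and that the threshold is the interval corresponding to the maximum of the primary differentiation list.

```python
from typing import Sequence


def determine_threshold(gaps: Sequence[float]) -> float:
    """Pick a word-gap threshold from the largest jump between sorted gaps.

    Hypothetical helper: assumes ascending order and takes the gap on the
    upper side of the largest jump as the threshold.
    """
    ordered = sorted(gaps)  # character interval list, smallest first
    if len(ordered) < 2:
        raise ValueError("need at least two character intervals")
    # Primary differentiation list: change between adjacent sorted gaps.
    diffs = [b - a for a, b in zip(ordered, ordered[1:])]
    # Index of the largest change (first occurrence on ties).
    k = max(range(len(diffs)), key=diffs.__getitem__)
    return ordered[k + 1]


def insert_blanks(chars: str, gaps: Sequence[float]) -> str:
    """Insert a space wherever the gap before a character meets the threshold.

    gaps[i] is the interval between chars[i] and chars[i + 1].
    """
    if len(gaps) != len(chars) - 1:
        raise ValueError("expected exactly one gap per adjacent character pair")
    threshold = determine_threshold(gaps)
    out = [chars[0]]
    for ch, gap in zip(chars[1:], gaps):
        if gap >= threshold:
            out.append(" ")
        out.append(ch)
    return "".join(out)


# Made-up gap values for illustration only (arbitrary units).
example_gaps = [1.0, 1.2, 5.0, 0.9, 1.1, 5.5, 1.0, 1.3]
print(insert_blanks("thecatsat", example_gaps))  # the cat sat
```

In this example the sorted gaps jump from 1.3 to 5.0, so 5.0 becomes the threshold and only the two wide gaps receive a blank. The sketch does not handle input in which every gap is the same size; in that case it would place a blank at every position.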