摘要 |
PURPOSE:To attain highly accurate morpheme division or the like by dividing a charac ter string based upon character sort changing information, and when morphemes is disconnected, specifying an unregistered word by the category of its part of speech, the sorts of characters and the number of characters. CONSTITUTION:A dictionary retrieving character string forming processing part 2 extracts all continued characters from an input character string and forms a partial character string for dictionary retrieval and a dictionary retrieving processing part 3 retrieves the partial character string from an independent word dictionary and an adjunct dictionary and determines its part of speech. Then an inter-morpheme connection checking processing part 4 checks the connecting condition of adjacent morphemes by a morpheme connection dictionary and a word table registering processing part 5 registers connectable morphemes. A character sort change point determining processing part 6 divides the input character string on a character sort changing position, and when an unregistered word exists, an unregistered word range determining processing part 7 determines the range of the character string including the unregistered word and an unregistered word range assembling processing part 9 specifies the unregistered word based upon the sort of the morpheme as a part of speech, the number characters in the morpheme, and so on. |