摘要 |
<p>PURPOSE:To accurately decide the unknown words in a HIRAGANA (cursive form of Japanese syllabary) character string by supposing that the unknown words are equal to independent words when these unknown words are processed in analysis of the morpheme of a KANJI (Chinese characters)-KANA (Japanese syllabary) Japanese sentence. CONSTITUTION:If the head character of an unknown word part is HIRAGANA, a word is extracted out of those words following the head one for search of a postpositional word. When said postpositional word is searched, it is checked whether or not the character right after the postpositional word has a change of character type. If so, the characters covering the head one through the one right before the postpositional word are defined as an unknown word. If no change of character type is detected, a word is extracted from those words immediately after the postpositional word. Then the characters covering the head one through the one right before the postpositional word are decided as an unknown word when just a single candidate word is detected.</p> |