发明名称 METHOD FOR RECOGNIZING CHARACTER STRING OF JAPANESE PROSAIC OR COLLOQUIAL SENTENCE AS WORD STREAM BY COMPUTER PROCESSING AND SOFTWARE RECORDING MEDIUM
摘要 PROBLEM TO BE SOLVED: To make a computer recognize a sentence such as prose including colloquial expressions as a word. SOLUTION: A database storing a lot of sample word sets prepared on the basis of a lot of sample sentences is prepared. A subject constitutive word composing of the subject of a processing object sentence composed of KANA/ KANJI mixed character strings is extracted. The database is retrieved with the subject constitutive word as a keyword and the word set including this word is extracted as a subject related sample word set. It is retrieved whether the word included in the subject related sample word set is included in the character string of the processing object or not and when such a word is included, it is recognized as a word and breaks are inserted before and after that word.
申请公布号 JP2001051993(A) 申请公布日期 2001.02.23
申请号 JP19990229086 申请日期 1999.08.13
申请人 GALA INC 发明人 KIKUKAWA AKIRA
分类号 G06F17/22 主分类号 G06F17/22
代理机构 代理人
主权项
地址