Title of invention: METHOD AND DEVICE FOR ANALYZING MORPHEME
Abstract: PROBLEM TO BE SOLVED: To resolve ambiguity that the minimum cost method cannot resolve, by calculating the certainty of a morpheme analysis result according to the normalized frequency of an input character string in a corpus, and by extracting an analysis result through word dictionary lookup, independent word retrieval, adjunct retrieval, connection examination against a connection table, unknown word segmentation, and analysis table generation. SOLUTION: A normalized frequency calculation part 11 calculates, for an input character string 13, the normalized frequency in a corpus consisting of a large amount of electronic document information. According to this normalized frequency, a cost calculation part 2 calculates the certainty of a morpheme analysis result. An independent word retrieval part 3 obtains grammatical information on independent words by referring to the word dictionary 4, and an adjunct retrieval part 5 likewise obtains grammatical information on adjuncts. A connection examination part 6 examines the connections between morphemes by referring to the connection table 7, and an unknown word segmentation part 9 segments unknown word candidate character strings and adds them to the independent word candidates. The analysis table is generated through these processes, and an analysis result is extracted and output.
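The abstract does not give the formula for the normalized frequency or for the cost derived from it, so the following Python sketch is only an illustration under stated assumptions. It assumes the normalized frequency is the occurrence count of a candidate string divided by the number of same-length windows in the corpus, that the cost is the negative logarithm of that value, and that unknown candidates carry a fixed penalty. The names `normalized_frequency`, `cost`, `segment`, `UNKNOWN_PENALTY`, and `max_unknown_len` are hypothetical, and the connection table check performed by part 6 is omitted.

```python
import math

# Assumed fixed extra cost for a candidate that is not in the word dictionary.
UNKNOWN_PENALTY = 5.0


def normalized_frequency(candidate: str, corpus: str) -> float:
    """Occurrences of `candidate` in `corpus` divided by the number of
    same-length windows (an assumed normalization, not the patent's)."""
    n = len(candidate)
    windows = len(corpus) - n + 1
    if n == 0 or windows <= 0:
        return 0.0
    count = sum(1 for i in range(windows) if corpus.startswith(candidate, i))
    return count / windows


def cost(candidate: str, corpus: str, floor: float = 1e-6) -> float:
    """Lower cost means higher certainty; `floor` avoids log(0)."""
    return -math.log(max(normalized_frequency(candidate, corpus), floor))


def segment(text, dictionary, corpus, max_unknown_len=4):
    """Minimum-total-cost segmentation by dynamic programming.

    Dictionary words of any length are candidates; other substrings are
    treated as unknown word candidates up to `max_unknown_len` characters
    (which must be at least 1 so every position stays reachable).
    """
    n = len(text)
    # best[i] = (lowest cost of segmenting text[:i], start of last piece)
    best = [(math.inf, None)] * (n + 1)
    best[0] = (0.0, None)
    for end in range(1, n + 1):
        for start in range(end):
            piece = text[start:end]
            known = piece in dictionary
            if not known and len(piece) > max_unknown_len:
                continue
            total = best[start][0] + cost(piece, corpus)
            if not known:
                total += UNKNOWN_PENALTY
            if total < best[end][0]:
                best[end] = (total, start)
    # Walk the back-pointers from the end of the text to recover the pieces.
    pieces = []
    pos = n
    while pos > 0:
        start = best[pos][1]
        pieces.append(text[start:pos])
        pos = start
    return pieces[::-1]


if __name__ == "__main__":
    toy_corpus = "abcabcab"
    print(segment("abcab", {"abc", "ab"}, toy_corpus))
```

In this toy setup the corpus statistics, rather than word length or word count alone, decide between competing segmentations. A fuller version would also reject adjacent pieces whose parts of speech are not permitted by a connection table.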
Publication number: JPH10240735(A)  Publication date: 1998.09.11
Application number: JP19970043955  Application date: 1997.02.27
Applicant: MITSUBISHI ELECTRIC CORP  Inventors: AIKAWA TAKEYUKI; HOSODA HARUMI; TAKAYAMA YASUHIRO
Classification: G06F17/21; G06F17/22; G06F17/27  Main classification: G06F17/21
Agency:  Agent:
Main claim:
Address: