Title of invention Morphological analyzer, morphological analysis method, and morphological analysis program
Abstract An input text is analyzed into morphemes by a prescribed morphological analysis procedure, which generates word strings with part-of-speech tags as hypotheses. For parts of speech that have forms, the tags include form information. The probability of occurrence of each hypothesis in a corpus of text is calculated using two or more part-of-speech n-gram models, at least one of which takes the forms of the parts of speech into consideration. Lexicalized models and class models may also be used. The models are weighted, and their probabilities are combined according to the weights to obtain a single probability for each hypothesis. The hypothesis with the highest probability is selected as the solution to the morphological analysis. By combining multiple models, this method can resolve ambiguity more accurately than methods that use only a single model.
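The weighting and selection steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that the combination is a weighted sum of whole-hypothesis probabilities, that each model is a tag bigram model, and that form information is written as a suffix on the tag (for example `VERB/pres`). The function names, probability tables, and weights are all hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, Sequence, Tuple

Token = Tuple[str, str]            # (word, tag); the tag may carry a form, e.g. "VERB/pres"
Hypothesis = Sequence[Token]
Model = Callable[[Hypothesis], float]


def make_bigram_model(table: Dict[Tuple[str, str], float],
                      key: Callable[[str], str],
                      floor: float = 1e-6) -> Model:
    """Build a tag-bigram scorer from a (hypothetical) probability table.

    `key` maps a full tag to the unit the model conditions on, so one model
    can ignore form information while another keeps it.
    """
    def score(hyp: Hypothesis) -> float:
        prob, prev = 1.0, "<s>"    # "<s>" marks the start of the sentence
        for _word, tag in hyp:
            cur = key(tag)
            prob *= table.get((prev, cur), floor)  # unseen bigrams get a small floor
            prev = cur
        return prob
    return score


def combine(models: Sequence[Model], weights: Sequence[float],
            hyp: Hypothesis) -> float:
    """Weighted sum of the model probabilities (assumed combination rule)."""
    if len(models) != len(weights):
        raise ValueError("need exactly one weight per model")
    return sum(w * m(hyp) for w, m in zip(weights, models))


def best_hypothesis(hyps: Sequence[Hypothesis], models: Sequence[Model],
                    weights: Sequence[float]) -> Hypothesis:
    """Return the hypothesis with the highest combined probability."""
    return max(hyps, key=lambda h: combine(models, weights, h))


# Toy tables with invented values: one model ignores forms, one uses them.
coarse = make_bigram_model(
    {("<s>", "NOUN"): 0.5, ("NOUN", "VERB"): 0.4, ("NOUN", "NOUN"): 0.3},
    key=lambda tag: tag.split("/")[0],
)
with_forms = make_bigram_model(
    {("<s>", "NOUN"): 0.5, ("NOUN", "VERB/pres"): 0.3, ("NOUN", "NOUN"): 0.2},
    key=lambda tag: tag,
)

hypotheses = [
    [("time", "NOUN"), ("flies", "VERB/pres")],
    [("time", "NOUN"), ("flies", "NOUN")],
]
winner = best_hypothesis(hypotheses, [coarse, with_forms], [0.6, 0.4])
```

In this sketch the weights are fixed by hand. The abstract does not say how the weights are obtained, so any estimation procedure would be a further assumption.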
Publication number US2004243409(A1) Publication date 2004.12.02
Application number US20040812000 Filing date 2004.03.30
Applicant OKI ELECTRIC INDUSTRY CO., LTD. Inventor NAKAGAWA TETSUJI
Classification G06F17/27;G06F17/28;G10L15/18;(IPC1-7):G10L15/12 Main classification G06F17/27
Agency Agent
Main claim
Address