Abstract |
<P>PROBLEM TO BE SOLVED: To properly perform machine-learning-based natural language processing on corpus text containing anaphoric relations, in which an anaphor such as a demonstrative or a zero pronoun loses the lexical information of its antecedent. <P>SOLUTION: The learning data are subjected to anaphora analysis for at least one of demonstratives and zero pronouns, and important expressions carrying the theme of a text, which appear in the learning data as pronouns or zero pronouns, are restored to their original word forms before being used as learning data. Learning that captures the meaning of the text more clearly is thereby realized. In a machine learning method using a parallel corpus, learning that more strongly reflects the intention of the learner is realized. <P>COPYRIGHT: (C)2005,JPO&NCIPI |
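The restoration step described in the SOLUTION can be illustrated with a minimal sketch. It assumes the anaphora analysis has already been run and has attached an antecedent to each resolved pronoun or zero pronoun. The `Token` class, its fields, and `restore_antecedents` are hypothetical names introduced only for illustration; the abstract does not specify any data structure or analyzer.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Token:
    """One token of learning data (hypothetical structure, not from the source)."""
    surface: str                      # text as it appears; "" for a zero pronoun
    antecedent: Optional[str] = None  # assumed output of a prior anaphora analysis
    is_zero: bool = False             # True for an omitted (zero) pronoun slot


def restore_antecedents(tokens: List[Token]) -> List[str]:
    """Replace resolved pronouns and zero pronouns with their antecedents."""
    restored = []
    for tok in tokens:
        if tok.antecedent is not None:
            # Resolved anaphor: use the antecedent's original word form.
            restored.append(tok.antecedent)
        elif not tok.is_zero:
            # Ordinary token: keep as-is.
            restored.append(tok.surface)
        # Unresolved zero pronouns have no surface form, so nothing is emitted.
    return restored


# "The printer jammed. It was repaired." with "It" resolved to "the printer".
sentence = [
    Token("The"), Token("printer"), Token("jammed"), Token("."),
    Token("It", antecedent="the printer"), Token("was"), Token("repaired"), Token("."),
]
print(" ".join(restore_antecedents(sentence)))
```

The restored token sequence, rather than the original one, would then be passed to the learner, so that features built from the text include the antecedent's lexical information.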