摘要 |
PROBLEM TO BE SOLVED: To search a new phrase to be registered in a dictionary of a dividing means which breakes down a text into phrases. SOLUTION: This system inputs a text for learning into a dividing means to break down into phrases to produce break down candidates including the phrases different in combination according to the obtained break down reliability. It sums up the reliability of the break down candidates including those phrases for each phrase to find out their likelihood. Then, it finds out the combination minimizing the information entropy of the phrase considered to appear at the frequency matching the likelihood of the phrases in the combination within the extent that the text can be expressed by using the phrases included in a combination among the combinations of phrases included at least in one candidate, and to outputs it as a combination of phrases including the new phrase. COPYRIGHT: (C)2008,JPO&INPIT
|