摘要 |
The invention improves the accuracy of extraction of words that are included in a dictionary but are ambiguous, and words that are not included in the dictionary. A training data generating device is provided with: a storage unit which stores manually-generated dictionary data; an input device which accepts as input text to be learned, without tags; and a processing unit which, on the basis of word-meaning information indicating a meaning classification of words contained in the dictionary data, generates text with tags from the text to be learned, without tags. |