发明名称 |
DEVICE, METHOD, AND PROGRAM FOR WORD SENSE ESTIMATION |
摘要 |
A device and method to estimate a word sense with high accuracy by unsupervised learning. A word sense estimation device executes a plurality of number of times a probability calculation of calculating an evaluation value for each word of a case where each concept extracted as a word sense candidate is determined as a word sense, based on a proximity between a context feature of a selected word and a context feature of another word, a proximity between a selected concept and a word sense of this another word, and a probability that the selected word takes a selected word sense, and of re-calculating the probability based on the evaluation value calculated, and estimates a concept with a higher probability calculated of said each word, to be a word sense of the word. |
申请公布号 |
US2015006155(A1) |
申请公布日期 |
2015.01.01 |
申请号 |
US201214366066 |
申请日期 |
2012.03.07 |
申请人 |
Tanigaki Koichi;Shiba Mitsuteru;Takayama Shigenobu |
发明人 |
Tanigaki Koichi;Shiba Mitsuteru;Takayama Shigenobu |
分类号 |
G06F17/28;G06F17/27 |
主分类号 |
G06F17/28 |
代理机构 |
|
代理人 |
|
主权项 |
1. A word sense estimation device comprising:
a word extraction part which extracts a plurality of words included in input data; a context analysis part which extracts, for each word extracted by the word extraction part, a context feature of a context in which the word appears in the input data; a word sense candidate extraction part which extracts each concept stored as a word sense of said each word, as a word sense candidate of said each word, from a concept dictionary storing at least one concept as a word sense of a word; and a word sense estimation part which executes a plurality of number of times a probability calculation of calculating an evaluation value for said each word of a case where said each concept extracted as the word sense candidate by the word sense candidate extraction part is determined as a word sense, based on a proximity between the context feature of a selected word and the context feature of another word, a proximity between a selected concept and a concept of a word sense candidate of said another word, and a probability that the selected word takes a selected word sense, and of re-calculating the probability based on the evaluation value calculated, and which estimates a concept with a higher probability calculated of said each word, to be a word sense of the word. |
地址 |
Tokyo JP |