发明名称 |
Semi-supervised part-of-speech tagging |
摘要 |
A word is selected from a received text and features are identified from the word. The features are applied to a model to identify probabilities for sets of part-of-speech tags. The probabilities for the sets of part-of-speech tags are used to weight scores for possible part-of-speech tags for the selected word to form weighted scores. The weighted scores are used to select a part-of-speech tag for the word and the selected part of speech tag is stored or output. The scores for the possible part-of-speech tags are based on variational approximation parameters trained from a sparse prior over probability distributions describing the probability of a part-of-speech tag given a word. |
申请公布号 |
US8275607(B2) |
申请公布日期 |
2012.09.25 |
申请号 |
US20070954212 |
申请日期 |
2007.12.12 |
申请人 |
TOUTANOVA KRISTINA NIKOLOVA;JOHNSON MARK EDWARD;MICROSOFT CORPORATION |
发明人 |
TOUTANOVA KRISTINA NIKOLOVA;JOHNSON MARK EDWARD |
分类号 |
G06F17/27 |
主分类号 |
G06F17/27 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|