Title of invention |
Automatic clustering of tokens from a corpus for grammar acquisition |
Abstract |
In a method of learning grammar from a corpus, context words are identified from the corpus. For the remaining non-context words, the method counts the occurrences of predetermined relationships with the context words and maps the counted occurrences to a multidimensional frequency space. Clusters are grown from the frequency vectors. The clusters represent classes of words; words in the same cluster possess the same lexical significance and provide an indicator of grammatical structure.
|
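The steps named in the abstract (identify context words, count relationships, map to frequency vectors, grow clusters) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent text: the function names `context_words`, `frequency_vectors`, and `grow_clusters` are hypothetical, context words are assumed to be the `k` most frequent tokens, the "predetermined relationships" are assumed to be adjacency at offsets -1 and +1, vectors are normalized to relative frequencies, and clusters are grown greedily with a Euclidean distance threshold.

```python
from collections import Counter
import math


def context_words(tokens, k):
    """Assumption: the k most frequent tokens serve as context words."""
    counts = Counter(tokens)
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ranked[:k]]


def frequency_vectors(tokens, contexts, offsets=(-1, 1)):
    """Count how often each non-context word has a context word at each offset.

    Assumption: a "relationship" is a (context word, offset) pair; each pair
    is one dimension of the frequency space.
    """
    index = {}
    for c in contexts:
        for o in offsets:
            index[(c, o)] = len(index)
    ctx = set(contexts)
    counts = {}
    for pos, word in enumerate(tokens):
        if word in ctx:
            continue
        vec = counts.setdefault(word, [0] * len(index))
        for o in offsets:
            j = pos + o
            if 0 <= j < len(tokens) and tokens[j] in ctx:
                vec[index[(tokens[j], o)]] += 1
    # Normalize to relative frequencies so frequent and rare words are comparable.
    vectors = {}
    for word, vec in counts.items():
        total = sum(vec)
        vectors[word] = [v / total for v in vec] if total else [0.0] * len(vec)
    return vectors


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def grow_clusters(vectors, threshold):
    """Greedy growth: a word joins the first cluster that already contains a
    member within `threshold` of it; otherwise it starts a new cluster."""
    clusters = []
    for word in sorted(vectors):
        for cluster in clusters:
            if any(euclidean(vectors[word], vectors[m]) <= threshold for m in cluster):
                cluster.append(word)
                break
        else:
            clusters.append([word])
    return clusters


if __name__ == "__main__":
    corpus = "the cat sat . the dog sat . the cat ran . the dog ran .".split()
    ctx = context_words(corpus, 2)
    print(grow_clusters(frequency_vectors(corpus, ctx), threshold=0.5))
```

On this toy corpus the nouns and the verbs end up in separate clusters because they share the same neighboring context words, which is the sense in which a cluster stands for a word class.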
Application publication number |
US2002002454(A1) |
Application publication date |
2002.01.03 |
Application number |
US20010912461 |
Filing date |
2001.07.26 |
Applicant |
BANGALORE SRINIVAS;RICCARDI GIUSEPPE |
Inventor |
BANGALORE SRINIVAS;RICCARDI GIUSEPPE |
Classification number |
G06F17/27;G06K9/62;(IPC1-7):G06F17/27 |
Main classification number |
G06F17/27 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|