发明名称 Automatic clustering of tokens from a corpus for grammar acquisition
摘要 In a method of learning grammar from a corpus, context words are identified from a corpus. For the other non-context words, the method counts the occurrence of predetermined relationships which the context words, and maps the counted occurrences to a multidimensional frequency space. Clusters are grown from the frequency vectors. The clusters represent classes of words; words in the same cluster possess the same lexical significancy and provide an indicator of grammatical structure.
申请公布号 US2002002454(A1) 申请公布日期 2002.01.03
申请号 US20010912461 申请日期 2001.07.26
申请人 BANGALORE SRINIVAS;RICCARDI GIUSEPPE 发明人 BANGALORE SRINIVAS;RICCARDI GIUSEPPE
分类号 G06F17/27;G06K9/62;(IPC1-7):G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项
地址