Title of invention: Lexical association metric for knowledge-free extraction of phrasal terms
Abstract: A method and system for determining a lexical association of phrasal terms are described. A corpus having a plurality of words is received, and a plurality of contexts including one or more context words proximate to a word in the corpus is determined. An occurrence count for each context is determined, and a global rank is assigned based on the occurrence count. Similarly, a number of occurrences of a word being used in a context is determined, and a local rank is assigned to the word-context pair based on the number of occurrences. A rank ratio is then determined for each word-context pair. A rank ratio is equal to the global rank divided by the local rank for a word-context pair. A mutual rank ratio is determined by multiplying the rank ratios corresponding to a phrase. The mutual rank ratio is used to identify phrasal terms in the corpus.
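The steps in the abstract can be illustrated with a minimal Python sketch for two-word phrases. It is not the patented implementation, and several details are assumptions the abstract does not state: a context is taken to be a single adjacent word (left or right neighbor), ranks are dense ranks with 1 for the highest count and ties sharing a rank, and a word's local ranks are computed over all of its contexts together. The names `dense_ranks` and `mutual_rank_ratios` are hypothetical.

```python
from collections import Counter, defaultdict


def dense_ranks(counts):
    """Map each key to a rank: 1 for the highest count; ties share a rank."""
    ordered = sorted(set(counts.values()), reverse=True)
    rank_of_count = {c: i + 1 for i, c in enumerate(ordered)}
    return {key: rank_of_count[c] for key, c in counts.items()}


def mutual_rank_ratios(tokens):
    """Score each adjacent word pair by the product of its two rank ratios.

    Assumed context encoding (not from the source):
      ("R", w) means "immediately followed by w"
      ("L", w) means "immediately preceded by w"
    """
    context_counts = Counter()           # occurrence count per context
    pair_counts = defaultdict(Counter)   # word -> count per context

    bigrams = list(zip(tokens, tokens[1:]))
    for left, right in bigrams:
        # The left word occurs in the context "followed by right".
        pair_counts[left][("R", right)] += 1
        context_counts[("R", right)] += 1
        # The right word occurs in the context "preceded by left".
        pair_counts[right][("L", left)] += 1
        context_counts[("L", left)] += 1

    # Global rank: contexts ranked by overall occurrence count.
    global_rank = dense_ranks(context_counts)
    # Local rank: for each word, its contexts ranked by word-context count.
    local_rank = {w: dense_ranks(c) for w, c in pair_counts.items()}

    scores = {}
    for left, right in set(bigrams):
        # Rank ratio = global rank / local rank for a word-context pair.
        rr_left = global_rank[("R", right)] / local_rank[left][("R", right)]
        rr_right = global_rank[("L", left)] / local_rank[right][("L", left)]
        # Mutual rank ratio = product of the phrase's rank ratios.
        scores[(left, right)] = rr_left * rr_right
    return scores


if __name__ == "__main__":
    corpus = "a b a b a c".split()
    for phrase, score in sorted(mutual_rank_ratios(corpus).items()):
        print(phrase, score)
```

Under these assumptions, a high score means each word's context is ranked higher locally (for that word) than it is globally (across the corpus), which is the signal the abstract uses to identify phrasal terms. Longer phrases would need a wider context definition, which this sketch does not attempt.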
Publication number: US8078452(B2) Publication date: 2011.12.13
Application number: US20100814730 Filing date: 2010.06.14
Applicant: DEANE PAUL;EDUCATIONAL TESTING SERVICE Inventor: DEANE PAUL
Classification: G06F17/27;G06F17/21;G06F17/28 Main classification: G06F17/27
Agency: Agent:
Main claim:
Address: