发明名称 Identifying non-compositional compounds
摘要 Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying non-compositional compounds. In one aspect, a method includes the actions of receiving a collection of phrases, each phrase including two or more words; for each phrase, determining if the phrase is a non-compositional compound, a non-compositional compound being a phrase of two or more words where the words composing the phrase have different meanings in a compound than their conventional meanings individual, the determining including: identifying a similar term for a term of the phrase, substituting the similar term for the term of the phrase to generate a substitute phrase, calculating a similarity between the phrase and the substitute phrase, and identifying the phrase as a non-compositional compound when the calculated similarity is less than a specified threshold value.
申请公布号 US8572081(B1) 申请公布日期 2013.10.29
申请号 US201213361565 申请日期 2012.01.30
申请人 YANG STEWART;LIU FANG;CAO PEI;GOOGLE INC. 发明人 YANG STEWART;LIU FANG;CAO PEI
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址