Title IDENTIFYING WORD-SENSES BASED ON LINGUISTIC VARIATIONS
Abstract One or more words are received. A set of frequency-of-occurrence values of the received word(s) within a set of domain tables is determined. A domain table in the set of domain tables is associated with the received word(s), based on the set of frequency-of-occurrence values meeting a threshold value. A word-sense of the received word(s) is determined based on a corresponding word-sense in the associated domain table and/or a corresponding domain dictionary.
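The steps in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the table layout (domain, then word, then a frequency and a sense), the sample domains, the frequency values, and the function names are all assumptions made for the example.

```python
from typing import Dict, List, Tuple

# Hypothetical domain tables: domain -> word -> (frequency of occurrence, word-sense).
# All entries and numbers are invented for illustration.
DomainTables = Dict[str, Dict[str, Tuple[float, str]]]

DOMAIN_TABLES: DomainTables = {
    "medicine": {"culture": (0.004, "growth of microorganisms in a nutrient medium")},
    "sociology": {"culture": (0.009, "shared customs and beliefs of a group")},
    "finance": {"culture": (0.0001, "norms of an organization")},
}


def associate_domains(word: str, tables: DomainTables, threshold: float) -> List[str]:
    """Return the domains whose frequency value for `word` meets the threshold."""
    return [
        domain
        for domain, table in tables.items()
        if word in table and table[word][0] >= threshold
    ]


def identify_word_senses(word: str, tables: DomainTables, threshold: float) -> Dict[str, str]:
    """Map each associated domain to the word-sense stored in its table."""
    return {
        domain: tables[domain][word][1]
        for domain in associate_domains(word, tables, threshold)
    }


senses = identify_word_senses("culture", DOMAIN_TABLES, threshold=0.001)
```

With the assumed threshold of 0.001, the word is associated with the medicine and sociology tables but not the finance table, so only the first two senses are returned.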
Publication number US2017124068(A1) Publication date 2017.05.04
Application number US201715404241 Filing date 2017.01.12
Applicant International Business Machines Corporation Inventors Bishop Timothy A.; Boxwell Stephen A.; Brumfield Benjamin L.; Desai Nirav P.; Vernier Stanley J.
Classification G06F17/27; G06F19/00 Main classification G06F17/27
Agency Agent
Main claim 1. A computer program product for identifying word-senses, the computer program product comprising: one or more computer-readable storage media and program instructions stored on the one or more computer-readable storage media, the program instructions comprising: program instructions to generate a set of domain tables each comprising one or more arrays of aggregated statistical information corresponding to a plurality of words, one or more word-senses corresponding to the plurality of words, and temporal properties corresponding to the plurality of words, wherein the aggregated statistical information comprises a temporal frequency of occurrence value determined using an n-gram viewer; program instructions to receive a word; program instructions to identify the temporal frequency of occurrence value corresponding to the received word from each domain table in the set of domain tables; program instructions to associate the received word with one or more domain tables in the set of domain tables based on the temporal frequency of occurrence value corresponding to the received word in each of the one or more domain tables meeting a threshold value; and program instructions to identify one or more word-senses corresponding to the received word based on one or more corresponding word-senses in the associated one or more domain tables and based on one or more corresponding word-senses in a corresponding domain dictionary.
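One way to picture the temporal element of the claim is the sketch below. It is an illustration under stated assumptions only: the claim does not specify how a domain table is laid out or how the temporal frequency is computed, so the `Entry` class, the per-year frequency map, the averaging over a period, and every sample value are hypothetical. In particular, the frequencies are hard-coded here, whereas the claim obtains them using an n-gram viewer.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Entry:
    """Hypothetical domain-table row for one word."""
    senses: List[str]
    # Assumed representation of the temporal property: year -> relative frequency.
    freq_by_year: Dict[int, float]


def temporal_frequency(entry: Entry, start: int, end: int) -> float:
    """Mean relative frequency over the years in [start, end]; 0.0 if no data."""
    values = [f for year, f in entry.freq_by_year.items() if start <= year <= end]
    return sum(values) / len(values) if values else 0.0


def identify_senses(
    word: str,
    tables: Dict[str, Dict[str, Entry]],
    dictionaries: Dict[str, Dict[str, List[str]]],
    period: Tuple[int, int],
    threshold: float,
) -> Dict[str, List[str]]:
    """Associate `word` with domains meeting the threshold, then merge the
    table senses with the senses from that domain's dictionary."""
    start, end = period
    result: Dict[str, List[str]] = {}
    for domain, table in tables.items():
        entry = table.get(word)
        if entry is None or temporal_frequency(entry, start, end) < threshold:
            continue
        senses = list(entry.senses)
        for sense in dictionaries.get(domain, {}).get(word, []):
            if sense not in senses:
                senses.append(sense)
        result[domain] = senses
    return result


# Invented sample data: "cloud" is rare in computing text in 1990, common in 2010.
TABLES = {
    "computing": {"cloud": Entry(["remote computing infrastructure"], {1990: 0.0, 2010: 0.006})},
    "meteorology": {"cloud": Entry(["visible mass of condensed water vapor"], {1990: 0.005, 2010: 0.005})},
}
DICTIONARIES = {"computing": {"cloud": ["on-demand network-hosted services"]}}

early = identify_senses("cloud", TABLES, DICTIONARIES, (1985, 1995), 0.001)
late = identify_senses("cloud", TABLES, DICTIONARIES, (2005, 2015), 0.001)
```

For the earlier period only the meteorology table meets the assumed threshold, while for the later period both tables do, and the computing result also carries the sense contributed by its domain dictionary.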
Address Armonk, NY, US