发明名称 Disambiguation of term occurrences
摘要 A method for extracting information from a corpus of data includes specifying a topic and a query term associated with the topic, and defining adjunct terms which may occur in the corpus in a context of the query term, the adjunct terms comprising one or more off-topic terms. Occurrences of the query term are found in the corpus, the occurrences including at least one occurrence of the query term together with at least one of the off-topic terms in the context of the query term. The at least one occurrence of the query term is classified as non-relevant to the topic responsively to the occurrence of the at least one of the off-topic terms in the context of the query term.
申请公布号 US7260571(B2) 申请公布日期 2007.08.21
申请号 US20030440883 申请日期 2003.05.19
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 AMITAY EINAT;NELKEN RANI;WAYNE NIBLACK;SMITH DAVID C;SOFFER AYA
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址