Title of invention: CORPUS SEARCH SYSTEMS AND METHODS
Abstract: A corpus of texts relating to a domain of knowledge may be searched by determining noun-pair proximity scores measuring associations between pairs of nouns that appear in the corpus and that are semantically related to the domain of knowledge. When a search term is received, the noun-pair proximity scores may be used (at least in part) to identify one or more related nouns that are strongly associated with the search term within the corpus. One or more texts in which the search term and the related nouns appear near each other in one or more places may then be selected from the corpus. The selected texts may be categorized and/or clustered based on the related nouns before being returned for presentation as search results.
Publication number: US2015261850(A1); Publication date: 2015.09.17
Application number: US201414216059; Filing date: 2014.03.17
Applicant: NLPCore LLC; Inventor: MITTAL Varun
Classification: G06F17/30; Main classification: G06F17/30
Agency: ; Agent:
Principal claim: 1. A computer-implemented method for searching a corpus of texts relating to a domain of knowledge, the method comprising: determining, by said computer, a multiplicity of noun-pair proximity scores measuring associations between pairs of nouns that appear in said texts and that are semantically related to said domain of knowledge; obtaining, by said computer, a search term related to said domain of knowledge; identifying, by said computer based at least in part on said multiplicity of noun-pair proximity scores, a related noun that is strongly associated with said search term within said corpus; selecting, by said computer, from said corpus a plurality of texts, in each of which said search term and said related noun appear near each other in at least one place; and providing, by said computer, data associated with said plurality of selected texts for presentation as search results.
Address: Seattle WA US
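The steps recited in the principal claim (scoring noun pairs, identifying related nouns, selecting texts, grouping results) can be illustrated with a minimal Python sketch. Nothing here is taken from the patent's actual implementation: the fixed token window `WINDOW`, the inverse-distance scoring formula, the use of a supplied `domain_nouns` vocabulary in place of real noun extraction, and every function name are assumptions made purely for illustration.

```python
import re
from collections import defaultdict

WINDOW = 5  # assumed: maximum token distance that counts as "near"


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def proximity_scores(corpus, domain_nouns, window=WINDOW):
    """Accumulate a score per unordered noun pair.

    Assumed scoring rule: each co-occurrence within `window` tokens
    contributes 1/distance, so closer pairs score higher.
    """
    scores = defaultdict(float)
    for text in corpus:
        tokens = tokenize(text)
        hits = [(i, t) for i, t in enumerate(tokens) if t in domain_nouns]
        for a in range(len(hits)):
            for b in range(a + 1, len(hits)):
                (i, noun_a), (j, noun_b) = hits[a], hits[b]
                if noun_a != noun_b and j - i <= window:
                    scores[frozenset((noun_a, noun_b))] += 1.0 / (j - i)
    return dict(scores)


def related_nouns(term, scores, top_k=1):
    """Return the top_k nouns most strongly associated with `term`."""
    candidates = []
    for pair, score in scores.items():
        if term in pair:
            (other,) = pair - {term}
            candidates.append((-score, other))
    # Sort by descending score, then alphabetically for determinism.
    return [noun for _, noun in sorted(candidates)[:top_k]]


def appears_near(tokens, a, b, window=WINDOW):
    """True if tokens `a` and `b` occur within `window` positions."""
    pos_a = [i for i, t in enumerate(tokens) if t == a]
    pos_b = [i for i, t in enumerate(tokens) if t == b]
    return any(abs(i - j) <= window for i in pos_a for j in pos_b)


def select_texts(corpus, term, related, window=WINDOW):
    """Indices of texts where `term` and `related` appear near each other."""
    return [
        idx
        for idx, text in enumerate(corpus)
        if appears_near(tokenize(text), term, related, window)
    ]


def search(corpus, domain_nouns, term, top_k=2, window=WINDOW):
    """Group selected text indices by the related noun that matched."""
    scores = proximity_scores(corpus, domain_nouns, window)
    return {
        noun: select_texts(corpus, term, noun, window)
        for noun in related_nouns(term, scores, top_k)
    }
```

Calling `search(corpus, domain_nouns, "kinase")` on a small list of strings would return a dictionary keyed by related noun, which loosely corresponds to the categorizing of selected texts mentioned in the abstract. A real system would presumably identify nouns with a part-of-speech tagger and use a more careful association measure; both are outside what this record specifies.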