Title of invention: Methods, systems, and computer program products for integrated world wide web query classification
Abstract: Implementing query classification includes executing a reductionist module on a query to extract a core term, which is used to search a hash table that maps core terms to corresponding categories, deriving a first result including one of the categories from the search, and executing an enrichment module on the query to yield a second result. The enrichment module includes searching an index of terms that are mapped to documents and corresponding categories. Upon determining the core term is present in the hash table, a weighted average is calculated for values of the first and second results based on training data. Upon determining the core term from the query is not in the hash table, and also that a probability score of the category in the index for the second result meets a minimum confidence value, the core term and the corresponding categories are stored in the hash table.
Publication number: US9465862(B2) Publication date: 2016.10.11
Application number: US201514614606 Filing date: 2015.02.05
Applicant: AT&T INTELLECTUAL PROPERTY I, L.P. Inventors: Agrawal Ritesh Jitendra; King Irwin; Zajac Remi
Classification: G06F17/30; G06N7/00; G06N99/00 Main classification: G06F17/30
Agency: Cantor Colburn LLP Agent: Cantor Colburn LLP
Independent claim: 1. A method for integrating query categories, comprising: executing, at a computer, a reductionist module on a search query to extract a core term from the search query, the core term used to search a hash table that maps core terms to corresponding categories; deriving a first result comprising at least one of the categories from the search of the hash table; executing at the computer an enrichment module on the search query to yield a second result, the enrichment module including searching an index of terms that are mapped to documents and corresponding categories in the index, the second result indicative of one of the corresponding categories in the index based on a probability score; upon determining the core term is present in the hash table, calculating a weighted average for corresponding values of the first result and the second result based on training data acquired from the execution of the reductionist module and the execution of the enrichment module, the calculated weighted average stored in a memory device; and upon determining the core term from the search query is not listed in the hash table, and upon determining the probability score of the one of the corresponding categories in the index for the second result meets a minimum defined confidence value, inserting and storing the core term and the one of the corresponding categories in the hash table and mapping the core term to the one of the corresponding categories in the hash table.
Address: Atlanta GA US
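
The control flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and every name in it is hypothetical: the stopword list, the "last remaining token" rule for picking a core term, the per-term category weights in the index, the fixed weight `w_reduce` (the claim derives the weighting from training data, which is not modeled here), and the threshold `min_confidence` are all assumptions made for illustration.

```python
# Illustrative sketch only; all names, weights, and heuristics are assumptions.

STOPWORDS = {"the", "a", "an", "of", "for", "in", "near", "best", "cheap"}


def extract_core_term(query):
    """Assumed reductionist step: drop stopwords, keep the last remaining token."""
    tokens = [t for t in query.lower().split() if t not in STOPWORDS]
    return tokens[-1] if tokens else ""


def enrich(query, index):
    """Assumed enrichment step: sum per-category weights of matched index terms
    and return the top category with its normalized score as a probability."""
    totals = {}
    for token in query.lower().split():
        for category, weight in index.get(token, {}).items():
            totals[category] = totals.get(category, 0.0) + weight
    if not totals:
        return None, 0.0
    best = max(totals, key=totals.get)
    return best, totals[best] / sum(totals.values())


def classify(query, hash_table, index, w_reduce=0.6, min_confidence=0.8):
    """Return (category, score) and update hash_table as described in the claim."""
    core = extract_core_term(query)
    second, prob = enrich(query, index)

    if core in hash_table:
        # Core term is known: combine both results with a weighted average.
        # The hash-table hit is treated as a score of 1.0 for its category.
        first = hash_table[core]
        scores = {first: w_reduce}
        if second is not None:
            scores[second] = scores.get(second, 0.0) + (1.0 - w_reduce) * prob
        best = max(scores, key=scores.get)
        return best, scores[best]

    # Core term is unknown: store it only if the enrichment result is confident.
    if core and second is not None and prob >= min_confidence:
        hash_table[core] = second
    return second, prob
```

With a toy index such as `{"plumber": {"home services": 1.0}}`, calling `classify("plumber", {}, index)` would add the mapping `plumber -> home services` to the hash table, because the enrichment score meets the assumed threshold. A lower-scoring query leaves the table unchanged.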