Title of invention SYSTEM AND METHODS FOR INTERACTIVE DISPLAYS BASED ON ASSOCIATIONS FOR MACHINE-GUIDED RULE CREATION
Abstract This disclosure provides a computer-program product, system, method and apparatus for accessing a representation of a category or item and accessing a set of multiple transactions. The transactions are processed to identify the items found among them, and the items are ordered based on an information-gain heuristic. A depth-first search for a group of best association rules is then conducted using a best-first heuristic and constraints that make the search efficient. The best rules found during the search can then be displayed to a user, along with accompanying statistics. The user can then select the rules that appear most relevant, and further analytics can be applied to the selected rules to learn more about the information these rules provide.
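The item-ordering step in the abstract can be illustrated with a minimal sketch. It assumes each transaction is a set of items with a boolean label saying whether it belongs to the category, and it scores each item by the information gain of its presence or absence. The function names `entropy`, `information_gain` and `order_items` are hypothetical, and the abstract does not state the exact form of the heuristic, so this is one plausible reading rather than the disclosed method.

```python
from math import log2


def entropy(pos, total):
    """Binary entropy (in bits) of a group with `pos` positives out of `total`."""
    if total == 0 or pos == 0 or pos == total:
        return 0.0
    p = pos / total
    return -(p * log2(p) + (1 - p) * log2(1 - p))


def information_gain(transactions, labels, item):
    """Reduction in label entropy from splitting on presence of `item`."""
    n = len(transactions)
    with_item = [lab for t, lab in zip(transactions, labels) if item in t]
    without_item = [lab for t, lab in zip(transactions, labels) if item not in t]
    conditional = sum(
        len(part) / n * entropy(sum(part), len(part))
        for part in (with_item, without_item)
    )
    return entropy(sum(labels), n) - conditional


def order_items(transactions, labels):
    """Return all items seen in the transactions, highest gain first."""
    items = sorted({i for t in transactions for i in t})
    return sorted(
        items,
        key=lambda i: information_gain(transactions, labels, i),
        reverse=True,
    )


# Illustrative data only (not from the patent).
transactions = [{"loan", "rate"}, {"loan", "bank"}, {"game", "score"}, {"game", "rate"}]
labels = [True, True, False, False]
ranked = order_items(transactions, labels)
```

Ordering items this way lets a later search try the most discriminating items first, which is the role the abstract gives to the heuristic.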
Publication number US2015193523(A1) Publication date 2015.07.09
Application number US201514662443 Filing date 2015.03.19
Applicant SAS Institute Inc. Inventors Cox James Allen;Zhao Zheng;Barnes Arila;Peterson Jared;DuPont Samantha;Albright Russel
Classification G06F17/30;G06N5/02 Main classification G06F17/30
Agency Agent
Main claim 1. A non-transitory computer-readable storage medium having instructions stored thereon, the instructions executable to cause a data processing apparatus to perform operations including: accessing a representation of a document category; accessing a set of multiple documents, each of the documents in the set including a label indicating whether or not the document is included in the category; assembling a list of terms, wherein the terms include terms found in the documents of the set; and evaluating, using a graph search algorithm, association rules in a search space that includes the evaluated association rules and unevaluated association rules, wherein each of the evaluated association rules and each of the unevaluated association rules includes at least one of the terms in the list, wherein evaluating association rules includes performing the following computer operations with respect to each of the evaluated association rules: obtaining categorization results by using the evaluated association rule to individually categorize documents of the set; and estimating a precision of the evaluated association rule based on the categorization results; and selecting some of the evaluated association rules based on the precision estimated with respect to each of the evaluated association rules; displaying a tree graph on a computer display screen such that the tree graph includes a root node and additional nodes, wherein: the root node represents the document category; each of the additional nodes represents one of the selected association rules and the respective estimated precision; and edges of the tree graph connect nodes that represent selected association rules sharing terms in common.
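The precision-estimation and selection steps of the claim can be sketched as follows. This is a simplified illustration under stated assumptions, not the claimed implementation: a rule is modeled as a set of terms that matches a document when every term is present, precision is the fraction of matched documents labeled as in the category, and candidate rules are enumerated exhaustively up to a small length in place of the claimed graph search. The names `rule_precision`, `select_rules`, `shared_term_edges`, `max_len` and `min_precision` are hypothetical.

```python
from itertools import combinations


def rule_precision(rule, documents, labels):
    """Fraction of documents matched by `rule` that are in the category.

    Returns None when the rule matches no document.
    """
    matched = [lab for doc, lab in zip(documents, labels) if rule <= doc]
    if not matched:
        return None
    return sum(matched) / len(matched)


def select_rules(terms, documents, labels, max_len=2, min_precision=0.8):
    """Evaluate every rule of up to `max_len` terms; keep the precise ones.

    Exhaustive enumeration stands in for the graph search of the claim.
    """
    selected = {}
    for size in range(1, max_len + 1):
        for combo in combinations(terms, size):
            rule = frozenset(combo)
            precision = rule_precision(rule, documents, labels)
            if precision is not None and precision >= min_precision:
                selected[rule] = precision
    return selected


def shared_term_edges(rules):
    """Pairs of selected rules with at least one term in common."""
    return [(a, b) for a, b in combinations(rules, 2) if a & b]


# Illustrative data only (not from the patent).
documents = [{"loan", "rate"}, {"loan", "bank"}, {"game", "score"}, {"game", "rate"}]
labels = [True, True, False, False]
selected = select_rules(["loan", "rate", "bank"], documents, labels)
edges = shared_term_edges(selected)
```

The `selected` mapping pairs each rule with its estimated precision, which is the information the claim attaches to each non-root node, and `edges` lists the rule pairs that a display could connect because they share terms.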
Address Cary NC US