Title: Method and system for sample data selection to test and train predictive algorithms of customer behavior
Abstract: A method and system for sample data selection to test and train predictive algorithms of customer behavior are provided. The method and system generate frequency distributions of geographical characteristics for a customer database data set, a training data set, and a testing data set, and compare those distributions to determine whether there are discrepancies. If the discrepancies exceed a predetermined tolerance, one or more of the data sets may not be representative of the customer database once geographical influences on customer behavior are taken into account. Recommendations for improving the training data set and/or testing data set are then provided so that the data set is more representative of the customer database. In this way, "nuggeting" of customers is accounted for in the training and/or testing data sets.
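The comparison described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say which geographical characteristic or which discrepancy measure is used, so this sketch assumes a simple region label per customer and an absolute difference in relative frequency. The function names, the sample data, and the 0.05 tolerance are all hypothetical.

```python
from collections import Counter


def frequency_distribution(values):
    """Return the relative frequency of each distinct value."""
    counts = Counter(values)
    total = sum(counts.values())
    return {key: count / total for key, count in counts.items()}


def find_discrepancies(reference, sample, tolerance):
    """Return {region: difference} for every region whose share in
    `sample` differs from its share in `reference` by more than
    `tolerance` (assumed measure: absolute difference in share)."""
    ref_dist = frequency_distribution(reference)
    sample_dist = frequency_distribution(sample)
    discrepancies = {}
    for region in sorted(set(ref_dist) | set(sample_dist)):
        diff = sample_dist.get(region, 0.0) - ref_dist.get(region, 0.0)
        if abs(diff) > tolerance:
            discrepancies[region] = diff
    return discrepancies


def recommend(discrepancies):
    """Turn discrepancies into plain-text suggestions (hypothetical wording)."""
    suggestions = []
    for region, diff in discrepancies.items():
        action = "remove" if diff > 0 else "add"
        suggestions.append(f"{action} records for region '{region}'")
    return suggestions


# Hypothetical data: one region label per customer record.
customer_db = ["north"] * 50 + ["south"] * 30 + ["east"] * 20
training = ["north"] * 70 + ["south"] * 20 + ["east"] * 10
testing = ["north"] * 5 + ["south"] * 3 + ["east"] * 2

TOLERANCE = 0.05  # assumed value; the abstract only says "predetermined"

training_issues = find_discrepancies(customer_db, training, TOLERANCE)
testing_issues = find_discrepancies(customer_db, testing, TOLERANCE)

for line in recommend(training_issues):
    print("training set:", line)
if not testing_issues:
    print("testing set: within tolerance")
```

In this made-up data the training set over-represents "north" and under-represents the other two regions, so the sketch suggests rebalancing it, while the testing set mirrors the customer database and passes. A real system would likely use finer-grained geography and a statistical test in place of the fixed threshold.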
Publication No.: US7080052(B2) Publication Date: 2006.07.18
Application No.: US20010838732 Filing Date: 2001.04.19
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION Inventor: BUSCHE FREDERICK D.
Classification: G06N5/00;G06Q30/00 Main Classification: G06N5/00
Agency: Agent:
Principal Claim:
Address: