发明名称 Partitioning of data mining training set
摘要 A system that effectuates fetching a complete set of relational data into a mining services server and subsequently defining desired partitions upon the fetched data is provided. In accordance with the innovation, the data can be locally cached and partitioned therefrom. Accordingly, upon the same mining structure (e.g., cache) that has been partitioned, the novel innovation can build mining models for each partition. In other words, the innovation can employ the concept of mining structure as a data cache while manipulating only partitions of this cache in certain operations. The innovation can be employed in scenarios where a user wants to train a mining model using only data points that satisfy a particular Boolean condition, a user wants to split the training set into multiple partitions (e.g., training/testing) and/or a user wants to perform a data mining procedure known as “N-fold cross validation.”
申请公布号 US7756881(B2) 申请公布日期 2010.07.13
申请号 US20060371477 申请日期 2006.03.09
申请人 MICROSOFT CORPORATION 发明人 CRIVAT IOAN BOGDAN;IYER RAMAN S.;MACLENNAN C. JAMES
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 代理人
主权项
地址