Title of invention: System, method, and computer-readable medium that facilitate in-database analytics with supervised data discretization
Abstract: A system, method, and computer-readable medium are provided that facilitate in-database supervised discretization mechanisms which improve data classification. The disclosed mechanisms provide an efficient, automatic, and repeatable way to perform data discretization without human intervention. They process large, complex, unknown data efficiently and, advantageously, do not require the data being analyzed to be processed outside the database. The disclosed mechanisms may use an External Stored Procedure to avoid multiple joins of large tables and to minimize the number of full table scans and, consequently, provide better performance than contemporary mechanisms. The disclosed system writes intermediate results to tables, which may be conveyed to a visualization subsystem, giving users a better understanding of the data distribution in each category. Further, the disclosed system and method introduce a novel similarity-based solution for merging intervals when chi-square testing is not reliable, and thereby improve the quality of the interval merge process.
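As an illustration of the merge decision described in the abstract, the following is a minimal Python sketch and not the patented implementation. It assumes each interval is summarized by a list of per-class record counts. It assumes the chi-square test is treated as unreliable when any expected cell count falls below 5, which is a common rule of thumb and not a criterion stated in this record. It also assumes cosine similarity of the class-count vectors as the fallback measure; the record does not say which similarity measure the patent uses. All function names and thresholds are hypothetical.

```python
from math import sqrt


def expected_counts(a, b):
    """Expected cell counts for the 2 x k table formed by intervals a and b."""
    total = sum(a) + sum(b)
    return [
        [sum(row) * (a[j] + b[j]) / total for j in range(len(a))]
        for row in (a, b)
    ]


def chi_square(a, b):
    """Pearson chi-square statistic for two adjacent intervals."""
    stat = 0.0
    for row, exp_row in zip((a, b), expected_counts(a, b)):
        for observed, expected in zip(row, exp_row):
            if expected > 0:  # skip classes absent from both intervals
                stat += (observed - expected) ** 2 / expected
    return stat


def cosine_similarity(a, b):
    """Cosine similarity of two class-count vectors (assumed fallback measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def should_merge(a, b, chi_threshold=3.841, sim_threshold=0.95, min_expected=5.0):
    """Decide whether two adjacent intervals should be merged.

    chi_threshold defaults to the 0.05 critical value for one degree of
    freedom (two classes). The other thresholds are illustrative assumptions.
    """
    smallest = min(min(row) for row in expected_counts(a, b))
    if smallest >= min_expected:
        # Enough data: merge when the class distributions do not differ significantly.
        return chi_square(a, b) < chi_threshold
    # Sparse data: chi-square is unreliable, so fall back to similarity.
    return cosine_similarity(a, b) >= sim_threshold


if __name__ == "__main__":
    print(should_merge([50, 50], [48, 52]))  # well-populated, similar intervals
    print(should_merge([2, 0], [0, 3]))      # sparse intervals, opposite classes
```

In this sketch the similarity branch is taken only for sparsely populated intervals, so well-populated intervals are still judged by the chi-square statistic alone.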
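Publication number: US8135667(B2) Publication date: 2012.03.13
Application number: US20090651086 Filing date: 2009.12.31
Applicant: LUO CONGNAN; TERADATA US, INC. Inventor: LUO CONGNAN
Classification: G06F7/00 Main classification: G06F7/00
Agency: Agent:
Main claim:
Address: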