发明名称 METHOD AND APPARATUS FOR CLUSTERING DATA STREAM IN PROGRESS THROUGH ONLINE AND OFFLINE COMPONENTS
摘要 PROBLEM TO BE SOLVED: To improve cluster quality when data substantially proceeds with a lapse of time. SOLUTION: In regard to a technique performing data clustering of a data stream, first, online statistics are generated from a data stream. Thereafter, offline processing of the online statistics is performed when the offline processing is needed or desired. The online statistics can be generated through the reception of data points from the data stream, and the formation and the update of a data group. The offline processing can be performed by re-clustering the data point group in the periphery of the sampled data points, and a newly formed cluster is reported. COPYRIGHT: (C)2005,JPO&NCIPI
申请公布号 JP2005100363(A) 申请公布日期 2005.04.14
申请号 JP20040234267 申请日期 2004.08.11
申请人 INTERNATL BUSINESS MACH CORP <IBM> 发明人 AGGARWAL CHARU C;YU PHILIP SHI-LUNG
分类号 G06F17/30;G06F7/00;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址