发明名称 SYSTEM AND METHOD FOR ANALYZING CLUSTER RESULTS OF LARGE AMOUNTS OF DATA
摘要 <p>The present invention relates to a system and method for analyzing the cluster results of large amounts of data. The method uses an open source MapReduce framework called Hadoop in order to calculate silhouette coefficients, which are significance test indexes capable of evaluating the cluster results of large amounts of data. In order to implement same, clustered data are divided into blocks, and input splits are created for all of the blocks. Also, the created input splits are allocated to a plurality of computers, and each of the computers stores the data of the blocks included in the input splits to a memory to calculate silhouette coefficients for each record and provides the calculated silhouette coefficients to a characteristic coefficient calculator to obtain silhouette coefficients for clusters. Thus, cluster results of large amounts of data are effectively analyzed quickly and independently.</p>
申请公布号 WO2013151221(A1) 申请公布日期 2013.10.10
申请号 WO2012KR08986 申请日期 2012.10.31
申请人 SK PLANET CO., LTD. 发明人 LEE, CHAE HYUN;KIM, MIN SOENG;LEE, JUN SUP
分类号 G06F17/00 主分类号 G06F17/00
代理机构 代理人
主权项
地址