发明名称 HIERARCHICAL SEQUENTIAL CLUSTERING
摘要 Embodiments of the invention provide systems and methods for analyzing sequential data. Analyzing the sequential data can include grouping or clustering data that are similar in some way, e.g., similar ranges of quantities, similar categories, etc. More specifically, a method for hierarchical clustering of sequential data can comprise creating a dotplot of the sequential data. The dotplot can represent a plurality of sequences within the sequential data. A number of clusters represented by the plurality of sequences can be initialized, e.g., one cluster per sequence. A pair of sequences of the plurality of sequences having a longest sequential match can be identified, e.g., based on a line fitting technique, and merged into a single cluster. Identifying a pair of sequences of the plurality of sequences having a longest sequential match and merging the identified pair of sequences into a single cluster can be repeated until a single cluster remains.
申请公布号 US2011078144(A1) 申请公布日期 2011.03.31
申请号 US20100831615 申请日期 2010.07.07
申请人 ORACLE INTERNATIONAL CORPORATION 发明人 HELFMAN JONATHAN;GOLDBERG JOSEPH H.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址