Title of invention: SYSTEM AND METHOD FOR CLUSTERING DATA IN INPUT AND OUTPUT SPACES
Abstract: A method is disclosed for clustering a plurality of documents having input and output space data, using both input and output space criteria. The method can include aggregating documents into clusters based on input and/or output space similarity measures, and then refining the clusters based on further input and/or output space similarity measures. Aggregating the documents into clusters can include forming a hierarchical tree based on the input and/or output space similarity measures. The hierarchical tree has a root node that branches into intermediate nodes, which in turn branch into leaf nodes covering individual documents, and the tree includes a leaf node for each document of the plurality of documents. The method can then include forming a forest of sub-trees of the hierarchical tree based on cluster criteria. Textual and numeric similarity measures can be used depending on the type and distribution of data in the input and output spaces.
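The tree-then-forest procedure summarized in the abstract can be illustrated with a minimal Python sketch. It is not the claimed method: the abstract names no concrete measures, so the Jaccard distance on tokens (input space), the range-normalized absolute difference (output space), the equal weighting `w_in`, the average-linkage merge rule, and the height threshold `max_height` used as the cluster criterion are all assumptions chosen for illustration, as are the field names `tokens` and `value`.

```python
from itertools import combinations

def jaccard_distance(a, b):
    """Textual distance between two token sets (assumed input-space measure)."""
    union = a | b
    return 1.0 - len(a & b) / len(union) if union else 0.0

def numeric_distance(x, y, scale):
    """Numeric distance (assumed output-space measure), normalized by `scale`."""
    return min(abs(x - y) / scale, 1.0) if scale else 0.0

def doc_distance(d1, d2, scale, w_in=0.5):
    """Weighted mix of input-space and output-space distances (assumed form)."""
    d_in = jaccard_distance(d1["tokens"], d2["tokens"])
    d_out = numeric_distance(d1["value"], d2["value"], scale)
    return w_in * d_in + (1.0 - w_in) * d_out

def build_tree(docs, w_in=0.5):
    """Agglomerative average-linkage tree with one leaf node per document."""
    values = [d["value"] for d in docs]
    scale = max(values) - min(values)
    # Each node records the document indices it covers, its children,
    # and the distance ("height") at which it was formed.
    nodes = [{"members": [i], "children": [], "height": 0.0}
             for i in range(len(docs))]

    def linkage(a, b):
        pairs = [(i, j) for i in a["members"] for j in b["members"]]
        total = sum(doc_distance(docs[i], docs[j], scale, w_in) for i, j in pairs)
        return total / len(pairs)

    while len(nodes) > 1:
        # Merge the two closest nodes until only the root remains.
        a, b = min(combinations(nodes, 2), key=lambda p: linkage(*p))
        merged = {"members": a["members"] + b["members"],
                  "children": [a, b],
                  "height": linkage(a, b)}
        nodes = [n for n in nodes if n is not a and n is not b] + [merged]
    return nodes[0]

def cut_forest(node, max_height):
    """Split the tree into sub-trees whose merge height meets the criterion."""
    if node["height"] <= max_height:
        return [node]
    return [sub for child in node["children"]
            for sub in cut_forest(child, max_height)]

# Hypothetical documents: free-text tokens (input) and a numeric outcome (output).
docs = [
    {"tokens": {"pump", "leak", "seal"}, "value": 10.0},
    {"tokens": {"pump", "leak", "gasket"}, "value": 12.0},
    {"tokens": {"sensor", "fault", "wiring"}, "value": 95.0},
    {"tokens": {"sensor", "fault", "connector"}, "value": 100.0},
]
root = build_tree(docs)
forest = cut_forest(root, max_height=0.5)
print([cluster["members"] for cluster in forest])  # → [[0, 1], [2, 3]]
```

In this sketch each sub-tree returned by `cut_forest` is one cluster. A refinement step using further similarity measures, as the abstract mentions, could then be applied to each sub-tree separately.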
Publication number: WO2014150593(A1)
Publication date: 2014.09.25
Application number: WO2014US23729
Filing date: 2014.03.11
Applicant: ROBERT BOSCH GMBH; HEIT, JUERGEN; DEY, SANJOY; SRINIVASAN, SOUNDARARAJAN
Inventor: HEIT, JUERGEN; DEY, SANJOY; SRINIVASAN, SOUNDARARAJAN
Classification: G06F17/21; G06F17/30
Main classification: G06F17/21
Agency:
Agent:
Main claim:
Address: