发明名称 DATA CLUSTERING BASED ON VARIANT TOKEN NETWORKS
摘要 Received data records, each including one or more values in one or more fields, are processed to identify a matched data cluster. The processing includes: for selected data records, generating a query from one or more values; identifying one or more candidate data records from the received data records using the query; determining whether or not the selected data record satisfies a cluster membership criterion for at least one candidate data cluster of one or more existing data clusters containing the candidate records; and selecting the matched data cluster from among one or more candidate data clusters based at least in part on a growth criterion for the candidate data clusters, or initializing the matched data cluster with the selected data record if the selected data record does not satisfy a cluster membership criterion for any of the existing data clusters or based on a result of the growth criterion.
申请公布号 KR20140094003(A) 申请公布日期 2014.07.29
申请号 KR20147016338 申请日期 2012.11.15
申请人 AB INITIO TECHNOLOGY LLC 发明人 ANDERSON ARLEN
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址