Title of invention DATA CLUSTERING, SEGMENTATION, AND PARALLELIZATION
Abstract A first set of original records is processed by a first processing entity to generate a second set of records that includes the original records and one or more copies of each original record, each original record including one or more fields. The processing of each of at least some of the original records includes (402): generating at least one copy of the original record, and associating a first segment value with the original record and associating a second segment value with the copy. The method also includes (404) partitioning the second set of records among a plurality of recipient processing entities based on the segment values associated with the records in the second set, and, at each recipient processing entity, performing an operation based on one or more data values of the records received at the recipient processing entity to generate results.
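The flow described in the abstract (copy each record, associate a segment value with the original and another with the copy, partition by segment value, then operate at each recipient) can be illustrated with a minimal sketch. It is not the patented implementation. The record fields `first` and `last`, the choice of a first letter as the segment value, the CRC-based routing, and the counting operation are all hypothetical choices made only for illustration.

```python
import zlib
from collections import Counter, defaultdict


def tag_with_segments(originals):
    """Build the second set: every original plus one copy, each paired
    with a segment value. Using the first letter of a name field as the
    segment value is an assumption made for this example."""
    second_set = []
    for record in originals:
        copy = dict(record)  # the generated copy of the original record
        second_set.append((record["first"][0].lower(), record))  # first segment value
        second_set.append((record["last"][0].lower(), copy))     # second segment value
    return second_set


def partition(second_set, n_recipients):
    """Route each tagged record to a recipient chosen from its segment value.
    A CRC32 of the segment value is used so routing is deterministic
    (hypothetical routing rule)."""
    buckets = defaultdict(list)
    for segment, record in second_set:
        recipient = zlib.crc32(segment.encode("utf-8")) % n_recipients
        buckets[recipient].append((segment, record))
    return buckets


def recipient_operation(received):
    """Stand-in for the per-recipient operation: count the records
    received for each segment value."""
    return Counter(segment for segment, _ in received)


originals = [
    {"id": 1, "first": "Ann", "last": "Baker"},
    {"id": 2, "first": "Bob", "last": "Adams"},
]
second_set = tag_with_segments(originals)
buckets = partition(second_set, n_recipients=2)
results = {r: recipient_operation(recs) for r, recs in buckets.items()}
```

Because routing depends only on the segment value, all records sharing a segment value reach the same recipient, so in this sketch each recipient can work on its segments without consulting the others.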
Publication number CA2855701(A1) Publication date 2013.05.23
Application number CA20122855701 Filing date 2012.11.15
Applicant AB INITIO TECHNOLOGY LLC Inventor ANDERSON, ARLEN
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Main claim
Address