发明名称 SYSTEM, METHOD, AND COMPUTER-READABLE MEDIUM FOR DYNAMIC DETECTION AND MANAGEMENT OF DATA SKEW IN PARALLEL JOIN OPERATIONS
摘要 A system, method, and computer-readable medium for dynamic detection and management of data skew in parallel join operations are provided. Receipt of an excessive number of redistributed rows by a processing module is detected thereby identifying the processing module as a hot processing module. Other processing modules then terminate redistribution of rows to the hot processing module and maintain rows of a skewed table of the join operation that would be redistributed to the hot processing module in a local spool. Rows of a smaller table that would be redistributed to the hot processing module are duplicated to each processing module involved in the join operation. Rows of tables that are to be redistributed by a processing module to any processing module excluding the hot processing module are redistributed accordingly and maintained locally by the processing module. The join operation is completed by merging results of local join data sets of each processing module.
申请公布号 US2009299956(A1) 申请公布日期 2009.12.03
申请号 US20080130060 申请日期 2008.05.30
申请人 XU YU;KOSTAMAA PEKKA;SIREK MARK 发明人 XU YU;KOSTAMAA PEKKA;SIREK MARK
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址