发明名称 MAPREDUCE FOR DISTRIBUTED DATABASE PROCESSING
摘要 An input data set is treated as a plurality of grouped sets of key/value pairs, which enhances the utility of the MapReduce programming methodology. By utilizing such a grouping, map processing can be carried out independently on two or more related but possibly heterogeneous datasets (e.g., related by being characterized by a common primary key). The intermediate results of the map processing (key/value pairs) for a particular key can be processed together in a single reduce function by applying a different iterator to intermediate values for each group. Different iterators can be arranged inside reduce functions in ways however desired.
申请公布号 US2008086442(A1) 申请公布日期 2008.04.10
申请号 US20060539090 申请日期 2006.10.05
申请人 YAHOO! INC. 发明人 DASDAN ALI;YANG HUNG-CHIH;HSIAO RUEY-LUNG
分类号 G06F17/30;G06F7/00 主分类号 G06F17/30
代理机构 代理人
主权项
地址