Title of invention: DOUBLE MAP REDUCE DISTRIBUTED COMPUTING FRAMEWORK
Abstract: A method, apparatus, system, article of manufacture, and data structure provide the ability to perform a sorted map-reduce job on a cluster. A cluster of two or more computers is defined by installing a map-reduce framework onto each computer and formatting the cluster: identifying the cluster computers, establishing communication between them, and enabling the cluster to function as a unit. Data is placed into the cluster, where it is distributed so that each computer holds a portion of the data. In a first map function, each computer sorts its portion of the data and creates an abstraction that represents that data. The abstractions are exchanged and merged to create a complete abstraction. A second map function searches the complete abstraction to redistribute and exchange the data across the computers in the cluster. A reduce function is then performed in parallel to produce a result.
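The flow described in the abstract can be illustrated with a minimal single-process sketch. It is not the patented implementation: the abstract does not say what form the abstraction takes, so this sketch assumes it is a small list of evenly spaced sample keys, assumes the merged abstraction is turned into range split points, and uses a per-key sum as a stand-in reduce. All function names (`first_map`, `merge_abstractions`, `second_map`, `reduce_node`, `double_map_reduce`) are hypothetical, and the "computers" are simulated as Python lists.

```python
from bisect import bisect_right
from collections import defaultdict


def first_map(partition, samples_per_node):
    """First map: sort one node's records by key and build its abstraction.

    Assumption: the abstraction is a list of evenly spaced sample keys.
    """
    ordered = sorted(partition, key=lambda kv: kv[0])
    if not ordered:
        return ordered, []
    step = max(1, len(ordered) // samples_per_node)
    abstraction = [kv[0] for kv in ordered[::step]]
    return ordered, abstraction


def merge_abstractions(abstractions, num_nodes):
    """Merge the per-node abstractions and pick num_nodes - 1 split keys."""
    merged = sorted(key for a in abstractions for key in a)
    if not merged:
        return []
    return [merged[(i * len(merged)) // num_nodes] for i in range(1, num_nodes)]


def second_map(ordered, splits):
    """Second map: search the complete abstraction for each key's destination node."""
    outgoing = defaultdict(list)
    for key, value in ordered:
        outgoing[bisect_right(splits, key)].append((key, value))
    return outgoing


def reduce_node(records):
    """Stand-in reduce: sum the values for each key, emitted in key order."""
    totals = defaultdict(int)
    for key, value in records:
        totals[key] += value
    return sorted(totals.items())


def double_map_reduce(partitions, samples_per_node=4):
    """Run both map phases and the reduce over simulated nodes."""
    num_nodes = len(partitions)
    mapped = [first_map(p, samples_per_node) for p in partitions]
    splits = merge_abstractions([a for _, a in mapped], num_nodes)

    # Exchange step: every node sends each record to the node that owns its key range.
    inboxes = [[] for _ in range(num_nodes)]
    for ordered, _ in mapped:
        for dest, records in second_map(ordered, splits).items():
            inboxes[dest].extend(records)

    # Each node could reduce independently; here the nodes are visited in order.
    result = []
    for inbox in inboxes:
        result.extend(reduce_node(inbox))
    return result


if __name__ == "__main__":
    nodes = [[("b", 1), ("a", 2)], [("c", 3), ("a", 4)], [("d", 5), ("b", 6)]]
    print(double_map_reduce(nodes))
```

Because each node receives one contiguous key range in this sketch, concatenating the per-node reduce outputs in node order gives a globally sorted result, which is one way to read the "sorted map-reduce job" the abstract mentions.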
Publication number: US2011066649(A1) Publication date: 2011.03.17
Application number: US20100882058 Filing date: 2010.09.14
Applicant: MYSPACE, INC. Inventors: BERLYANT MIKHAIL; RULE DANIEL STEPHEN; MILLER CHRISTOPHER EDWARD; LOK CYNTHIA
Classification: G06F15/16; G06F17/30 Main classification: G06F15/16
Agency: Agent:
Independent claim:
Address: