Title: Data shuffling in a non-uniform memory access device
Abstract: A method of orchestrated shuffling of data in a non-uniform memory access device that includes a plurality of processing nodes includes running an application on a plurality of threads executing on the plurality of processing nodes and identifying data to be shuffled from source threads running on source processing nodes among the processing nodes to target threads executing on target processing nodes among the processing nodes. The method further includes generating a plan for orchestrating the shuffling of the data among all of the memory devices associated with the threads and shuffling the data among all of the memory devices based on the plan.
Publication number: US9274835(B2) Publication date: 2016.03.01
Application number: US201414147912 Filing date: 2014.01.06
Applicant: International Business Machines Corporation Inventors: Li Yinan; Lohman Guy M.; Mueller Rene; Pandis Ippokratis; Raman Vijayshankar
Classification: G06F9/46; G06F9/50 Main classification: G06F9/46
Agency: Cantor Colburn LLP Agent: Cantor Colburn LLP; Butler Bryan
Main claim: 1. A method of orchestrated shuffling of data in a non-uniform memory access device that includes a plurality of processing nodes, the method comprising: running an application on a plurality of threads executing on the plurality of processing nodes, wherein running the application includes dividing data on each thread into partitions according to a target thread on which the data is to be processed; identifying, by the threads, data to be shuffled from source threads running on source processing nodes among the processing nodes to target threads running on target processing nodes among the processing nodes; generating a plan for orchestrating shuffling of the data among a plurality of memory devices associated with the plurality of processing nodes, the plan including utilizing a data-shifting table to identify an order in which the data partitions are to be transferred from the source threads of the source processing nodes to the target threads of the target processing nodes; shuffling the data among the plurality of memory devices associated with the plurality of processing nodes based on the plan by transmitting data partitions from the plurality of source threads to the plurality of target threads according to the data-shifting table, wherein the data is transmitted simultaneously among the plurality of processing nodes; shifting the data-shifting table to associate each source thread with a different target thread; and transmitting another set of data partitions from the plurality of source threads to the plurality of target threads based on shifting the data-shifting table.
Address: Armonk, NY, US
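
The round-by-round transfer recited in the main claim can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions, not the patented implementation: the function name `shuffle_with_shifting_table`, the `partitions[src][dst]` layout, and the choice of a one-position rotation as the "shift" are all hypothetical, and the transfers within a round are performed sequentially here although the claim describes them as simultaneous.

```python
from typing import List, Tuple


def shuffle_with_shifting_table(
    partitions: List[List[List[str]]],
) -> Tuple[List[List[str]], List[List[int]]]:
    """Simulate a shuffle driven by a shifting table (hypothetical sketch).

    partitions[src][dst] holds the items that source thread `src` has
    prepared for target thread `dst`.
    Returns (received, schedule): received[dst] lists the items delivered to
    each target, and schedule[r][src] is the target of `src` in round r.
    """
    n = len(partitions)
    received: List[List[str]] = [[] for _ in range(n)]
    # Assumption: the table starts as the identity, so round 0 is a local copy.
    table = list(range(n))
    schedule: List[List[int]] = []

    for _ in range(n):
        schedule.append(list(table))
        # Each source sends one partition to a distinct target, so no target
        # is written by two sources in the same round.
        for src, dst in enumerate(table):
            received[dst].extend(partitions[src][dst])
        # Shift the table so every source is paired with a different target.
        table = table[1:] + table[:1]

    return received, schedule


# Usage: three threads, one labeled item per (source, target) pair.
parts = [[[f"{s}->{d}"] for d in range(3)] for s in range(3)]
received, schedule = shuffle_with_shifting_table(parts)
for round_no, row in enumerate(schedule):
    print(round_no, row)
```

Because each round's table is a permutation of the targets, every source and every target takes part in exactly one transfer per round, and after as many rounds as there are threads every partition has been delivered exactly once.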