Title of invention: REDUCING CROSS QUEUE SYNCHRONIZATION ON SYSTEMS WITH LOW MEMORY LATENCY ACROSS DISTRIBUTED PROCESSING NODES
Abstract: A method for efficient dispatch/completion of a work element within a multi-node data processing system. The method comprises: selecting specific processing units from among the processing nodes to complete execution of a work element that has multiple individual work items that may be independently executed by different ones of the processing units; generating an allocated processor unit (APU) bit mask that identifies at least one of the processing units that has been selected; placing the work element in a first entry of a global command queue (GCQ); associating the APU mask with the work element in the GCQ; and responsive to receipt at the GCQ of work requests from each of the multiple processing nodes or the processing units, enabling only the selected specific ones of the processing nodes or the processing units to retrieve work from the work element in the GCQ.
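Illustrative sketch (not part of the patent record): the steps in the abstract can be modeled in a few lines of Python. All names here (`make_apu_mask`, `WorkElement`, `GlobalCommandQueue`, `request_work`) are hypothetical and chosen only for illustration. The sketch is single-threaded and assumes processing units are identified by small integer indices; a real multi-node system would need atomic operations on the shared queue state.

```python
from dataclasses import dataclass, field


def make_apu_mask(selected_units):
    """Build a bit mask with one bit set per selected processing unit index."""
    mask = 0
    for unit in selected_units:
        mask |= 1 << unit
    return mask


@dataclass
class WorkElement:
    items: list    # independently executable work items
    apu_mask: int  # bit i set => processing unit i may retrieve items


@dataclass
class GlobalCommandQueue:
    entries: list = field(default_factory=list)

    def enqueue(self, items, selected_units):
        """Place a work element in the queue and associate its APU mask."""
        element = WorkElement(list(items), make_apu_mask(selected_units))
        self.entries.append(element)
        return element

    def request_work(self, unit):
        """Return the next work item this unit is allowed to take, else None."""
        for element in self.entries:
            allowed = element.apu_mask & (1 << unit)
            if allowed and element.items:
                return element.items.pop(0)
        return None


# Units 0 and 2 are selected; all four units then request work.
gcq = GlobalCommandQueue()
gcq.enqueue(["item0", "item1", "item2"], selected_units=[0, 2])
granted = [gcq.request_work(unit) for unit in (0, 1, 2, 3)]
```

In this sketch only the units whose bits are set in the mask receive work items; requests from units 1 and 3 return nothing, which corresponds to the gating behavior the abstract describes.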
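Publication number: US2011161975(A1) Publication date: 2011.06.30
Application number: US20090649667 Filing date: 2009.12.30
Applicant: IBM CORPORATION Inventors: ALEXANDER BENJAMIN G.; BELLOWS GREGORY H.; MADRUGA JOAQUIN; MINOR BARRY L.
Classification: G06F9/46; G06F9/308; G06F15/76 Main classification: G06F9/46
Agency: Agent:
Principal claim:
Address: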