发明名称 Runtime optimization of an application executing on a parallel computer
摘要 Identifying a collective operation within an application executing on a parallel computer; identifying a call site of the collective operation; determining whether the collective operation is root-based; if the collective operation is not root-based: establishing a tuning session and executing the collective operation in the tuning session; if the collective operation is root-based, determining whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes identified the collective operation at the same call site, establishing a tuning session and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.
申请公布号 US8893150(B2) 申请公布日期 2014.11.18
申请号 US201313767997 申请日期 2013.02.15
申请人 International Business Machines Corporation 发明人 Faraj Daniel A.;Smith Brian E.
分类号 G06F9/46;G06F15/16;G06F9/54;G06F9/52;G06F11/34 主分类号 G06F9/46
代理机构 Biggers Kennedy Lenart Spraggins LLP 代理人 Biggers Kennedy Lenart Spraggins LLP
主权项 1. A method of runtime optimization of an application executing on a parallel computer, the parallel computer having a plurality of compute nodes organized into a communicator, the method comprising: determining, by each compute node, whether a collective operation is root-based; if the collective operation is not root-based, establishing a tuning session administered by a self tuning module for the collective operation in dependence upon an identifier of a call site of the collective operation and executing the collective operation in the tuning session; if the collective operation is root-based, determining, through use of a single other collective operation, whether all compute nodes executing the application identified the collective operation at the same call site; if all compute nodes executing the application identified the collective operation at the same call site, establishing a tuning session administered by the self tuning module for the collective operation in dependence upon the identifier of the call site of the collective operation and executing the collective operation in the tuning session; and if all compute nodes executing the application did not identify the collective operation at the same call site, executing the collective operation without establishing a tuning session.
地址 Armonk NY US