Title of invention COOPERATIVE THREAD ARRAY REDUCTION AND SCAN OPERATIONS
Abstract One embodiment of the present invention sets forth a technique for performing aggregation operations across multiple threads that execute independently. Aggregation is specified as part of a barrier synchronization or barrier arrival instruction: in addition to performing the barrier synchronization or arrival, the instruction aggregates, using reduction or scan operations, the values supplied by each thread. When a thread executes the barrier aggregation instruction, the thread contributes to a scan or reduction result and waits to execute any further instructions until all of the threads have executed the barrier aggregation instruction. A reduction result is communicated to each thread after all of the threads have executed the barrier aggregation instruction, whereas a scan result is communicated to each thread as the thread executes the barrier aggregation instruction.
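The behavior described in the abstract can be illustrated with a small software model. The following is only a sketch under stated assumptions, not the patented hardware mechanism: the class name `AggregatingBarrier`, the method `arrive_and_wait`, and the helper `run_demo` are hypothetical names introduced here, addition is assumed as the aggregation operator, and the scan is assumed to be an exclusive prefix sum taken in arrival order.

```python
import threading


class AggregatingBarrier:
    """Hypothetical software model of a barrier that also aggregates values.

    Assumptions (not from the source): the operator is addition, and the
    scan is an exclusive prefix sum in the order threads arrive.
    """

    def __init__(self, parties):
        self._parties = parties
        self._cond = threading.Condition()
        self._count = 0        # threads that have arrived in this round
        self._running = 0      # running sum of contributed values
        self._generation = 0   # incremented each time the barrier releases
        self._result = 0       # reduction result of the last completed round

    def arrive_and_wait(self, value):
        """Contribute `value`, block until all parties arrive.

        Returns (scan, reduction): `scan` is captured at arrival time,
        `reduction` is only known once every thread has arrived.
        """
        with self._cond:
            scan = self._running          # scan value fixed on arrival
            self._running += value
            self._count += 1
            generation = self._generation
            if self._count == self._parties:
                # Last arrival: publish the reduction and release everyone.
                self._result = self._running
                self._running = 0
                self._count = 0
                self._generation += 1
                self._cond.notify_all()
            else:
                self._cond.wait_for(lambda: self._generation != generation)
            return scan, self._result


def run_demo(values):
    """Run one thread per value; return a list of (value, scan, reduction)."""
    barrier = AggregatingBarrier(len(values))
    results = [None] * len(values)

    def worker(index, value):
        scan, total = barrier.arrive_and_wait(value)
        results[index] = (value, scan, total)

    threads = [
        threading.Thread(target=worker, args=(i, v))
        for i, v in enumerate(values)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    for value, scan, total in run_demo([1, 2, 3, 4]):
        print(f"value={value} scan={scan} reduction={total}")
```

In this model every thread receives the same reduction result, while the scan value each thread receives depends on the order in which the threads happen to reach the barrier.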
Publication number US2014019724(A1) Publication date 2014.01.16
Application number US201314025482 Filing date 2013.09.12
Applicant NVIDIA CORPORATION Inventors FAHS BRIAN;SIU MING Y.;COON BRETT W.;NICKOLLS JOHN R.;NYLAND LARS
Classification G06F9/30 Main classification G06F9/30
Agency Agent
Main claim
Address