发明名称 USING A GLOBAL BARRIER TO SYNCHRONIZE ACROSS LOCAL THREAD GROUPS IN GENERAL PURPOSE PROGRAMMING ON GPU
摘要 Methods and systems may synchronize workloads across local thread groups. The methods and systems may provide for receiving, at a graphics processor, a workload from a host processor and receiving, at a plurality of processing elements, a plurality of threads that from one or more local thread groups. Additionally, the processing of the workload may be synchronized across the one or more thread groups. In one example, the global barrier determines that all threads across the thread groups have been completed without polling.
申请公布号 US2015187042(A1) 申请公布日期 2015.07.02
申请号 US201414563601 申请日期 2014.12.08
申请人 Intel Corporation 发明人 Gupta Niraj
分类号 G06T1/20;G06F9/38 主分类号 G06T1/20
代理机构 代理人
主权项 1. A system comprising: one or more transceivers; a host processor in communication with the one or more transceivers; a system memory associated with the host processor; a processor, in communication with the system memory, to receive a workload from the host processor and including: a plurality of processing elements to receive at least one of a plurality of threads, the plurality of threads to form one or more thread groups, and a global barrier in communication with the plurality of processing elements, the global barrier to synchronize the processing of the workload across the one or more thread groups.
地址 Santa Clara CA US