发明名称 System, method, and computer program product for collecting execution statistics for graphics processing unit workloads
摘要 A system, method, and computer program product are provided for collecting trace information based on a computational workload. The method includes the steps of compiling source code to generate a program, launching a workload to be executed by the parallel processing unit, collecting one or more records of trace information associated with a plurality of threads configured to execute the program, and correlating the one or more records to one or more corresponding instructions included in the source code. Each record in the one or more records includes at least a value of a program counter and a scheduler state of the thread.
申请公布号 US9535815(B2) 申请公布日期 2017.01.03
申请号 US201414296311 申请日期 2014.06.04
申请人 NVIDIA Corporation 发明人 Smith Gregory Paul;Nyland Lars Siegfried
分类号 G06F9/44;G06F11/36;G06F9/46;G06F9/50 主分类号 G06F9/44
代理机构 Zilka-Kotab, PC 代理人 Zilka-Kotab, PC
主权项 1. A method comprising: compiling source code to generate a program; launching a workload to be executed by a parallel processing unit, wherein the workload includes one or more tasks to be executed by the parallel processing unit, and at least one task of the one or more tasks executes a thread block configured to execute the program; collecting one or more records of trace information associated with a plurality of threads configured to execute the program; and correlating the one or more records to one or more corresponding instructions included in the source code, wherein each record in the one or more records includes a value of a program counter, a thread block identifier, and a scheduler state that comprises a stall vector having at least two bits, each bit in the at least two bits representing a different reason for a thread block to be stalled.
地址 Santa Clara CA US