发明名称 |
STREAM-BASED ACCELERATOR PROCESSING OF COMPUTATIONAL GRAPHS |
摘要 |
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for receiving, by a computational graph system, a request to process a computational graph; obtaining data representing a subgraph of the computational graph, the computational graph comprising a plurality of nodes and directed edges, wherein each node represents a respective operation, wherein each directed edge connects a respective first node to a respective second node, the subgraph assigned to a first device by a placer in the computational graph system; determining that the first device comprises a hardware accelerator having a plurality of streams; in response to determining, generating instructions that when executed by the first device cause the first device to: assign the operation represented by each node in the subgraph to a respective stream; and perform the operations represented by the nodes in the subgraph in accordance with the assignment. |
申请公布号 |
US2017124451(A1) |
申请公布日期 |
2017.05.04 |
申请号 |
US201615336673 |
申请日期 |
2016.10.27 |
申请人 |
Google Inc. |
发明人 |
Barham Paul Ronald;Vasudevan Vijay |
分类号 |
G06N3/08;G06N99/00 |
主分类号 |
G06N3/08 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method comprising:
receiving, by a computational graph system, a request to process a computational graph; obtaining data representing a subgraph of the computational graph, the computational graph comprising a plurality of nodes and directed edges, wherein each node represents a respective operation, wherein each directed edge connects a respective first node to a respective second node that represents an operation that receives, as input, an output of an operation represented by the respective first node, the subgraph assigned to a first device by a placer in the computational graph system; determining that the first device comprises a hardware accelerator having a plurality of streams; in response to determining that the first device comprises a hardware accelerator having a plurality of streams, generating instructions that when executed by the first device cause the first device to:
assign the operation represented by each node in the subgraph to a respective stream in the plurality of streams of the hardware accelerator; andperform the operations represented by the nodes in the subgraph in accordance with the assignment; and providing the instructions and the data to the first device. |
地址 |
Mountain View CA US |