发明名称 STREAM-BASED ACCELERATOR PROCESSING OF COMPUTATIONAL GRAPHS
摘要 Methods, systems, and apparatus, including computer programs encoded on computer storage media, for receiving, by a computational graph system, a request to process a computational graph; obtaining data representing a subgraph of the computational graph, the computational graph comprising a plurality of nodes and directed edges, wherein each node represents a respective operation, wherein each directed edge connects a respective first node to a respective second node, the subgraph assigned to a first device by a placer in the computational graph system; determining that the first device comprises a hardware accelerator having a plurality of streams; in response to determining, generating instructions that when executed by the first device cause the first device to: assign the operation represented by each node in the subgraph to a respective stream; and perform the operations represented by the nodes in the subgraph in accordance with the assignment.
申请公布号 US2017124451(A1) 申请公布日期 2017.05.04
申请号 US201615336673 申请日期 2016.10.27
申请人 Google Inc. 发明人 Barham Paul Ronald;Vasudevan Vijay
分类号 G06N3/08;G06N99/00 主分类号 G06N3/08
代理机构 代理人
主权项 1. A method comprising: receiving, by a computational graph system, a request to process a computational graph; obtaining data representing a subgraph of the computational graph, the computational graph comprising a plurality of nodes and directed edges, wherein each node represents a respective operation, wherein each directed edge connects a respective first node to a respective second node that represents an operation that receives, as input, an output of an operation represented by the respective first node, the subgraph assigned to a first device by a placer in the computational graph system; determining that the first device comprises a hardware accelerator having a plurality of streams; in response to determining that the first device comprises a hardware accelerator having a plurality of streams, generating instructions that when executed by the first device cause the first device to: assign the operation represented by each node in the subgraph to a respective stream in the plurality of streams of the hardware accelerator; andperform the operations represented by the nodes in the subgraph in accordance with the assignment; and providing the instructions and the data to the first device.
地址 Mountain View CA US
您可能感兴趣的专利