发明名称 |
DEEP NEURAL NETWORK PARTITIONING ON SERVERS |
摘要 |
A method is provided for implementing a deep neural network on a server component that includes a host component including a CPU and a hardware acceleration component coupled to the host component. The deep neural network includes a plurality of layers. The method includes partitioning the deep neural network into a first segment and a second segment, the first segment including a first subset of the plurality of layers, the second segment including a second subset of the plurality of layers, configuring the host component to implement the first segment, and configuring the hardware acceleration component to implement the second segment. |
申请公布号 |
US2016379108(A1) |
申请公布日期 |
2016.12.29 |
申请号 |
US201514754384 |
申请日期 |
2015.06.29 |
申请人 |
Microsoft Technology Licensing, LLC |
发明人 |
Chung Eric;Strauss Karin;Ovtcharov Kalin;Kim Joo-Young;Ruwase Olatunji |
分类号 |
G06N3/04 |
主分类号 |
G06N3/04 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method for implementing a deep neural network on a server component that comprises a host component including a CPU and a hardware acceleration component coupled to the host component, the deep neural network comprising a plurality of layers, the method comprising:
partitioning the deep neural network into a first segment and a second segment, the first segment comprising a first subset of the plurality of layers, the second segment comprising a second subset of the plurality of layers; configuring the host component to implement the first segment; and configuring the hardware acceleration component to implement the second segment. |
地址 |
Redmond WA US |