发明名称 DEEP NEURAL NETWORK PARTITIONING ON SERVERS
摘要 A method is provided for implementing a deep neural network on a server component that includes a host component including a CPU and a hardware acceleration component coupled to the host component. The deep neural network includes a plurality of layers. The method includes partitioning the deep neural network into a first segment and a second segment, the first segment including a first subset of the plurality of layers, the second segment including a second subset of the plurality of layers, configuring the host component to implement the first segment, and configuring the hardware acceleration component to implement the second segment.
申请公布号 US2016379108(A1) 申请公布日期 2016.12.29
申请号 US201514754384 申请日期 2015.06.29
申请人 Microsoft Technology Licensing, LLC 发明人 Chung Eric;Strauss Karin;Ovtcharov Kalin;Kim Joo-Young;Ruwase Olatunji
分类号 G06N3/04 主分类号 G06N3/04
代理机构 代理人
主权项 1. A method for implementing a deep neural network on a server component that comprises a host component including a CPU and a hardware acceleration component coupled to the host component, the deep neural network comprising a plurality of layers, the method comprising: partitioning the deep neural network into a first segment and a second segment, the first segment comprising a first subset of the plurality of layers, the second segment comprising a second subset of the plurality of layers; configuring the host component to implement the first segment; and configuring the hardware acceleration component to implement the second segment.
地址 Redmond WA US