发明名称 Vector processing in an active memory device
摘要 Embodiments relate to vector processing in an active memory device. An aspect includes a system for vector processing in an active memory device. The system includes memory in the active memory device and a processing element in the active memory device. The processing element is configured to perform a method including decoding an instruction with a plurality of sub-instructions to execute in parallel. An iteration count to repeat execution of the sub-instructions in parallel is determined. Execution of the sub-instructions is repeated in parallel for multiple iterations, by the processing element, based on the iteration count. Multiple locations in the memory are accessed in parallel based on the execution of the sub-instructions.
申请公布号 US9535694(B2) 申请公布日期 2017.01.03
申请号 US201213569359 申请日期 2012.08.08
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 Fleischer Bruce M.;Fox Thomas W.;Jacobson Hans M.;Nair Ravi;Prener Daniel A.
分类号 G06F15/00;G06F1/04;G06F9/30;G06F9/38;G06F15/80 主分类号 G06F15/00
代理机构 Cantor Colburn LLP 代理人 Cantor Colburn LLP
主权项 1. A system for vector processing in an active memory device, the system comprising: a memory in the active memory device; and a processing element in the active memory device, the processing element configured to perform a method comprising: decoding, in the processing element, an instruction comprising a plurality of sub-instructions to execute in parallel;determining an iteration count to repeat execution of the sub-instructions in parallel based on decoding an iteration count source field of the instruction that defines whether to set the iteration count based on an iteration count field of the instruction or based on an iteration count register;repeating execution of the sub-instructions in parallel for multiple iterations, by the processing element, based on the iteration count;accessing multiple locations in the memory in parallel based on the execution of the sub-instructions;identifying a lane control sub-instruction in the instruction based on the decoding of the instruction, the lane control sub-instruction controlling a sequence of instruction execution and positioned in parallel with the sub-instructions to execute in parallel; andexecuting the lane control sub-instruction, by the processing element, only once after execution of the sub-instructions is performed in parallel for multiple iterations.
地址 Armonk NY US