Title of invention: Method and apparatus for modulo scheduled loop execution in a processor architecture
Abstract: A processor method and apparatus that overlap the execution of multiple iterations of a loop while allowing the compiler to include only a single copy of the loop body in the code; the hardware automatically manages which iterations are active. Because the prologue and epilogue are implicitly created and maintained within the hardware, a significant reduction in code size can be achieved compared to software-only modulo scheduling. Loops with iteration counts less than the number of concurrent iterations present in the kernel are also handled automatically. This hardware-enhanced scheme achieves the same performance as the fully specified standard method. The hardware also reduces the power requirement, because the entire fetch unit can be deactivated for a portion of the loop's execution. The basic design places a plurality of buffers for storing loop instructions in the dispatch stage of a processor, each buffer being associated with an instruction decoder and its respective functional unit. Control logic receives loop setup parameters and controls the selective issue of instructions from the buffers to the functional units.
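Publication number: US7302557(B1) Publication date: 2007.11.27
Application number: US20000728441 Filing date: 2000.12.01
Applicant: IMPACT TECHNOLOGIES, INC. Inventors: HWU WEN-MEI W.; MERTEN MATTHEW C.
Classification: G06F9/30; G06F9/45 Main classification: G06F9/30
Agency: Agent:
Main claim:
Address: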
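
The overlapped execution described in the abstract can be illustrated with a small software model. This is only a sketch under stated assumptions, not the patented hardware: it assumes the loop body is split into `num_stages` equal stages, that a new iteration enters the kernel on every pass, and that the control logic simply suppresses any stage whose iteration does not exist. The function name `kernel_only_schedule` and its parameters are hypothetical and do not come from the patent.

```python
def kernel_only_schedule(num_iterations, num_stages):
    """Model which (stage, iteration) pairs issue on each pass over a single kernel copy.

    Assumption: iteration i enters stage 0 on pass i and advances one stage per pass,
    so stage s on pass p works on iteration p - s. Stages whose iteration index falls
    outside [0, num_iterations) are suppressed, which is what forms the implicit
    prologue (pipeline filling) and epilogue (pipeline draining).
    """
    if num_iterations <= 0 or num_stages <= 0:
        return []
    total_passes = num_iterations + num_stages - 1
    schedule = []
    for p in range(total_passes):
        active = []
        for s in range(num_stages):
            iteration = p - s
            if 0 <= iteration < num_iterations:
                active.append((s, iteration))
        schedule.append(active)
    return schedule


if __name__ == "__main__":
    # A loop with fewer iterations (2) than kernel stages (3): the kernel is
    # never completely full, yet the same single copy of the body is reused.
    for p, active in enumerate(kernel_only_schedule(2, 3)):
        print(p, active)
```

In this model the short-loop case mentioned in the abstract needs no special handling: with fewer iterations than stages, some stages are simply inactive on every pass.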