发明名称 |
Systems, methods, and apparatuses to decompose a sequential program into multiple threads, execute said threads, and reconstruct the sequential execution |
摘要 |
Systems, methods, and apparatuses for decomposing a sequential program into multiple threads, executing these threads, and reconstructing the sequential execution of the threads are described. A plurality of data cache units (DCUs) store locally retired instructions of speculatively executed threads. A merging level cache (MLC) merges data from the lines of the DCUs. An inter-core memory coherency module (ICMC) globally retire instructions of the speculatively executed threads in the MLC. |
申请公布号 |
US8909902(B2) |
申请公布日期 |
2014.12.09 |
申请号 |
US200912624804 |
申请日期 |
2009.11.24 |
申请人 |
Intel Corporation |
发明人 |
Latorre Fernando;Codina Josep M.;Codina Enric Gibert;Lopez Pedro;Madriles Carlos;Vincente Alejandro Martinez;Martinez Raul;Gonzalez Antonio |
分类号 |
G06F9/40 |
主分类号 |
G06F9/40 |
代理机构 |
Blakely, Sokoloff, Taylor & Zafman LLP |
代理人 |
Blakely, Sokoloff, Taylor & Zafman LLP |
主权项 |
1. An apparatus comprising:
a plurality of data cache units (DCUs) to store locally retired instructions of speculatively executed threads, wherein the DCUs include a version bit per line to classify an speculative update status of each line; a merging level cache (MLC) coupled to the DCUs to merge data from the lines of the DCUs, wherein the MLC includes a speculative bit per cache line to indicate that a cache line contains a speculative state and last version bits to indicate which core of a plurality of cores that made a last change to a chunk, wherein a chunk is a granularity at which memory disambiguation between two speculative threads is detectable; and a inter-core memory coherency module (ICMC) to globally retire instructions of the speculatively executed threads in the MLC. |
地址 |
Santa Clara CA US |