发明名称 Local and global memory request predictor
摘要 A method, circuit arrangement, and design structure utilize broadcast prediction data to determine whether to globally broadcast a memory request in a computing system of the type that includes a plurality of nodes, each node including a plurality of processing units. The method includes updating broadcast prediction data for a cache line associated with a first memory request within a hardware-based broadcast prediction data structure in turn associated with a first processing unit in response to the first memory request, the broadcast prediction data for the cache line including data associated with a history of ownership of the cache line. The method further comprises accessing the broadcast prediction data structure and determining whether to perform an early broadcast of a second memory request to a second node based on broadcast prediction data within the broadcast prediction data structure in response to that second memory request associated with the cache line.
申请公布号 US8874853(B2) 申请公布日期 2014.10.28
申请号 US201012793795 申请日期 2010.06.04
申请人 International Business Machines Corporation 发明人 Cantin Jason F.;Kunkel Steven R.
分类号 G06F12/00;G06F12/08 主分类号 G06F12/00
代理机构 Wood, Herron & Evans, LLP 代理人 Wood, Herron & Evans, LLP
主权项 1. A method of determining whether to globally broadcast a memory request in a computing system of the type that includes a plurality of nodes, wherein each node includes a plurality of processing units, the method comprising, in a first node among the plurality of nodes: in response to a first memory request, updating broadcast prediction data for a cache line associated with the first memory request within a hardware-based broadcast prediction data structure in turn associated with a first processing unit among the plurality of processing units, the broadcast prediction data for the cache line including data associated with a history of ownership of the cache line, wherein updating the broadcast prediction data for the cache line associated with the first memory request includes updating the broadcast prediction data for the cache line to indicate that the cache line is in one of at least three states; and in response to a second memory request that is associated with the cache line, accessing the broadcast prediction data structure and determining whether to perform an early broadcast of the second memory request to a second node among the plurality of nodes based on broadcast prediction data within the broadcast prediction data structure.
地址 Armonk NY US