发明名称 Implementing memory performance management and enhanced memory reliability accounting for thermal conditions
摘要 A method, system and computer program product implement memory performance management and enhanced memory reliability of a computer system accounting for system thermal conditions. When a primary memory temperature reaches an initial temperature threshold, reads are suspended to the primary memory and reads are provided to a mirrored memory in a mirrored memory pair, and writes are provided to both the primary memory and the mirrored memory. If the primary memory temperature reaches a second temperature threshold, write operations to the primary memory are also stopped and the primary memory is turned off with DRAM power saving modes such as self timed refresh (STR), and the reads and writes are limited to the mirrored memory in the mirrored memory pair. When the primary memory temperature decreases to below the initial temperature threshold, coherency is recovered by writing a coherent copy from the mirrored memory to the primary memory.
申请公布号 US9442816(B2) 申请公布日期 2016.09.13
申请号 US201113307149 申请日期 2011.11.30
申请人 International Business Machines Corporation 发明人 Cordero Edgar R.;Dell Timothy J.;Henderson Joab D.;Lingambudi Anil B.;Paulraj Girisankar;Vidyapoornachary Diyanesh B.
分类号 G06F13/00;G06F11/30;G06F11/16 主分类号 G06F13/00
代理机构 代理人 Pennington Joan
主权项 1. A method for implementing memory performance management and enhanced memory reliability of a computer system comprising: monitoring temperature of a memory, monitoring temperature of the memory includes identifying a thermal profile of the memory; identifying a first temperature threshold for a primary memory being exceeded, responsively suspending reads to the primary memory and performing reads to a mirrored memory in a mirrored memory pair; responsive to detecting an error from the mirrored memory during the first temperature for the primary memory being exceeded, temporarily performing reads to the primary memory for recovering correct data, and performing writes to both the primary memory and the mirrored memory; identifying a second temperature threshold for the primary memory, turning off the primary memory in response to said second temperature threshold greater than said first temperature threshold, and responsively limiting reads and writes to the mirrored memory in the mirrored memory pair; and identifying temperature decrease below said first temperature threshold for the primary memory, and responsively enabling the primary memory, and performing a coherent copy of unavailable data from mirrored memory to the enabled primary memory, recovering coherency of the primary memory and the mirrored memory, and recovering full operation of mirrored memory and primary memory, start mirroring and continue temperature monitoring in response to recovering coherency of the primary memory and the mirrored memory completed.
地址 Armonk NY US