发明名称 Preventing unrecoverable errors during a disk regeneration in a disk array
摘要 Exemplary embodiments of the present invention disclose a method and system for reducing a probability of generating an unrecoverable error on a disk array during a disk rebuild. In a step, an exemplary embodiment identifies a disk to be replaced in the disk array, the disk array including a spare disk. In another step, an exemplary embodiment locates a region in the disk array that incurs a high number of reads and writes during a period prior to replacing the disk in the disk array. In another step, an exemplary embodiment scrubs data in a region in the disk array that has incurred a high number of accesses. In another step, an exemplary embodiment replaces the disk identified to be replaced with the spare disk in the disk array. In another step, an exemplary embodiment rebuilds data on the replaced disk on the spare disk in the disk array.
申请公布号 US9104604(B2) 申请公布日期 2015.08.11
申请号 US201313776904 申请日期 2013.02.26
申请人 International Business Machines Corporation 发明人 Cooper Alastair G.;Groseclose, Jr. Michael R.;Kahler David R.;Lovrien Kurt A.
分类号 G06F11/10 主分类号 G06F11/10
代理机构 代理人 Gooshaw Isaac J.
主权项 1. A method for reducing a probability of generating an unrecoverable error on a disk array during a disk rebuild, the method comprising the steps of: generating, by one or more processors, a record of reads and writes being made to a plurality of disks included in a disk array, wherein the record indicates a hot spot included in a first disk of the plurality of disks; identifying, by one or more processors, a second disk in the disk array that has incurred a threshold number of correctable errors that dictate that the second disk be replaced with a spare disk included in the disk array; determining, by one or more processors, whether the hot spot included in the first disk includes data that will be used to rebuild the second disk using the spare disk; responsive to an occurrence of both i) an identification of a second disk in the disk array that has met the threshold number of correctable errors that dictate that the second disk be replaced with a spare disk and ii) a determination that the hot spot included in the first disk includes data that will be used to rebuild the second disk using the spare disk, initiating, by one or more processors, scrubbing of data in a region of the first disk that includes the hot spot, wherein scrubbing of data in the region of the first disk reduces a probability of generating an unrecoverable error on the disk array during a rebuild of the second disk using data of the first disk; replacing, by one or more processors, the second disk with the spare disk; and rebuilding, by one or more processors, data included in the second disk on the spare disk using data included on the first disk that was scrubbed.
地址 Armonk NY US