Title of invention MANAGING DATA SETS OF A STORAGE SYSTEM
Abstract A method, system, and computer program product for managing data sets of a storage facility is disclosed. The method, system, and computer program product include determining, by analyzing a first data set, that the first data set includes a first record having padded data. To identify the padded data, the method, system, and computer program product include comparing at least a portion of the first record of the first data set with a second record of a second data set. Next, the method, system, and computer program product include removing, from the first record of the first data set, the padded data.
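The compare-then-remove idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name `remove_padding`, the use of `bytes` for records, and the rule that padding is a trailing run of one repeated byte are all assumptions made for illustration.

```python
def remove_padding(first_record: bytes, second_record: bytes) -> bytes:
    """Strip the trailing part of first_record that second_record lacks.

    Assumption (not from the source): padding is a trailing run of a
    single repeated byte that follows a full copy of second_record.
    """
    # Length of the prefix shared by both records.
    shared = 0
    for a, b in zip(first_record, second_record):
        if a != b:
            break
        shared += 1

    tail = first_record[shared:]
    # Treat the tail as padding only if the whole second record matched
    # and the tail consists of one repeated byte value.
    if shared == len(second_record) and tail and len(set(tail)) == 1:
        return first_record[:shared]
    return first_record


padded = b"ALICE     "      # hypothetical fixed-length record
reference = b"ALICE"        # hypothetical unpadded counterpart
print(remove_padding(padded, reference))
```

A tail that contains more than one distinct byte is left alone, so real data that happens to follow the shared prefix is not deleted.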
Publication number US2016224597(A1) Publication date 2016.08.04
Application number US201615133674 Filing date 2016.04.20
Applicant International Business Machines Corporation Inventors Chauvet Philip R.; McCune Franklin E.; Reed David C.; Smith Max D.
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Principal claim 1. A computer-implemented method for managing data sets of a storage system, the method comprising:
  determining, by analyzing a first data set, that the first data set includes a first record, wherein the determining, by analyzing the first data set, that the first data set includes the first record includes:
    determining that the first record contains a padded data set,
    determining that the first record has been converted to a fixed length record from a variable length record,
    determining that the first data set is without a backup data set, and
    scanning at least the portion of the first record to resolve a character pattern, wherein the scanning includes scanning from a back end of the first record toward a front end of the first record until the character pattern stops;
  comparing, to identify the padded data set, at least a portion of the first record of the first data set with a second record of a second data set, wherein the comparing, to identify the padded data set, at least the portion of the first record of the first data set with the second record of the second data set includes:
    comparing the character pattern of the first record of the first data set with the second record of the second data set, wherein the second data set is the first data set and the first record is different from the second record, and
    storing the character pattern and determining that a mask derived from the character pattern matches at least a segment of a subsequent record of the first data set,
    searching, using a key from the second data set, the first data set for the key,
    determining the key in the second record matches a like key in the first record,
    scanning from the back end of the first record toward the front end of the first record to resolve the character pattern configured to identify the padded data set as the segment that mismatches the second record, and
    storing the mask derived from the character pattern to identify the padded data set;
  removing, from the first record of the first data set, the padded data set identified in response to comparing at least the portion of the first record of the first data set with the second record of the second data set, wherein the removing includes:
    deleting a segment of the first record matching a mask derived from a character pattern, and
    updating a record length for the first record;
  storing an original file, with padded data, as a retained file with the first data set; and
  storing the temporary file, without padded data, with a name of the original file.
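The scanning, mask, and record-length steps recited in the claim can be sketched roughly as follows in Python. This is an illustrative reading, not the claimed implementation: the `Record` class, the `min_run` threshold, and the choice to derive the mask as the single repeated pad byte are all assumptions that the claim does not specify.

```python
from dataclasses import dataclass


@dataclass
class Record:
    data: bytes
    length: int  # stored record length, kept in sync with data


def resolve_pattern(data: bytes, min_run: int = 2) -> bytes:
    """Scan from the back end toward the front until the repeated byte stops.

    min_run is a hypothetical guard so that a single trailing byte of
    real data is not mistaken for padding.
    """
    if not data:
        return b""
    pad = data[-1]
    i = len(data)
    while i > 0 and data[i - 1] == pad:
        i -= 1
    run = data[i:]
    return run if len(run) >= min_run else b""


def derive_mask(pattern: bytes) -> bytes:
    """Assumed mask: the one byte value that makes up the pattern."""
    return pattern[:1]


def strip_with_mask(record: Record, mask: bytes) -> Record:
    """Delete the trailing segment matching the mask and update the length."""
    if not mask:
        return record
    trimmed = record.data.rstrip(mask)
    return Record(data=trimmed, length=len(trimmed))


# Resolve the pattern once on a first record, store the mask,
# then apply it to a subsequent record of the same data set.
first = Record(b"ALICE\x00\x00\x00", 8)
mask = derive_mask(resolve_pattern(first.data))
later = strip_with_mask(Record(b"BOB\x00\x00\x00\x00\x00", 8), mask)
print(later)
```

Storing the mask means later records do not need a fresh comparison against a second record. The sketch omits the key lookup, the backup check, and the retained-file and rename steps that the claim also recites.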
Address Armonk NY US