发明名称 MANAGING DATA SETS OF A STORAGE SYSTEM
摘要 A method, system, and computer program product for managing data sets of a storage facility is disclosed. The method, system, and computer program product include determining, by analyzing a first data set, that the first data set includes a first record having padded data. To identify the padded data, the method, system, and computer program product include comparing at least a portion of the first record of the first data set with a second record of a second data set. Next, the method, system, and computer program product include removing, from the first record of the first data set, the padded data.
申请公布号 US2017068699(A1) 申请公布日期 2017.03.09
申请号 US201615352247 申请日期 2016.11.15
申请人 International Business Machines Corporation 发明人 Chauvet Philip R.;McCune Franklin E.;Reed David C.;Smith Max D.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A system for managing data sets in a storage facility, comprising: a remote device; and a host device, at least one of the remote device and the host device including a managing module, the managing module comprising: a determining module to determine, by analyzing a first data set, that the first data set includes a first record, wherein the determining, by analyzing the first data set, that the first data set includes the first record includes: determine that the first record contains a padded data set,determine that the first record has been converted to a fixed length record from a variable length record,determine that the first data set is without a backup data set, andscan at least the portion of the first record to resolve a character pattern, wherein the scanning includes scanning from a back end of the first record toward a front end of the first record until the character pattern stops;a comparing module to compare, to identify the padded data set, at least a portion of the first record of the first data set with a second record of a second data set, wherein the comparing, to identify the padded data set, at least the portion of the first record of the first data set with the second record of the second data set includes: compare the character pattern of the first record of the first data set with the second record of the second data set, wherein the second data set is the first data set and the first record is different from the second record, andstore the character pattern and determining that a mask derived from the character pattern matches at least a segment of a subsequent record of the first data set,search, using a key from the second data set, the first data set for the key,determine the key in the second record matches a like key in the first record,scan from the back end of the first record toward the front end of the first record to resolve the character pattern configured to identify the padded data set as the segment that mismatches the second record, andstore the mask derived from the character pattern to identify the padded data set;a removing module to remove, from the first record of the first data set, the padded data set identified in response to comparing at least the portion of the first record of the first data set with the second record of the second data set, wherein the removing includes: delete a segment of the first record matching a mask derived from a character pattern, andupdate a record length for the first record;store an original file, with padded data, as a retained file with the first data set; andstoring the temporary file, without padded data, with a name of the original file.
地址 Armonk NY US