Abstract |
A method of generating a test database from a deployed database by “sanitizing”, or removing sensitive data, is employed for recreating problem scenarios from a customer site, verifying compatibility of customer data with database version updates, and performance testing using actual, rather than generated, database structures. In database management operations, it can be beneficial to use data generated from actual operational scenarios rather than artificially generated data created from a test pattern. Data generated from actual operation, such as at a customer site, assures compatibility with the relation patterns and record volumes employed by the customer (user). However, databases often contain sensitive information that would be inappropriate, illegal, or vulnerable in a testing environment. Accordingly, the generated test database overwrites sensitive data values with benign, or generic, values while preserving the structure and relations of the data stored. |
Independent claim |
1. A method of sanitizing a database comprising:
identifying tables of sensitive data; identifying columns of entries in the identified tables having sensitive entries; defining, for each column in the identified columns, a scrubbing operation for overwriting sensitive data in the respective column; identifying relations between the identified tables, the relations identifying corresponding values between different tables; computing an ordering of the columns, the ordering indicative of dependencies between the tables defined by the relations; and performing, for each identified column, the defined scrubbing operation on the entries in the identified column, the scrubbing operations performed according to the computed ordering. |
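
The steps of claim 1 can be illustrated with a minimal Python sketch. It is not the claimed implementation: the table names, column names, sample rows, and the helpers `compute_ordering` and `sanitize` are all hypothetical, and the choice of a topological sort for the ordering and a shared value mapping for related columns is an assumption about one way to keep corresponding values aligned across tables.

```python
import copy
from graphlib import TopologicalSorter  # Python 3.9+

# Hypothetical deployed data: two tables related through the customer's SSN.
deployed = {
    "customers": [
        {"id": 1, "ssn": "111-22-3333", "name": "Alice Smith"},
        {"id": 2, "ssn": "444-55-6666", "name": "Bob Jones"},
    ],
    "claims": [
        {"claim_id": 10, "customer_ssn": "444-55-6666", "amount": 250},
        {"claim_id": 11, "customer_ssn": "111-22-3333", "amount": 990},
    ],
}

# Identified sensitive columns, as (table, column) pairs.
columns = [
    ("customers", "ssn"),
    ("customers", "name"),
    ("claims", "customer_ssn"),
]

# Relations: dependent column -> the column whose values it corresponds to.
relations = {("claims", "customer_ssn"): ("customers", "ssn")}

# One scrubbing operation per column; each builds a generic value from a counter.
scrubbers = {
    ("customers", "ssn"): lambda n: f"000-00-{n:04d}",
    ("customers", "name"): lambda n: f"Customer {n}",
    ("claims", "customer_ssn"): lambda n: f"000-00-{n:04d}",
}


def compute_ordering(columns, relations):
    """Order columns so a referenced column is scrubbed before its dependents."""
    sorter = TopologicalSorter()
    for col in columns:
        parent = relations.get(col)
        if parent is None:
            sorter.add(col)
        else:
            sorter.add(col, parent)  # parent must come first
    return list(sorter.static_order())


def sanitize(database, columns, relations, scrubbers):
    """Return a scrubbed copy of the database; the input is left untouched."""
    result = copy.deepcopy(database)
    mappings = {}  # column -> {original value: replacement value}
    for col in compute_ordering(columns, relations):
        table, field = col
        parent = relations.get(col)
        # A dependent column shares its parent's mapping, so values that
        # matched before scrubbing still match afterwards.
        mapping = mappings[parent] if parent is not None else {}
        mappings[col] = mapping
        for row in result[table]:
            original = row[field]
            if original not in mapping:
                mapping[original] = scrubbers[col](len(mapping))
            row[field] = mapping[original]
    return result


test_db = sanitize(deployed, columns, relations, scrubbers)
```

In this sketch the ordering matters because `claims.customer_ssn` reuses the mapping built while scrubbing `customers.ssn`; scrubbing the columns independently would break the correspondence between the two tables. Non-sensitive columns such as `amount` are copied unchanged, so record counts and structure are preserved.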