Main claim |
1. A method, comprising:
selecting a subset of data from a data set stored in a data repository;
identifying a format in which values in the subset of data are represented by comparing a structure of at least one value in the subset of data to a plurality of known data patterns;
inferring a candidate semantic type of data in the data set based on the identified format;
validating values of the data in the data set against a set of domain rules, the set of domain rules establishing valid data values for the candidate semantic type; and
presenting invalid values to a user, the invalid values representing values of the data in the data set that are disallowed by the set of domain rules;
at least one of the selecting, identifying, inferring, validating and presenting steps being performed by at least one processor. |
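
The five steps recited above can be illustrated with a minimal Python sketch. It is not the claimed implementation. Every name in it (`KNOWN_PATTERNS`, `FORMAT_TO_TYPE`, `DOMAIN_RULES`, `select_subset`, `identify_format`, `find_invalid`, `profile`, `present_invalid`) is hypothetical, the data set is assumed to be an in-memory list of strings rather than a data repository, and the social security number rules are simplified stand-ins for a real set of domain rules.

```python
import re

# Hypothetical "known data patterns": format name -> regex for the value structure.
KNOWN_PATTERNS = {
    "ddd-dd-dddd": re.compile(r"[0-9]{3}-[0-9]{2}-[0-9]{4}"),
    "ddddd": re.compile(r"[0-9]{5}"),
}

# Hypothetical mapping from an identified format to a candidate semantic type.
FORMAT_TO_TYPE = {"ddd-dd-dddd": "us_ssn", "ddddd": "us_zip"}


def is_valid_ssn(value):
    """Simplified, illustrative domain rule for the 'us_ssn' type."""
    if not KNOWN_PATTERNS["ddd-dd-dddd"].fullmatch(value):
        return False
    area, group, serial = value.split("-")
    return (area not in ("000", "666") and not area.startswith("9")
            and group != "00" and serial != "0000")


def is_valid_zip(value):
    """Illustrative domain rule for 'us_zip': five digits, not all zeros."""
    return bool(KNOWN_PATTERNS["ddddd"].fullmatch(value)) and value != "00000"


# Hypothetical domain rules: semantic type -> predicate for valid values.
DOMAIN_RULES = {"us_ssn": is_valid_ssn, "us_zip": is_valid_zip}


def select_subset(data_set, size=3):
    """Select a subset of the data set (here, simply the first few values)."""
    return data_set[:size]


def identify_format(subset):
    """Return the first known pattern that at least one subset value matches."""
    for name, pattern in KNOWN_PATTERNS.items():
        if any(pattern.fullmatch(value) for value in subset):
            return name
    return None


def find_invalid(data_set, semantic_type):
    """Validate every value in the full data set against the domain rule."""
    rule = DOMAIN_RULES[semantic_type]
    return [value for value in data_set if not rule(value)]


def profile(data_set):
    """Run select, identify, infer and validate; return (type, invalid values)."""
    fmt = identify_format(select_subset(data_set))
    if fmt is None:
        return None, []
    semantic_type = FORMAT_TO_TYPE[fmt]
    return semantic_type, find_invalid(data_set, semantic_type)


def present_invalid(semantic_type, invalid):
    """Present the invalid values to the user (here, printed to the console)."""
    for value in invalid:
        print(f"invalid {semantic_type}: {value!r}")


if __name__ == "__main__":
    values = ["123-45-6789", "000-12-3456", "078-05-1120",
              "not-an-ssn", "666-01-0001"]
    present_invalid(*profile(values))
```

In this sketch the format is identified from the subset only, while validation runs over the whole data set, mirroring the distinction the claim draws between the two steps.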