摘要 |
Data is prepared for processing in a data processing system using format information. Received data includes records that have values for fields. A target record format for processing the data is determined. Multiple records are analyzed (806) according to validation tests to determine (810) whether the data matches candidate record formats. Each candidate record format specifies a format for each field, and each validation test corresponds to at least one candidate record format. In response to receiving results of the validation tests, the target record format is associated with the data based on at least one of: a candidate record format (812) for which at least a partial match was determined according to at least one validation test, a parsed record format (830, 832, 834, 836, 838) selected according to a data type associated with the data, and a constructed record format (846) generated from an analysis of data characteristics. |