摘要 |
<p>A system for distributed processing of data records on a distributed database computer system. A matrix set is used to calculate a value set corresponding to each data record by using said data record as input. The matrix set comprises at least one processing parameter matrix and preferably a metadata matrix, a predictor to parameter correlation matrix and a beta function matrix for use as a link function in a generalized linear model. The value set comprises at least one calculation value. The distributed database computer system comprises processing units united in a share-nothing parallel processing architecture. Each processing unit comprises a processor of the each processing unit, a processor memory of the each processing unit, and a storage memory the each processing unit. The data records and the matrix set are stored in a distributed database using the storage memories. Each processor is operable for executing user defined functions, calculating the value set corresponding to only one data record at a time, executing transaction processing, storing data in the processor memory, and using the data stored in the processor memory for execution of the user defined functions within a framework of one transaction, wherein a first UDF is used to load the matrix set into the processor memory for subsequent multiple executions of a second UDF using the matrix set to calculate the value set.</p> |