METHOD AND SYSTEM FOR FUSING BUSINESS DATA FOR DISTRIBUTIONAL QUERIES
Abstract
The present disclosure relates to business data processing and facilitates fusing business data spanning disparate sources in order to process distributional queries for enterprise business intelligence applications. Particularly, the method comprises defining a Bayesian network based on one or more attributes associated with raw data spanning a plurality of disparate sources; pre-processing the raw data based on the Bayesian network to compute conditional probabilities therein as parameters; joining the one or more attributes in the raw data using the conditional probabilities; and executing probabilistic inference from a database of the parameters by employing an SQL engine. The Bayesian network may be validated based on an estimation error computed by comparing results of processing a set of validation queries on the raw data with results of processing the same queries on the Bayesian network.
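By way of a non-limiting illustration, the steps recited above may be sketched as follows. The sketch assumes a hypothetical three-node network (region -> product -> satisfaction), two hypothetical sources named `sales` and `survey` that share only the `product` attribute, and SQLite as the SQL engine; none of these names, data values, or choices come from the disclosure. Conditional probabilities are estimated as simple relative frequencies, and the validation step is not shown.

```python
import sqlite3

# Hypothetical raw data from two disparate sources (illustrative values only).
sales = [("north", "A"), ("north", "A"), ("north", "B"), ("south", "B")]
survey = [("A", "high"), ("A", "low"), ("B", "low"), ("B", "low")]

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE sales (region TEXT, product TEXT)")
con.execute("CREATE TABLE survey (product TEXT, satisfaction TEXT)")
con.executemany("INSERT INTO sales VALUES (?, ?)", sales)
con.executemany("INSERT INTO survey VALUES (?, ?)", survey)

# Pre-processing: store P(product | region) as a parameter table.
con.execute("""
    CREATE TABLE p_product_given_region AS
    SELECT s.region, s.product, COUNT(*) * 1.0 / t.n AS p
    FROM sales s
    JOIN (SELECT region, COUNT(*) AS n FROM sales GROUP BY region) t
      ON s.region = t.region
    GROUP BY s.region, s.product, t.n
""")

# Pre-processing: store P(satisfaction | product) as a parameter table.
con.execute("""
    CREATE TABLE p_sat_given_product AS
    SELECT s.product, s.satisfaction, COUNT(*) * 1.0 / t.n AS p
    FROM survey s
    JOIN (SELECT product, COUNT(*) AS n FROM survey GROUP BY product) t
      ON s.product = t.product
    GROUP BY s.product, s.satisfaction, t.n
""")


def satisfaction_distribution(region):
    """Return P(satisfaction | region) by marginalizing over product in SQL."""
    rows = con.execute("""
        SELECT b.satisfaction, SUM(a.p * b.p) AS p
        FROM p_product_given_region a
        JOIN p_sat_given_product b ON a.product = b.product
        WHERE a.region = ?
        GROUP BY b.satisfaction
    """, (region,)).fetchall()
    return dict(rows)


print(satisfaction_distribution("north"))
```

In this sketch the distributional query is answered entirely from the parameter tables: the join on `product` fuses the two sources, and the `SUM(a.p * b.p)` aggregate performs the marginalization, so no record-level join of the raw tables is required.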