摘要 |
<p>An automatic data mining tool that characterizes the relationships between different database fields from both structured and unstructured data (figure 1). It extracts a data model, identifies and categorizes all data fields, performs pre-processing to deal with unstructured data effectively, and processes the data without human intervention to automatically explore how the fields are related to one another. Prior to the commencement of user-controlled data mining, the present invention goes through all the fields in a database table space in order to establish meaningful relationships between various fields using whatever computer resources are available (i.e. by using 'cycle stealing'). This allows the present invention to run in the background and establish relationships between fields even before data mining (DM) begins, and determine redundant, useless, and/or trivial fields without any external guidance. This results in faster, more accurate data mining since these relationships are available before a user begins the process of data mining.</p> |