发明名称 Data enrichment using heterogeneous sources
摘要 A data enrichment system may include an attribute relevance module to measure relevance of an attribute to a data object to be enriched. The data object may include the attribute including a known or an unknown value. An output value confidence module may calculate a confidence of an output value of a source used for enrichment of the data object. The output value may represent the known and/or unknown values of the attribute. The system may use the measured relevance of the attribute and the calculated confidence of the output value to determine assignment of the known or unknown values to the attribute.
申请公布号 US9542434(B2) 申请公布日期 2017.01.10
申请号 US201414553180 申请日期 2014.11.25
申请人 ACCENTURE GLOBAL SERVICES LIMITED 发明人 Gomadam Karthik;Yeh Peter Z.;Verma Kunal;Srivatsa Harsha Kumar
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Mannava & Kang, P.C. 代理人 Mannava & Kang, P.C.
主权项 1. A data enrichment system comprising: an attribute relevance module, executed by at least one hardware processor, to measure relevance of an attribute to a data object to be enriched, the data object including the attribute including one of a known and an unknown value, wherein the relevance of the attribute includes a determination of a unique association of the attribute with the data object, and a determination of a discriminative property of the attribute with respect to instances of the data object based on evaluation of an entropy of a predetermined number of past values of the attribute; and an output value confidence module, executed by the at least one hardware processor, to determine a confidence of a source based on utility of an output value of the source used for enrichment of the data object, the output value representing at least one of the known and unknown values of the attribute, wherein the system uses the measured relevance of the attribute and the determined confidence of the source to determine assignment of the unknown value to the attribute.
地址 Dublin IE