发明名称 Identification Disambiguation in Databases
摘要 Various systems and methods are provided for identification disambiguation in databases. In one embodiment, a system includes an approximate structural equivalence (ASE) analyzer including logic that obtains a set of records from a database; logic that determines a knowledge homogeneity score (KHS) for a pair of records in the set of records; and logic that determines a condition of ASE for the pair of records based upon the KHS and a predefined KHS threshold. In another embodiment, a method includes determining a plurality of references shared by at least two records in a set of records; determining a weighting value for each shared reference; and determining a KHS for each pair of records in the set of records based upon at least one reference shared by the pair of records and the weighting value corresponding to the at least one shared reference.
申请公布号 US2011082862(A1) 申请公布日期 2011.04.07
申请号 US20100893253 申请日期 2010.09.29
申请人 GEORGIA TECH RESEARCH CORPORATION 发明人 WALSH JOHN P.;TANG LI
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址
您可能感兴趣的专利