Title: Probabilistic record linkage model derived from training data
Abstract: A method of training a system from examples achieves high accuracy by finding the optimal weighting of different clues indicating whether two data items, such as database records, should be matched or linked. When presented with two data items, the trained system provides one of three possible outputs: yes, no, or I don't know (human intervention required). A maximum entropy model can be used to determine whether the two records should be linked or matched. Under the trained maximum entropy model, a high probability indicates that the pair should be linked, a low probability indicates that the pair should not be linked, and pairs with intermediate probabilities are generally held for human review.
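The three-way decision described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the clue names, weights, bias, and the two thresholds (0.2 and 0.8) are all hypothetical values chosen for illustration, and the weights are set by hand here, whereas the abstract describes learning them from training examples. The sketch relies on the fact that a maximum entropy model with two outcomes reduces to a logistic form over the weighted clues.

```python
import math

# Hypothetical clue weights. In the method described, these would be
# learned from labelled example pairs rather than set by hand.
WEIGHTS = {
    "same_last_name": 2.0,
    "same_birth_date": 3.0,
    "different_first_name": -2.5,
}
BIAS = -2.0  # hypothetical offset leaning toward "do not link"


def clues(a, b):
    """Return the set of (hypothetical) clue names that fire for a record pair."""
    fired = set()
    if a["last"] == b["last"]:
        fired.add("same_last_name")
    if a["dob"] == b["dob"]:
        fired.add("same_birth_date")
    if a["first"] != b["first"]:
        fired.add("different_first_name")
    return fired


def link_probability(a, b):
    """Two-outcome maximum entropy model, written in its logistic form."""
    score = BIAS + sum(WEIGHTS[c] for c in clues(a, b))
    return 1.0 / (1.0 + math.exp(-score))


def decide(p, low=0.2, high=0.8):
    """Map a link probability to yes / no / hold for human review."""
    if p >= high:
        return "link"
    if p <= low:
        return "no-link"
    return "human-review"


if __name__ == "__main__":
    r1 = {"first": "Ann", "last": "Lee", "dob": "1970-01-01"}
    r2 = {"first": "Bob", "last": "Lee", "dob": "1970-01-01"}
    p = link_probability(r1, r2)
    print(round(p, 3), decide(p))
```

In a real deployment the two thresholds would presumably be tuned to trade off the volume of pairs sent to human review against the tolerated error rate; the values above are placeholders.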
Publication number: US6523019(B1) Publication date: 2003.02.18
Application number: US19990429514 Filing date: 1999.10.28
Applicant: CHOICEMAKER TECHNOLOGIES, INC. Inventor: BORTHWICK ANDREW E.
Classification: A61C3/00;A61C17/00;G06F15/18;G06F17/30;(IPC1-7):G06N5/00 Main classification: A61C3/00