Title: Identifying non-distinct names in a set of names
Abstract: Non-distinct names are identified in a set of names. The set of names is obtained for a first entity. In response to comparing a first name and a second name in the set of names, it is determined that the first name is similar to the second name. The first name and the second name are then searched for initials. In response to the search indicating that there is at least one initial in at least one of the first name and the second name, it is determined that the at least one initial matches a corresponding initial in the other of the first name and the second name, and one of the first name and the second name is marked as a non-distinct name. A cross-entity scoring technique is then applied using the distinct names in the set of names for the first entity and the names in another set of names for a second entity.
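The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the `difflib`-based similarity test, the 0.6 threshold, the positional pairing of name tokens, and the rule of keeping the name with fewer initials are all assumptions made for illustration. The cross-entity scoring step is not shown, because the abstract does not describe how it works.

```python
from difflib import SequenceMatcher


def tokens(name):
    """Split a name into lowercase tokens with periods stripped."""
    return [t.strip(".").lower() for t in name.split() if t.strip(".")]


def count_initials(name):
    """Count single-letter tokens, which this sketch treats as initials."""
    return sum(1 for t in tokens(name) if len(t) == 1)


def similar(first, second, threshold=0.6):
    """Assumed similarity test: character-level ratio from difflib."""
    ratio = SequenceMatcher(None, first.lower(), second.lower()).ratio()
    return ratio >= threshold


def initials_match(first, second):
    """True if at least one initial exists and every initial agrees with
    the first letter of the token at the same position in the other name."""
    a, b = tokens(first), tokens(second)
    if len(a) != len(b):
        return False
    found_initial = False
    for x, y in zip(a, b):
        if len(x) == 1 or len(y) == 1:
            found_initial = True
            if x[0] != y[0]:
                return False
    return found_initial


def split_distinct(names, threshold=0.6):
    """Return (distinct, non_distinct) for one entity's set of names."""
    non_distinct = set()
    for i, first in enumerate(names):
        for second in names[i + 1:]:
            if first in non_distinct or second in non_distinct:
                continue
            if similar(first, second, threshold) and initials_match(first, second):
                # Assumption: keep the fuller name, mark the one with
                # more initials (or the later one on a tie) as non-distinct.
                if count_initials(second) >= count_initials(first):
                    non_distinct.add(second)
                else:
                    non_distinct.add(first)
    distinct = [n for n in names if n not in non_distinct]
    marked = [n for n in names if n in non_distinct]
    return distinct, marked


names = ["Thomas B. Allen", "T. B. Allen", "Brian E. Macy", "Thomas R. Allen"]
distinct, marked = split_distinct(names)
```

In this sketch only the names left in `distinct` would be passed on to a cross-entity comparison, so an abbreviated variant such as "T. B. Allen" would not be counted as additional evidence alongside "Thomas B. Allen".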
Publication number: US8364692(B1)  Publication date: 2013.01.29
Application number: US201113208189  Filing date: 2011.08.11
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION;ALLEN THOMAS B.;MACY BRIAN E.;VINCENT CAROLJAYNE J.  Inventor: ALLEN THOMAS B.;MACY BRIAN E.;VINCENT CAROLJAYNE J.
Classification: G06F17/30  Main classification: G06F17/30
Agency:  Agent:
Main claim:
Address: