Title of Invention |
MANAGING AN ARCHIVE FOR APPROXIMATE STRING MATCHING |
Abstract |
In one aspect, in general, a method is described for managing an archive for determining approximate matches associated with strings occurring in records. The method includes: processing records to determine a set of string representations that correspond to strings occurring in the records; generating, for each of at least some of the string representations in the set, a plurality of close representations that are each generated from at least some of the same characters in the string; and storing entries in the archive that each represent a potential approximate match between at least two strings based on their respective close representations. |
Application Publication Number |
US2015066862(A1) |
Application Publication Date |
2015.03.05 |
Application Number |
US201414325007 |
Filing Date |
2014.07.07 |
Applicant |
Ab Initio Technology LLC |
Inventor |
Anderson Arlen |
Classification Number |
G06F17/30 |
Main Classification Number |
G06F17/30 |
Agency |
|
Agent |
|
Principal Claim |
1. A method for managing an archive for determining approximate matches associated with strings occurring in records, the method including:
processing records to determine a set of string representations that correspond to strings occurring in the records;
generating, for each of at least some of the string representations in the set, a plurality of close representations that are each generated from at least some of the same characters in the string; and
storing entries in an archive that each represent a potential approximate match between at least two strings based on their respective close representations. |
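The three claimed steps can be illustrated with a minimal Python sketch. It rests on assumptions the claim does not state: strings are taken to be whitespace-separated words, a "close representation" is taken to be the string itself or the string with one character deleted (one possible way to build a representation "from at least some of the same characters"), and the archive is a plain set of string pairs. The names `close_representations` and `build_archive` are hypothetical and do not come from the patent.

```python
from collections import defaultdict
from itertools import combinations


def close_representations(s):
    """Assumed scheme: the string itself plus every single-character deletion."""
    return {s} | {s[:i] + s[i + 1:] for i in range(len(s))}


def build_archive(records):
    """Return a set of (a, b) pairs that are potential approximate matches."""
    # Step 1: determine the set of strings occurring in the records
    # (assumption: whitespace-separated words).
    strings = {word for record in records for word in record.split()}

    # Step 2: generate close representations and index strings by them.
    by_variant = defaultdict(set)
    for s in strings:
        for variant in close_representations(s):
            by_variant[variant].add(s)

    # Step 3: store one entry per pair of strings sharing a close representation.
    archive = set()
    for group in by_variant.values():
        for a, b in combinations(sorted(group), 2):
            archive.add((a, b))
    return archive


archive = build_archive(["john smith", "jon smith", "jhon smyth"])
```

Under this assumed scheme, two strings are paired whenever they share at least one close representation, so "smith" and "smyth" would be linked through the shared variant "smth". Each stored pair is only a candidate; a later comparison step would still have to confirm or reject it.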
地址 |
Lexington MA US |