Title of invention |
METHOD AND SYSTEM FOR IMPLEMENTING APPROXIMATE STRING MATCHING WITHIN DATABASE |
Abstract |
PROBLEM TO BE SOLVED: To provide a computer-based method for character string matching of a candidate character string with a plurality of character string records stored in a database. SOLUTION: The method includes: a step a) of identifying a set of reference character strings in a database, wherein the reference character strings are identified utilizing an optimization search for a set of dissimilar character strings; a step b) of generating an n-gram representation for one of the reference character strings in the set of reference character strings; a step c) of generating an n-gram representation for the candidate character string; a step d) of determining a similarity between the n-gram representations; a step e) of repeating steps b) and d) for the remaining reference character strings in the set of identified reference character strings; and a step f) of indexing the candidate character string within the database based on the determined similarities between the n-gram representations of the candidate character string and the reference character strings in the identified set. |
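Steps b) through e) of the abstract can be illustrated with a minimal Python sketch. The abstract does not state the n-gram size or the similarity measure, so character bigrams and cosine similarity are assumptions made here for illustration, and the function names `ngrams`, `cosine_similarity`, and `similarity_key` are hypothetical. Step a), the optimization search for dissimilar reference strings, is not implemented; the reference strings are simply passed in.

```python
from collections import Counter
from math import sqrt


def ngrams(text: str, n: int = 2) -> Counter:
    """Count the overlapping character n-grams of a lower-cased string.

    Assumption: n = 2 (bigrams); the abstract does not fix n.
    """
    s = text.lower()
    return Counter(s[i:i + n] for i in range(len(s) - n + 1))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity of two n-gram count vectors.

    Assumption: the abstract only says "a similarity"; cosine is one
    common choice, not necessarily the one used in the patent.
    """
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a string shorter than n has no n-grams
    return dot / (norm_a * norm_b)


def similarity_key(candidate: str, references: list[str], n: int = 2) -> list[float]:
    """Steps b) to e): compare the candidate with every reference string.

    The resulting list of similarities is the value a database could
    index on in step f).
    """
    cand = ngrams(candidate, n)
    return [cosine_similarity(cand, ngrams(ref, n)) for ref in references]


if __name__ == "__main__":
    # Hypothetical reference strings standing in for the output of step a).
    refs = ["walmart supercenter", "shell oil", "delta air lines"]
    print(similarity_key("wal-mart supercentre", refs))
```

Two candidates that are near-duplicates of each other would be expected to produce similar keys, which is what makes the list of similarities usable as an index for approximate matching.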
Publication number |
JP2014029713(A) |
Publication date |
2014.02.13 |
Application number |
JP20130193143 |
Filing date |
2013.09.18 |
Applicant |
MASTERCARD INTERNATIONAL INC |
Inventors |
CHRISTOPHER J MERZ;THOMAS MCGEEHAN |
Classification |
G06F17/30;G06Q40/02 |
Main classification |
G06F17/30 |
Patent agency |
|
Patent agent |
|
Principal claim |
|
Address |
|