Title of Invention Methods and systems for implementing approximate string matching within a database
Abstract A computer-based method for character string matching of a candidate character string with a plurality of character string records stored in a database is described. The method includes a) identifying a set of reference character strings in the database, the reference character strings identified utilizing an optimization search for a set of dissimilar character strings, b) generating an n-gram representation for one of the reference character strings in the set of reference character strings, c) generating an n-gram representation for the candidate character string, d) determining a similarity between the n-gram representations, e) repeating steps b) and d) for the remaining reference character strings in the set of identified reference character strings, and f) indexing the candidate character string within the database based on the determined similarities between the n-gram representation of the candidate character string and the reference character strings in the identified set.
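Steps b) through f) of the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not name a similarity measure or an n-gram length, so the Dice coefficient over character bigrams is an assumption made here for illustration. The function names (`ngrams`, `dice_similarity`, `similarity_key`) are hypothetical, and the reference strings are taken as given, so the optimization search of step a) is not shown.

```python
def ngrams(text, n=2):
    """Return the set of character n-grams of a string (steps b and c).

    Lowercasing and n=2 are illustrative assumptions, not taken from the patent.
    """
    text = text.lower()
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def dice_similarity(a, b):
    """Dice coefficient between two n-gram sets (step d).

    The choice of the Dice coefficient is an assumption; the abstract only
    says that a similarity between n-gram representations is determined.
    """
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


def similarity_key(candidate, references, n=2):
    """Similarity of the candidate to each reference string (steps b to e).

    The resulting tuple is one possible index key for the candidate (step f).
    """
    candidate_grams = ngrams(candidate, n)
    return tuple(
        round(dice_similarity(candidate_grams, ngrams(ref, n)), 3)
        for ref in references
    )


# Hypothetical reference strings, assumed to have been chosen already.
references = ["night", "nacht"]
key = similarity_key("night", references)
print(key)
```

Under these assumptions, strings whose keys lie close together are candidates for an approximate match, so a lookup can compare a short tuple of similarities rather than every stored record.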
Publication No. US8219550(B2) Publication Date 2012.07.10
Application No. US201113041075 Filing Date 2011.03.04
Applicant MERZ CHRISTOPHER J.;MCGEEHAN THOMAS;MASTERCARD INTERNATIONAL INCORPORATED Inventor MERZ CHRISTOPHER J.;MCGEEHAN THOMAS
Classification G06F7/00 Main Classification G06F7/00
Agency Agent
Main Claim
Address