发明名称 System and method of record matching in a database
摘要 A system and method of record matching using regular expressions and finite state representations. In this manner, the time (or computational effort) involved in record matching is reduced.
申请公布号 US9218372(B2) 申请公布日期 2015.12.22
申请号 US201213565484 申请日期 2012.08.02
申请人 SAP SE 发明人 Shami Mohammad;Wright Kevin
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 Fountainhead Law Group PC 代理人 Fountainhead Law Group PC
主权项 1. A computer-implemented method of record matching in a database, the computer-implemented method being implemented by a computer program that is stored by a memory of a computer system and executed by a processor of the computer system, the computer-implemented method comprising: generating, by the computer system, a plurality of regular expressions from a plurality of records, wherein each of the plurality of regular expressions corresponds to a corresponding one of the plurality of records; generating, by the computer system, a combined regular expression by combining the plurality of regular expressions, wherein generating the combined regular expression comprises generating the combined regular expression by performing a union operation on the plurality of regular expressions; generating, by the computer system, a combined finite state representation from the combined regular expression; processing, by the computer system, the combined finite state representation to identify that a first record matches a second record in the plurality of records; generating a subset of the plurality of records that does not contain matches by processing the plurality of records using the combined finite state representation; and checking whether a new record is a duplicate before adding it to the plurality of records by processing the new record using the combined finite state representation.
地址 Walldorf DE