摘要 |
Systems and methods for providing efficient entity record conflation are presented. A first processing phase is made in regard to conflating location data of a corpus of entity records. This first processing phase is conducted in an offline, asynchronous manner to aggregate the entity records of a corpus of entity records into location clusters, each location cluster of entity records considered to correspond to a same structure at a particular geographic location. A second processing phase is conducted in a near real-time manner in regard to conflating received entity records with the entity records of the corpus of entity records. This second processing phase first matches received entity records to a location cluster, and then matches a received entity record to an entity record within the location cluster. Upon matching the received entity record with an entity record in a location cluster, the two entity records are conflated. |