发明名称 Normalization Rule Generation and Implementation Systems and Methods
摘要 Product records having attributes according to various schema are normalized such that the attributes thereof conform to a canonical schema. Attributes for large numbers of product records are normalized according to a map-reduce framework in which mappers are defined but no reducers. Rules are implemented by a rule engine that is instantiated one time per VM of the map-reduce framework. Likewise, each rule may be implemented by only one object instance per rule engine instance. Generation of rules by analysts may be facilitated by defining an attribute hierarchy. A rule defined for a parent attribute may be presented as a default rule for a child attribute and either accepted or revised. Attributes may be clusters and proposed rules generated. Proposed rules may then be accepted or rejected by an analyst.
申请公布号 US2016034500(A1) 申请公布日期 2016.02.04
申请号 US201414447454 申请日期 2014.07.30
申请人 Wal-Mart Stores, Inc. 发明人 Yang Fan;Rampalli Narasimhan;Xie Jun
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method for attribute normalization, the method comprising: defining a rule engine programmed to create a rule engine instance implementing a single instance of each of a plurality of rule application objects; configuring a cluster of computer systems to implement a map-reduce framework including N mapper instances and zero reducer instances; instantiating a plurality of virtual machines on the cluster of computer systems and instantiating one rule engine instance on each virtual machine of the plurality of virtual machines; receiving a plurality of product records each having a plurality of attributes, the attributes of the product records being represented according to a plurality of different schema; allocating to each rule engine instance an allocation of the plurality of attributes of the plurality of product records to the rule engine instances of the plurality of virtual machines; processing in each rule engine instance the allocation allocated thereto to transform the attributes of the allocation to normalized attributes of a canonical schema; and storing updated records for the plurality of product records including the normalized attributes.
地址 Bentonville AR US