Title of invention: Product record normalization system with efficient and scalable methods for discovering, validating, and using schema mappings
Abstract: Systems and methods are disclosed herein for generating a normalized record from an import record, the normalized record having attribute-value pairs corresponding to a native schema. In import records, a plurality of attribute-value pairs are identified, each having an attribute label not found in a native schema. One or more attribute labels in the native schema having as possible values one or more values corresponding to the values of the plurality of attribute-value pairs are also identified. The computer system generates one or more normalization rules relating one or more attribute labels of the plurality of attribute-value pairs to at least a portion of the one or more attribute labels in the native schema. Normalization rules may be validated by crowdsourcing. Normalization rules may be applied by classifying the import record and identifying the rules applicable to that classification.
Publication number: US9311372(B2) Publication date: 2016.04.12
Application number: US201313907243 Filing date: 2013.05.31
Applicant: Wal-Mart Stores, Inc. Inventors: Garera Nikesh Lucky; Rampalli Narasimhan; Ravikant Dintyala Venkata Subrahmanya; Subramaniam Srikanth; Sun Chong; Yalin Heather Dawn
Classification: G06F7/00; G06F17/30 Main classification: G06F7/00
Agency: Bryan Cave LLP Agent: Bryan Cave LLP
Principal claim: 1. A method for classification, the method comprising: identifying, by a computer system, in one or more import records, a plurality of attribute-value pairs each having: an attribute label not found in a native schema; and a value; identifying, by the computer system, one or more attribute labels in the native schema having as possible values one or more values corresponding to values of the plurality of attribute-value pairs; generating, by the computer system, one or more normalization rules relating the one or more attribute labels of the plurality of attribute-value pairs to at least a portion of the one or more attribute labels in the native schema; normalizing a plurality of non-normalized records according to the one or more normalization rules to generate a plurality of provisionally normalized records; transmitting the plurality of provisionally normalized records to a crowdsourcing forum; receiving, from the crowdsourcing forum, one or more favorable validation decisions with respect to a first portion of the plurality of provisionally normalized records; receiving, from the crowdsourcing forum, one or more unfavorable validation decisions with respect to a second portion of the plurality of provisionally normalized records; identifying one or more first normalization rules from the one or more normalization rules, the one or more first normalization rules corresponding to the first portion of the plurality of provisionally normalized records; and adding the one or more first normalization rules to a validated rule set.
Address: Bentonville AR US
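
Illustrative sketch (not part of the patent record): the steps recited in the principal claim can be pictured roughly as follows. This is a minimal Python sketch under stated assumptions, not the patented implementation. The schema contents, the function names (`discover_rules`, `normalize`, `validate_rules`), the value-overlap threshold `min_overlap`, and the `crowd_decision` callback standing in for the crowdsourcing forum are all hypothetical. The record does not say how value correspondence is scored or how a decision on a record is attributed to individual rules; here a simple overlap fraction is used, and a rule is kept only if it appears in favorably judged records and in no unfavorably judged one.

```python
from collections import defaultdict

# Hypothetical native schema: attribute label -> set of possible values.
NATIVE_SCHEMA = {
    "color": {"red", "blue", "black"},
    "screen_size": {"32 in", "40 in", "55 in"},
}


def discover_rules(import_records, schema, min_overlap=0.5):
    """Propose rules mapping non-native labels to native labels by value overlap."""
    seen = defaultdict(set)  # non-native label -> values observed in import records
    for record in import_records:
        for label, value in record.items():
            if label not in schema:
                seen[label].add(value.strip().lower())
    rules = {}
    for label, values in seen.items():
        # Score each native label by the fraction of observed values it accepts.
        best, best_score = None, 0.0
        for native, possible in schema.items():
            score = len(values & possible) / len(values)
            if score > best_score:
                best, best_score = native, score
        if best is not None and best_score >= min_overlap:
            rules[label] = best
    return rules


def normalize(record, rules):
    """Apply rules to one record; return the provisional record and the rules used."""
    out, used = {}, set()
    for label, value in record.items():
        if label in rules:
            out[rules[label]] = value.strip().lower()
            used.add(label)
        else:
            out[label] = value
    return out, used


def validate_rules(records, rules, crowd_decision):
    """Keep rules used in favorably judged records and in no unfavorable one.

    crowd_decision(original, normalized) -> bool is a stand-in for the
    crowdsourcing forum (an assumption of this sketch).
    """
    favorable, unfavorable = set(), set()
    for record in records:
        normalized, used = normalize(record, rules)
        if not used:
            continue  # nothing to validate for this record
        if crowd_decision(record, normalized):
            favorable |= used
        else:
            unfavorable |= used
    return {label: rules[label] for label in favorable - unfavorable}
```

In use, one would call `discover_rules` on a batch of import records, pass the resulting rules and a batch of non-normalized records to `validate_rules`, and add the returned mapping to the validated rule set. Attributing a record-level decision to every rule applied to that record is a simplification chosen for brevity.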