Title of invention: Iterative generation of partial column schema
Abstract: Systems and methods for iteratively generating a partial column schema indicative of semantic relationships in a corpus of key-value data are disclosed. A set of textual values is extracted from a pre-existing corpus of key-value data and potential column names are generated. Value reassignment and potential column pruning proceed based on semantic fit quality, potential column utilization, and random factors influenced by a decreasing system temperature.
Publication number: US9104707(B1); Publication date: 2015.08.11
Application number: US201313829375; Filing date: 2013.03.14
Applicant: AMAZON TECHNOLOGIES, INC.; Inventor: Allen Nicholas Alexander
Classification: G06F17/30; Main classification: G06F17/30
Agency: Baler & Hostetler, LLP; Agent: Baler & Hostetler, LLP
Principal claim: 1. A system comprising: one or more storage devices comprising one or more database files configured to maintain a set of items comprising a key and one or more values associated with the key; and one or more memories having stored thereon computer-readable instructions that upon execution cause the system at least to: extract a set of values from the set of items; generate a set of column names from the set of values; assign a plurality of values from the set of items to a plurality of column names in the set of column names; recursively reassign a first value to other column names in the plurality of column names based at least in part on a semantic fit quality and a utilization quality; wherein the semantic fit quality is based at least in part on a solution constraint and semantic similarity of the first value to a column name to which the first value is assigned and the other values in the plurality of values assigned to the column name, the solution constraint based at least in part on the first value sharing a common key with a value from the set of items; and wherein the utilization quality is based at least in part on a number of values currently assigned to the column name to which the first value is assigned and a comparison of the semantic fit quality to a prospective semantic fit quality.
Address: Seattle, WA, US
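
Illustrative sketch (not part of the patent record): the abstract and claim 1 describe reassigning values among candidate column names according to a semantic fit quality, a utilization quality, and random factors governed by a decreasing temperature. The Python below is one minimal way such a loop could look, in the style of simulated annealing. Everything in it is an assumption made for illustration: the function names (token_overlap, fit_quality, anneal), the use of word-set overlap as a stand-in for semantic similarity, the reading of the solution constraint as "two values with the same key should not share a column", the 0.5/0.5 and 0.1 weights, and the geometric cooling schedule. The record does not specify any of these.

```python
import math
import random
from collections import defaultdict


def token_overlap(a, b):
    """Jaccard overlap of lowercase word sets (assumed stand-in for semantic similarity)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def fit_quality(value, key, column, members):
    """Hypothetical semantic fit of `value` in `column`.

    `members` is the list of (key, value) pairs already in the column,
    excluding the value being scored.
    """
    # Assumed solution constraint: a column holds at most one value per key.
    if any(k == key for k, _ in members):
        return 0.0
    name_score = token_overlap(value, column)
    if not members:
        return name_score
    peer_score = sum(token_overlap(value, v) for _, v in members) / len(members)
    return 0.5 * name_score + 0.5 * peer_score


def anneal(items, columns, steps=2000, t_start=1.0, t_end=0.01, seed=0):
    """Reassign values to columns under a geometrically decreasing temperature."""
    rng = random.Random(seed)
    cells = [(key, v) for key, values in items.items() for v in values]
    assignment = {i: rng.choice(columns) for i in range(len(cells))}

    def members(col, skip):
        return [cells[j] for j, c in assignment.items() if c == col and j != skip]

    for step in range(steps):
        frac = step / max(steps - 1, 1)
        temperature = t_start * (t_end / t_start) ** frac
        i = rng.randrange(len(cells))
        key, value = cells[i]
        current = assignment[i]
        candidate = rng.choice(columns)
        if candidate == current:
            continue
        current_members = members(current, i)
        current_fit = fit_quality(value, key, current, current_members)
        prospective_fit = fit_quality(value, key, candidate, members(candidate, i))
        # Assumed utilization term: leaving a sparsely used column is slightly
        # favored, so that under-used columns tend to empty out and be pruned.
        delta = prospective_fit - current_fit + 0.1 / (1 + len(current_members))
        # Accept improvements always; accept worse moves with a probability
        # that shrinks as the temperature decreases.
        if delta >= 0 or rng.random() < math.exp(delta / temperature):
            assignment[i] = candidate

    # Pruning: only columns that still hold at least one value are returned.
    schema = defaultdict(list)
    for i, col in assignment.items():
        schema[col].append(cells[i])
    return dict(schema)


if __name__ == "__main__":
    items = {"sku1": ["red", "cotton shirt"], "sku2": ["dark red", "wool shirt"]}
    for column, assigned in anneal(items, ["red", "shirt"]).items():
        print(column, assigned)
```

The result maps each surviving column name to the (key, value) pairs assigned to it; columns that end up empty are dropped, which loosely mirrors the column pruning mentioned in the abstract.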