发明名称 Key profile computation and data pattern profile computation
摘要 Architecture that provides a data profile computation technique which employs key profile computation and data pattern profile computation. Key profile computation in a data table includes both exact keys as well as approximate keys, and is based on key strengths. A key strength of 100% is an exact key, and any other percentage in an approximate key. The key strength is estimated based on the number of table rows that have duplicated attribute values. Only column sets that exceed a threshold value are returned. Pattern profiling identifies a small set of regular expression patterns which best describe the patterns within a given set of attribute values. Pattern profiling includes three phases: a first phases for determining token regular expressions, a second phase for determining candidate regular expressions, and a third phase for identifying the best regular expressions of the candidates that match the attribute values.
申请公布号 US7720883(B2) 申请公布日期 2010.05.18
申请号 US20070769050 申请日期 2007.06.27
申请人 MICROSOFT CORPORATION 发明人 CHEN ZHIMIN;GANTI VENKATESH;JHA GUNJAN;KAUSHIK SHRIRAGHAV;NARASAYYA VIVEK
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 代理人
主权项
地址