Title of invention Generating statistics on text pattern matching predicates for access planning
Abstract Statistics for a pattern matching predicate are generated using stored character statistics. A first structure stores, for each of a plurality of character positions, the characters that occur frequently in that position, together with a count of the occurrences of each such character. A second structure stores the characters that frequently follow the characters stored in the first structure, together with a probability of occurrence for each such subsequent character. To estimate the number of tuples matching a pattern matching predicate, statistics are retrieved for the matching character in each matching position of the predicate and then combined to produce the estimate. If no statistic is stored for a desired character, an estimate is made from the available statistics by accumulating the statistics for the other characters and then calculating the average frequency of occurrence of the characters that do not have stored statistics.
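The estimation described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the dictionary layouts, the names `char_fraction` and `estimate_matches`, the `alphabet_size` parameter, the restriction to patterns with `_` wildcards and a trailing `%`, and the fallback to per-position frequency when no follow-on probability is stored are all assumptions made for this example.

```python
from typing import Dict, Tuple

# Hypothetical layouts; the abstract does not specify concrete structures.
# First structure: position -> {frequent character: occurrence count}
PositionStats = Dict[int, Dict[str, int]]
# Second structure: (position, character) -> {frequent next character: probability}
FollowStats = Dict[Tuple[int, str], Dict[str, float]]


def char_fraction(pos_stats: PositionStats, position: int, ch: str,
                  total_rows: int, alphabet_size: int) -> float:
    """Fraction of rows expected to have `ch` at `position`."""
    stored = pos_stats.get(position, {})
    if ch in stored:
        return stored[ch] / total_rows
    # No stored statistic: accumulate the counts of the stored characters,
    # then spread the remaining rows evenly over the unstored characters.
    remaining_rows = max(total_rows - sum(stored.values()), 0)
    remaining_chars = max(alphabet_size - len(stored), 1)
    return remaining_rows / remaining_chars / total_rows


def estimate_matches(pattern: str, pos_stats: PositionStats,
                     follow_stats: FollowStats, total_rows: int,
                     alphabet_size: int = 26) -> float:
    """Estimate how many rows match a LIKE-style pattern.

    Assumed pattern syntax: literal characters, '_' for any single
    character, and an optional trailing '%'.
    """
    selectivity = 1.0
    prev = None  # previous literal character, if directly adjacent
    for position, ch in enumerate(pattern):
        if ch == "%":
            break  # assumption: only a trailing '%' is handled
        if ch == "_":
            prev = None  # wildcard matches anything; no constraint added
            continue
        follow = {}
        if prev is not None:
            follow = follow_stats.get((position - 1, prev), {})
        if ch in follow:
            # Conditional probability from the second structure.
            selectivity *= follow[ch]
        else:
            # Assumed fallback: per-position frequency from the first structure.
            selectivity *= char_fraction(pos_stats, position, ch,
                                         total_rows, alphabet_size)
        prev = ch
    return selectivity * total_rows


# Made-up statistics for a 1000-row column, for illustration only.
pos_stats = {0: {"a": 400, "b": 200}, 1: {"n": 300}}
follow_stats = {(0, "a"): {"n": 0.5}}
print(estimate_matches("an%", pos_stats, follow_stats, 1000))
print(estimate_matches("c%", pos_stats, follow_stats, 1000))
```

With these sample numbers, the pattern `an%` combines the stored count for `a` in position 0 with the stored probability that `n` follows it, while `c%` has no stored statistic and falls back to the average frequency of the 24 unstored characters.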
Publication number US7386564(B2) Publication date 2008.06.10
Application number US20040758486 Filing date 2004.01.15
Applicant INTERNATIONAL BUSINESS MACHINES CORPORATION Inventors ABDO ABDO ESMAIL; DRUCKER TRAVIS MICHAEL
Classification G06F7/00; G06F17/00; G06F17/30 Main classification G06F7/00
Agency Agent
Principal claim
Address