Title: Efficient data pattern matching
Abstract: Exemplary methods, apparatuses, and systems for parsing unstructured data with a plurality of pattern matching rules are disclosed. For one or more of the pattern matching rules, an optimized pattern matching rule is derived from the original pattern matching rule. The optimized rule consists of either a text string extracted from the original rule or a pattern that is less complex than the original rule. If the extracted text string or simpler pattern is determined not to match any of the data to be parsed, application of the original pattern matching rule is bypassed. The original pattern matching rule is applied only when the one or more optimized pattern matching rules match the data.
Publication No.: US9244903(B2) Publication Date: 2016.01.26
Application No.: US201313863233 Filing Date: 2013.04.15
Applicant: VMware, Inc. Inventors: Huang Chengdu; Li Zhenmin; Xanthos Spiros
Classification: G06F17/30; G06F17/27; G06N5/02 Main Classification: G06F17/30
Agency: Agent:
Main Claim: 1. A computer-implemented method of parsing data using a plurality of pattern matching rules, the method comprising: deriving one or more optimized pattern matching rules from an original pattern matching rule, wherein the original pattern matching rule includes a constant text string and one or more regular expression operators that define variable patterns of characters that occur prior to or following the text string in order to satisfy the original pattern matching rule, and wherein deriving one or more optimized pattern matching rules includes extracting the constant text string from the original pattern matching rule to generate an optimized pattern matching rule including the constant text string and excluding the one or more regular expression operators; determining if the one or more optimized pattern matching rules match any of the data to be parsed; bypassing an application of the original pattern matching rule to the data when the optimized pattern matching rule including the constant string does not match the data; and applying the original pattern matching rule to the data when the one or more optimized pattern matching rules match the data, wherein applying the original pattern matching rule includes determining if the original pattern matching rule matches any of the data to be parsed.
Address: Palo Alto CA US
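
The steps recited in the main claim (derive a literal-only rule, test it first, and run the full rule only on a hit) can be illustrated with a minimal Python sketch. This is not the patented implementation: the names `extract_literal` and `PrefilteredRule` are hypothetical, the extraction is deliberately conservative (it gives up on patterns containing groups, alternation, character classes, or counted repetition), and it assumes the rule is compiled without flags such as `re.IGNORECASE` or `re.VERBOSE`.

```python
import re
from typing import Optional


def extract_literal(pattern: str) -> Optional[str]:
    """Return the longest run of plain characters that every match must contain.

    Hypothetical helper for illustration. Returns None when no safe literal
    can be derived, in which case no prefilter is used.
    """
    # Conservative: bail out on constructs this simple scanner does not model.
    if any(c in pattern for c in "|()[]{}"):
        return None
    runs, cur = [], []
    i = 0
    while i < len(pattern):
        c = pattern[i]
        nxt = pattern[i + 1] if i + 1 < len(pattern) else ""
        if c == "\\":
            # Any escape (\d, \s, \., ...) ends the current literal run.
            runs.append("".join(cur))
            cur = []
            i += 2
            continue
        if c in ".^$*+?":
            # Regular expression operator: ends the current literal run.
            runs.append("".join(cur))
            cur = []
            i += 1
            continue
        if nxt in ("*", "?"):
            # This character is optional, so it cannot be part of a required run.
            runs.append("".join(cur))
            cur = []
            i += 1
            continue
        cur.append(c)
        if nxt == "+":
            # At least one occurrence is required, but the run ends here.
            runs.append("".join(cur))
            cur = []
        i += 1
    runs.append("".join(cur))
    best = max(runs, key=len)
    return best or None


class PrefilteredRule:
    """Pairs an original regex rule with its derived literal-only rule."""

    def __init__(self, pattern: str) -> None:
        self.regex = re.compile(pattern)          # original rule
        self.literal = extract_literal(pattern)   # optimized rule (may be None)
        self.full_regex_calls = 0                 # counts non-bypassed applications

    def search(self, text: str) -> Optional[re.Match]:
        # Bypass the original rule when the constant string is absent.
        if self.literal is not None and self.literal not in text:
            return None
        # Optimized rule matched (or none exists): apply the original rule.
        self.full_regex_calls += 1
        return self.regex.search(text)


rule = PrefilteredRule(r"\d+ ERROR disk .* full")
for line in ["12 ERROR disk sda full", "all good", "7 WARN disk sda full"]:
    print(line, "->", bool(rule.search(line)))
print("full regex applied", rule.full_regex_calls, "time(s)")
```

In this sketch a substring test stands in for the optimized rule because it is typically much cheaper than running a regular expression engine; the benefit depends on how often the literal is absent from the data being parsed.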