Title: METHOD FOR STORING AND APPLYING RELATED SETS OF PATTERN/MESSAGE RULES
Abstract: This invention provides a method and apparatus for efficiently storing and applying related sets of pattern/message rules that are used to analyse and annotate blocks of text. Where sets of rules can include other sets, representations of the sets that speed up analysis can contain significant redundancy and so consume additional memory. In one aspect of the invention, all rules are represented in a single pattern-matching data structure (which is applied to a block of text to find all matches by all rules), and the rulesets are represented using boolean vectors (one of which is used to filter the matches) that are compressed by identifying common subspans. In a further aspect of the invention, each ruleset is represented by its own pattern-matching data structure, and these structures are compressed by identifying common parts. In each aspect, the effect is to allow the creation of a data structure that can speed up matching without consuming excessive memory.
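The first aspect described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names (`Rule`, `build_index`, `compress`, `annotate`), the restriction to single-word literal patterns, the fixed chunk length `CHUNK`, and the sample rules are all assumptions made for illustration. A word-to-rule dictionary stands in for the single pattern-matching data structure, each ruleset is a boolean vector over rule ids, and vectors are stored as fixed-length chunks that are shared whenever two rulesets contain an identical span.

```python
from typing import NamedTuple


class Rule(NamedTuple):
    pattern: str  # assumption: a single-word literal pattern
    message: str


CHUNK = 4  # hypothetical span length used when sharing vector chunks


def build_index(rules):
    """Single structure for ALL rules: maps a word to the ids of rules matching it."""
    index = {}
    for rule_id, rule in enumerate(rules):
        index.setdefault(rule.pattern.lower(), []).append(rule_id)
    return index


def compress(vector, pool):
    """Store a boolean vector as a tuple of chunks, reusing identical chunks from pool."""
    chunks = []
    for start in range(0, len(vector), CHUNK):
        chunk = tuple(vector[start:start + CHUNK])
        # setdefault returns the already-stored chunk when an equal one exists
        chunks.append(pool.setdefault(chunk, chunk))
    return tuple(chunks)


def is_member(compressed, rule_id):
    """Look up one bit of a compressed ruleset vector."""
    return compressed[rule_id // CHUNK][rule_id % CHUNK]


def annotate(text, rules, index, ruleset):
    """Find matches by all rules, then keep only those in the chosen ruleset."""
    annotations = []
    for position, word in enumerate(text.lower().split()):
        for rule_id in index.get(word, ()):
            if is_member(ruleset, rule_id):
                annotations.append((position, rules[rule_id].message))
    return annotations


# Illustrative data only: eight made-up rules and two related rulesets.
RULES = [
    Rule("utilize", "Consider 'use'."),
    Rule("very", "Consider removing this intensifier."),
    Rule("irregardless", "Use 'regardless'."),
    Rule("alot", "Write 'a lot'."),
    Rule("synergy", "Avoid jargon."),
    Rule("leverage", "Avoid jargon."),
    Rule("moist", "House style avoids this word."),
    Rule("literally", "Check whether this is meant literally."),
]
INDEX = build_index(RULES)

POOL = {}
# BASE holds rules 0-3; EXTENDED includes BASE plus rules 4 and 5.
BASE = compress([True] * 4 + [False] * 4, POOL)
EXTENDED = compress([True] * 6 + [False] * 2, POOL)

TEXT = "We will utilize very unique synergy"
print(annotate(TEXT, RULES, INDEX, BASE))
print(annotate(TEXT, RULES, INDEX, EXTENDED))
```

Because `EXTENDED` includes `BASE`, their first chunks are identical and are stored once in `POOL`. Matching cost in this sketch depends on the text length and the number of hits, not on the number of rules in the selected ruleset, which is the general effect the abstract describes.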
Publication No.: US2015082142(A1) Publication date: 2015.03.19
Application No.: US201314396730 Filing date: 2013.04.29
Applicant: Citadel Corporation PTY LTD Inventor: Williams Ross Neil
Classification: G06F17/24 Main classification: G06F17/24
Agency: Agent:
Main claim: 1. A method for generating annotations for a block of text T using a ruleset S, the method comprising the steps of: (a) storing a plurality of rulesets containing a plurality of rules created by a plurality of entities, each rule comprising a text pattern and a message; (b) representing a plurality of rulesets in a data structure D that allows any ruleset R to be applied to a block of text to generate annotations such that the operation has a time complexity less than O(RT); and (c) using D to apply a particular ruleset S to T to generate annotations.
Address: Adelaide AU