发明名称 Grammer for regular expressions
摘要 The grammar of this invention is an approach to regular expressions which introduces advantages to programmers who use regular expressions for scanning, searching, and tokenizing text: Allowing composition of regular expressions (patterns) through the standard C/C++ operators (using C/C++ precedence and associativity), thus appealing to a huge audience of programmers already familiar with that style of grammar. Generalizing the negated character-class (so familiar to Perl programmers) in a way that quite dramatically allows ANY pattern composition to be used for searching (the "subjunctive" binary composition). Integrating into an elegantly simple grammar form ("do-pattern") the ability to create arbitrary side-effects of tokenization, accomplished in prior art through a cumbersome combination of tokenizing expressions and parse trees, such as in the grammar style of Lex-Yacc. Generalizing the capture-to-variable feature (as seen in Perl), allowing the capture of portions of the stream (match sub-expressions) into any variable current in the scope of the regular-expression. Allowing the parameterization of production rules, as templates, which allow similar (in form) regular expressions to be written as multiple instantiations of the same production rule template (accomplished via in params). Further allowing parameterization of production rule templates to extend to the capture output of the expressions (accomplished via a combination of "do-patterns", "capture-patterns", and out or in/out params). Creating novel support algorithms (to accomplish the above) not seen in any texts on finite automata.
申请公布号 US2004225999(A1) 申请公布日期 2004.11.11
申请号 US20030691414 申请日期 2003.10.22
申请人 NUSS ANDREW 发明人 NUSS ANDREW
分类号 G06F9/44;G06F9/45;(IPC1-7):G06F9/44 主分类号 G06F9/44
代理机构 代理人
主权项
地址