发明名称 Real time single-sweep detection of key words and content analysis
摘要 A system and method are provided for real-time analysis of text. During a single sweep through the text, a detection tree is used to identify a sequence of characters in the text from a large dictionary of keywords. When a keyword is detected a rule tally database is updated. An intermediate score may be available during the sweep and a final score of the text may be available substantially immediately upon finishing the single sweep. A second text may be analyzed immediately using the same score buffer and rule tally database without updating the rule tally database.
申请公布号 US9519704(B2) 申请公布日期 2016.12.13
申请号 US201213885164 申请日期 2012.05.14
申请人 NetSpark LTD 发明人 Korenwaitz Elyasaf;Doron Moshe
分类号 G06F7/00;G06F17/00;G06F17/30;G06F17/27 主分类号 G06F7/00
代理机构 代理人 Lessoff Steven C.
主权项 1. A system for analyzing of a plurality of texts, the system comprising: A) a plurality of keywords stored in a non-transitory computer readable structure, each keyword of said plurality of keywords stored as a respective site of a plurality of sites, said structure including a plurality of branches, each branch having plurality of sub branches, a unique ordered combination of a branch and one or more sub branches leading to each said respective site; each branch of said plurality of branches and each sub branch of said plurality of sub branches being associated with one respective character string, each said keyword being the ordered combination of respective character strings associated with the unique ordered combination of branch and sub branches leading to said respective site and each branch of said plurality of branches being included in more than one said unique combination of a branch and one or more sub branches leading to more than one respective site; B) a plurality of databases including a respective database corresponding to each text of the plurality of texts; each said respective database including a plurality of counters including a respective incidence corresponding to each of said plurality of keywords; C) a computer configured for said analyzing text of said plurality of texts including a) reading a current character string from said one text starting at a start character and selecting from said plurality of branches a current branch associated with said character string,b) recursively navigating said structure according to said one text subsequent to said first character string; said recursively navigating including, i) further reading from said one text a sequential character string sequential to said current character string,ii) selecting from a plurality of sub branches of said current branch, a next branch associated with said sequential character string andiii) replacing said current character string with said sequential character string and replacing said current branch with said next branch and repeating said further reading and said selecting and said replacing until a current branch leads to an encountered site of said plurality of sites,c) incrementing in said respective database of said one text said respective incidence counter of a keyword when said keyword is found in said text starting at said start character by incrementing in said respective database of said one text said respective incidence counter of a keyword stored as said encountered site,e) designating a new start character in the one text and repeating said reading, said navigating, said incrementing andf) repeating said designating until for each of said plurality of keywords, said respective incidence counter indicates the number of times that said keyword appears in the one text; D) a plurality of score buffers including at least one respective score buffer for each said text; each said score buffer storing a computer modifiable value; and wherein said computer is further configured for updating said value stored in a respective score buffer for said one text dependent on a limitation on a value of said respective incidence counter of at least two of said plurality of keywords E) a rule counter and wherein said computer is further configured to set a value said rule counter dependent on at least two of said incidence counters, and wherein said updating is dependent on a value of said of rule counter.
地址 Petach Tikva IL