Title of invention: Approximate matching of strings for message filtering
Abstract: A method of determining whether a guarded term is represented in a message comprises associating a portion of the message with the guarded term and evaluating a cost of the association. A method of generating a collection of guarded terms that represents an original term comprises generating a plurality of variations of the original term, evaluating similarity of each of the plurality of variations with respect to the original term and determining whether the similarity meets a predetermined criterion.
Publication number: US9471712(B2) Publication date: 2016.10.18
Application number: US200711927458 Filing date: 2007.10.29
Applicant: DELL SOFTWARE INC. Inventors: Oliver Jonathan J.; Oliver Andrew F.
Classification: G06F17/30; H04L12/58 Main classification: G06F17/30
Agency: Polsinelli LLP Agent: Polsinelli LLP
Principal claim: 1. A method of identifying strings in an e-mail message, the method comprising: receiving an e-mail message; and executing instructions stored in memory, wherein execution of the instructions by a processor: identifies a text string in the e-mail message; determines that the identified text string in the e-mail message is not a safe string, wherein safe strings are predetermined strings stored in a database of acceptable terms and identified as legitimately present in e-mail messages; associates the text string with a guarded term from a database of guarded terms stored in memory, the guarded term being a string of special interest to a user; evaluates a cost of the association of the identified text string that dictates a probability that the identified text string is a mutation of the associated guarded term, wherein the evaluation compares similarities and differences between the identified text string and the guarded term, and wherein the evaluation assigns different penalties for the cost based on whether the mutation includes regular characters or special characters; matches the identified text string with the guarded term when the cost of association of the identified text string meets a predetermined threshold; and characterizes the e-mail message based on the matching between the identified text string and the guarded term.
Address: Round Rock TX US
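
The matching steps recited in the principal claim can be illustrated with a minimal sketch. It is not the patented implementation: it swaps in a weighted edit distance as the cost of association, and the penalty values, the threshold, the reading of "meets a predetermined threshold" as "cost at or below the threshold", and the names `char_penalty`, `association_cost`, and `match_guarded_term` are all assumptions made for illustration.

```python
# Hypothetical penalty values. The claim only states that regular and
# special characters are penalised differently, not by how much.
REGULAR_PENALTY = 1.0   # letters and digits
SPECIAL_PENALTY = 0.25  # punctuation, symbols, whitespace


def char_penalty(ch: str) -> float:
    """Penalty for a mutation (insert, delete, substitute) involving ch."""
    return REGULAR_PENALTY if ch.isalnum() else SPECIAL_PENALTY


def association_cost(text: str, guarded: str) -> float:
    """Weighted edit distance from a message string to a guarded term.

    A lower cost means the string is more likely a mutation of the term.
    """
    a, b = text.lower(), guarded.lower()
    # prev[j] = cost of turning the empty prefix of a into b[:j]
    prev = [0.0]
    for cb in b:
        prev.append(prev[-1] + char_penalty(cb))
    for ca in a:
        cur = [prev[0] + char_penalty(ca)]
        for j, cb in enumerate(b, start=1):
            # Substitution is priced by the character found in the message.
            substitute = prev[j - 1] + (0.0 if ca == cb else char_penalty(ca))
            delete = prev[j] + char_penalty(ca)      # extra char in message
            insert = cur[j - 1] + char_penalty(cb)   # char missing from message
            cur.append(min(substitute, delete, insert))
        prev = cur
    return prev[-1]


def match_guarded_term(text, guarded_terms, safe_strings, threshold=1.5):
    """Return the matched guarded term, or None if the string is safe or no
    guarded term is within the (assumed) cost threshold."""
    if text.lower() in safe_strings:
        return None  # safe strings are never matched
    best = min(guarded_terms,
               key=lambda g: association_cost(text, g),
               default=None)
    if best is not None and association_cost(text, best) <= threshold:
        return best
    return None
```

With these assumed values, `match_guarded_term("v.i.a.g.r.a", ["viagra"], {"niagara"})` returns `"viagra"`, because removing five special characters costs 1.25, while `"Niagara"` is rejected up front as a safe string. A message could then be characterized, for example flagged as suspect, whenever any of its strings matches a guarded term.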