摘要 |
Provided is a method for mining a data regular expression. The method comprises: obtaining data to be stored, and storing the data by using a dictionary tree structure; performing a node upgrade according to a regular expression rule; separately performing branch combination according to the number of subnodes having a same character; identifying an interfering branch, and performing branch deletion; and converting a rule tree to be in a character string format and outputting it. Obtained data is stored in a dictionary tree structure, so that mass data can be mined, data nodes are upgraded, branches are combined, an interfering branch is deleted, and finally, a generated rule tree is converted to be in a character string format for outputting, so as to mine a regular expression of mass data comprising erroneous data, and the rule tree can meet the requirement for mining the erroneous data and can be used to check data and find erroneous data thereof. In addition, further provided is a device for mining a data regular expression. |