发明名称 GENERATING A REGULAR EXPRESSION FOR ENTITY EXTRACTION
摘要 A computer receives a formatted query having a plain text word. The computer selects each character in the plain text word. The computer identifies a group of characters from a confusion matrix that are commonly confused with the character selected. The computer generates a set of characters for each character selected, wherein the set of characters begin with one of the each character selected followed by and ending with the group of characters from the confusion matrix. The computer generates a regular expression by concatenating each of the set of characters.
申请公布号 US2014309984(A1) 申请公布日期 2014.10.16
申请号 US201313860547 申请日期 2013.04.11
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 Bostick James E.;Dalal Keyur D.;Ganci, JR. John M.;Trim Craig M.
分类号 G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项 1. A method for generating a regular expression utilized for entity extraction, the method comprising the steps of: receiving a formatted query having a plain text word; selecting each character in the plain text word; identifying a group of characters from a confusion matrix that are commonly confused with the character selected; generating a set of characters for each character selected, wherein the set of characters begin with one of the each character selected followed by and ending with the group of characters from the confusion matrix; and generating a regular expression by concatenating each of the set of characters.
地址 Armonk NY US