发明名称 Automated correction of natural language processing systems
摘要 Machine logic that automatically detects natural language processing (NLP) system annotation errors and correspondingly updates NLP annotators to prevent future erroneous annotations by performing the following steps: (i) determining that a first annotation error has occurred in an annotation of a corpus by the natural language processing system; (ii) generating a candidate set of annotation correction actions, where each annotation correction action of the set is adapted to prevent an occurrence of an error similar to the first annotation error by the natural language processing system; (iii) selecting an annotation correction action from the candidate set of annotation correction actions, based, at least in part, on a set of annotation correction confidence characteristics; and (iv) automatically applying the selected annotation correction action to the natural language processing system.
申请公布号 US9535894(B2) 申请公布日期 2017.01.03
申请号 US201514696677 申请日期 2015.04.27
申请人 International Business Machines Corporation 发明人 Carrier Scott R.;Mustafi Joy;Omanwar Anil M.;Polisetty Venkata Sai Avinesh
分类号 G06F17/24;G06F17/28;G06F17/27;G06F17/25 主分类号 G06F17/24
代理机构 代理人 Hartwell William H.
主权项 1. A method comprising: causing, by one or more processors, a first natural language processing (NLP) annotator of an NLP system to annotate a corpus, thereby producing an annotated corpus that includes a first set of annotation(s); causing, by one or more processors, a second NLP annotator of the NLP system to annotate the annotated corpus that includes the first set of annotation(s), thereby producing a second set of annotation(s) that annotate the annotated corpus that includes the first set of annotation(s); determining, by one or more processors, based, at least in part, on the second set of annotation(s), that a first annotation error has occurred in the annotation of the corpus by the first NLP annotator; identifying, by one or more processors, a cause of the first annotation error; generating, by one or more processors, a candidate set of annotation correction actions, where each annotation correction action of the set is directed to the identified cause and is adapted to prevent an occurrence of an error similar to the first annotation error by the first NLP annotator; selecting, by one or more processors, an annotation correction action from the candidate set of annotation correction actions, based, at least in part, on a set of annotation correction confidence characteristics and based, at least in part, on an impact analysis performed against one or more ground truth annotations, the ground truth annotations having been performed by humans; automatically applying, by one or more processors, the selected annotation correction action to the first NLP annotator; and causing, by one or more processors, the first NLP annotator to annotate a new corpus, thereby producing new annotation(s) based on the applied annotation correction action.
地址 Armonk NY US