发明名称 Automated text annotation for construction of natural language understanding grammars
摘要 Aspects described herein provide various approaches to annotating text samples in order to construct natural language grammars. A text sample may be selected for annotation. A set of annotation candidates may be generated based on the text sample. A classifier may be used to score the set of annotation candidates in order to obtain a set of annotation scores. One of the annotation candidates may be selected as a suggested annotation for the text sample based on the set of annotation scores. A grammar rule may be derived based on the suggested annotation, and a grammar may be configured to include the annotation-derived grammar rule.
申请公布号 US9524289(B2) 申请公布日期 2016.12.20
申请号 US201414188206 申请日期 2014.02.24
申请人 Nuance Communications, Inc. 发明人 Rachevsky Leonid;Bakis Raimo;Ramabhadran Bhuvana
分类号 G06F17/27;G10L15/06;G10L15/187;G06F17/30 主分类号 G06F17/27
代理机构 Banner & Witcoff, Ltd. 代理人 Banner & Witcoff, Ltd.
主权项 1. A computer-implemented method of constructing a grammar comprising: training, by a computing device, a classifier using a confirmed annotation, the training comprising updating a corpus of the classifier with a feature extracted from the confirmed annotation, the feature comprising a hypernym of the confirmed annotation and at least one word of the confirmed annotation, the at least one word being adjacent to the hypernym in the confirmed annotation, and wherein extracting the feature comprises: obtaining a concatenated hypernym by concatenating at least two hypernyms of the confirmed annotation, obtaining a sequence comprising the concatenated hypernym, and extracting the hypernym from a substring of the confirmed annotation corresponding to the sequence; selecting, by a computing device, a digital text sample to annotate; transforming, by the computing device, the text sample into a set of annotation candidates; scoring, by the computing device, the set of annotation candidates using the classifier to obtain a set of annotation scores respectively for the set of annotation candidates; selecting, by the computing device, one of the annotation candidates in the set of annotation candidates as a suggested annotation for the text sample based on the set of annotation scores; deriving, by the computing device, an annotation-derived grammar rule based on the suggested annotation; and configuring, by the computing device, a digital grammar to include the annotation-derived grammar rule.
地址 Burlington MA US