Title of invention ARC FILTERING IN A SYNTACTIC GRAPH
Abstract The present disclosure provides methods and systems for performing syntactic analysis of a text. In some implementations, the method includes performing rough syntactic analysis of the text, generating a graph of generalized constituents of the text, and filtering arcs of the graph of generalized constituents with a combination classifier that includes a tree classifier and one or more linear classifiers. The combination classifier is trained using parallel analysis of an untagged two-language text corpus.
Publication number US2015199331(A1) Publication date 2015.07.16
Application number US201514588690 Filing date 2015.01.02
Applicant ABBYY Infopoisk LLC Inventors Anisimovich Konstantin; Zuev Konstantin Alekseevich
Classification G06F17/27 Main classification G06F17/27
Agency Agent
Principal claim 1. A computer-implemented method for text analysis, comprising: identifying a sentence; identifying a graph of generalized constituents for the sentence based on rough syntactic analysis of the lexical-morphological structure of the sentence, the graph of generalized constituents comprising arcs and nodes; filtering the arcs of the graph of generalized constituents using a combination classifier; and identifying a syntactic structure of the sentence by performing precise syntactic analysis of the sentence based on the filtered graph of generalized constituents of the sentence.
Address Moscow RU
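
The arc-filtering step named in the abstract and the principal claim can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the `Arc` layout, the feature vectors, the weights, the routing rule `route_by_distance`, and the choice to let a tree step select which linear classifier scores an arc. The record only states that the combination classifier includes a tree classifier and one or more linear classifiers; how they are combined here is one possible reading.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass(frozen=True)
class Arc:
    """Candidate link between two constituents (hypothetical layout)."""
    head: int
    dependent: int
    features: Sequence[float]  # illustrative numeric features, not from the patent


@dataclass
class LinearClassifier:
    """Scores an arc as a weighted sum of its features plus a bias."""
    weights: Sequence[float]
    bias: float = 0.0

    def score(self, features: Sequence[float]) -> float:
        return sum(w * x for w, x in zip(self.weights, features)) + self.bias


@dataclass
class CombinationClassifier:
    """Assumed combination: a tree step picks a leaf, and that leaf's
    linear classifier decides whether the arc is kept."""
    route: Callable[[Arc], str]
    leaves: Dict[str, LinearClassifier]

    def keep(self, arc: Arc) -> bool:
        leaf = self.leaves[self.route(arc)]
        return leaf.score(arc.features) > 0.0


def filter_arcs(arcs: List[Arc], clf: CombinationClassifier) -> List[Arc]:
    """Return only the arcs the combination classifier accepts."""
    return [arc for arc in arcs if clf.keep(arc)]


def route_by_distance(arc: Arc) -> str:
    """Hypothetical one-split tree: route on head-dependent distance."""
    return "near" if abs(arc.head - arc.dependent) <= 2 else "far"


clf = CombinationClassifier(
    route=route_by_distance,
    leaves={
        "near": LinearClassifier(weights=[1.0, -1.0], bias=0.0),
        "far": LinearClassifier(weights=[1.0, -1.0], bias=-0.5),
    },
)

arcs = [
    Arc(0, 1, [0.9, 0.2]),
    Arc(0, 5, [0.6, 0.3]),
    Arc(2, 3, [0.1, 0.4]),
    Arc(1, 6, [0.75, 0.0]),
]

kept = filter_arcs(arcs, clf)
print([(a.head, a.dependent) for a in kept])
```

In this sketch the filtered list would then be passed on to the precise syntactic analysis stage, which is outside the scope of the example.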