发明名称 |
ARC FILTERING IN A SYNTACTIC GRAPH |
摘要 |
The present disclosure provides methods and systems for performing syntactic analysis of a text. In some implementations the method includes performing rough syntactic analysis of the text, generating a graph of generalized constituents of the text and filtering arcs of the graph of generalized constituents with a combination classifier which includes a tree classifier and one or more linear classifiers. The combination classifier is trained using parallel analysis of an untagged two-language text corpus. |
申请公布号 |
US2015199331(A1) |
申请公布日期 |
2015.07.16 |
申请号 |
US201514588690 |
申请日期 |
2015.01.02 |
申请人 |
ABBYY Infopoisk LLC |
发明人 |
Anisimovich Konstantin;Zuev Konstantin Alekseevich |
分类号 |
G06F17/27 |
主分类号 |
G06F17/27 |
代理机构 |
|
代理人 |
|
主权项 |
1. A computer-implemented method for text analysis, comprising:
identifying a sentence; identifying a graph of generalized constituents for the sentence based on rough syntactic analysis of the lexical-morphological structure of the sentence, the graph of generalized constituents comprises arcs and nodes; filtering the arcs of the graph of generalized constituents using a combination classifier; identifying a syntactic structure of the sentence by performing precise syntactic analysis of the sentence based on the filtered graph of generalized constituents of the sentence. |
地址 |
Moscow RU |