发明名称 Translating texts between languages
摘要 Methods and computer systems for translating sentences between languages from an intermediate language-independent semantic representation are provided. Based on a comprehensive understanding about languages and semantics, exhaustive linguistic descriptions are used to analyze sentences, build syntactic structures and language independent semantic structures and representations, and synthesize one or more sentences in a natural or artificial language. A computer system is also provided to analyze and synthesize various linguistic structures and perform translation of a wide spectrum of various sentence types. As result, a generalized data structure, such as a semantic structure, is generated from a sentence of an input language and can be transformed into a natural sentence expressing its meaning correctly in an output language. The methods and systems can be applied to automated abstracting, machine translation, natural language processing, control systems, Internet information retrieval, etc.
申请公布号 US9053090(B2) 申请公布日期 2015.06.09
申请号 US201213528716 申请日期 2012.06.20
申请人 ABBYY InfoPoisk LLC 发明人 Anisimovich Konstantin;Selegey Vladimir;Zuev Konstantin
分类号 G06F17/28;G06F17/27 主分类号 G06F17/28
代理机构 代理人 Weiland LeighAnn;Weinstein Veronica
主权项 1. A computer-implemented method of translating a source text in a source language into an output language, comprising: obtaining, using a processor, the source text in the source language from an input detected in a device; dividing, using a processor, the source text into sentences; translating, using a processor, each sentence of the source text, said translating comprising: executing a first syntactic analysis, said first syntactic analysis including generating a set of possible relationships between words of the respective sentence and building a graph of generalized constituents;executing a second syntactic analysis of each sentence, said second syntactic analysis comprising: putting forward at least one hypothesis related to a syntactic structure of the sentence, wherein the putting forward the at least one hypothesis comprises performing a lexical selection to select a first lexical meaning from a first lexical class for a node of a syntactic tree of the sentence from a set of possible lexical classes; andverifying the hypothesis about the syntactic structure of the sentence;generating a language-independent semantic structure of the sentence based at least in part on the verified hypothesis;generating a syntactic structure of an output sentence in the output language based at least in part on the generated language-independent semantic structure; andperforming a morphological synthesis of the output sentence in the output language based at least in part on the generated syntactic structure of the output sentence; generating, using a processor, a text in the output language based on said translating of each of the sentences of the source text; detecting, using a processor, a translation error associated with incorrect lexical selection comprising comparing the first lexical class of the selected first lexical meaning to a topic of the source text; and repeating, using a processor, translation of the identified sentences with a second lexical meaning from a second lexical class, wherein the second lexical class is different from the first lexical class.
地址 RU