发明名称 |
System and method of providing machine translation from a source language to a target language |
摘要 |
A machine translation method, system for using the method, and computer readable media are disclosed. The method includes the steps of receiving a source language sentence, selecting a set of target language n-grams using a lexical classifier and based on the source language sentence. When selecting the set of target language n-grams, in at least one n-gram, n is greater than 1. The method continues by combining the selected set of target language n-grams as a finite state acceptor (FSA), weighting the FSA with data from the lexical classifier, and generating an n-best list of target sentences from the FSA. As an alternate to using the FSA, N strings may be generated from the n-grams and ranked using a language model. The N strings may be represented by an FSA for efficiency but it is not necessary. |
申请公布号 |
US8849665(B2) |
申请公布日期 |
2014.09.30 |
申请号 |
US200812022819 |
申请日期 |
2008.01.30 |
申请人 |
AT&T Intellectual Property I, L.P. |
发明人 |
Bangalore Srinivas;Ettelaie Emil |
分类号 |
G10L15/00;G10L15/18;G06F17/28 |
主分类号 |
G10L15/00 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method comprising:
selecting, via a processor using a lexical classifier, a bag of target language n-grams associated with a source language sentence, wherein the bag of target language n-grams comprises a beginning n-gram having a start token in a first word position, and an ending n-gram having an end token in a second word position, the end token connecting a history node to a final state node; combining the bag of target language n-grams to yield an n-gram network; ranking N strings in the n-gram network using a language model to yield an n-best list of target sentences; and generating, via the processor, a target sentence based on the n-best list. |
地址 |
Atlanta GA US |