Principal claim |
1. A parameter adjustment method in statistical machine translation, characterized in that the method comprises the following steps:
Step 1: building a language model required for translation by utilizing a monolingual corpus;
Step 2: building a phrase translation model by utilizing a bilingual parallel corpus;
Step 3: adjusting the parameters $\lambda_m$ by utilizing the objective function
$$\min_{\lambda} \sum_{s=1}^{n} \left[ -\sum_{m=1}^{M} \lambda_m h_m(e_s, f_s) + \log \sum_{e' \in C_s} \exp\left\{ \sum_{m=1}^{M} \lambda_m h_m(f_s, e') + l(e', e_s) \right\} \right],$$
where $e_s$ refers to the reference translation, $e'$ refers to a machine translation, $f_s$ refers to the source-language sentence awaiting translation, $h_m(e_s, f_s)$ and $h_m(f_s, e')$ refer to the features used in building the translation system, the features comprising four main categories, namely language model, phrase translation table, sequence model, and corrective word penalty terms, $m = 1, \ldots, M$, $M$ refers to the total number of features, $l(e', e_s)$ refers to the cost function, and $C_s$ refers to the set of machine translation candidates, $e' \in C_s$. |
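The objective in Step 3 can be evaluated as in the following minimal sketch, offered for illustration only. It assumes that the feature values h_m and the costs l(e', e_s) have already been computed for each sentence. The function names `sentence_loss` and `corpus_objective` and the data layout are hypothetical and do not appear in the claim. The claim does not specify the optimizer or the form of the cost function, so the sketch only computes the value to be minimized over λ.

```python
import math


def sentence_loss(lambdas, ref_features, cand_features, cand_costs):
    """Bracketed term of the objective for one sentence s (hypothetical helper).

    lambdas       : weights lambda_1..lambda_M
    ref_features  : h_m(e_s, f_s) for the reference translation, length M
    cand_features : one length-M feature list per candidate e' in C_s
    cand_costs    : l(e', e_s) for each candidate, same order as cand_features
    """
    # Model score of the reference translation: sum_m lambda_m * h_m(e_s, f_s)
    ref_score = sum(lam * h for lam, h in zip(lambdas, ref_features))

    # Cost-augmented score of each candidate: sum_m lambda_m * h_m(f_s, e') + l(e', e_s)
    augmented = [
        sum(lam * h for lam, h in zip(lambdas, feats)) + cost
        for feats, cost in zip(cand_features, cand_costs)
    ]

    # log-sum-exp, with the maximum subtracted first for numerical stability
    top = max(augmented)
    log_sum = top + math.log(sum(math.exp(a - top) for a in augmented))

    return -ref_score + log_sum


def corpus_objective(lambdas, corpus):
    """Sum of sentence losses over s = 1..n; this is the quantity minimized over lambda.

    corpus is a list of (ref_features, cand_features, cand_costs) tuples.
    """
    return sum(sentence_loss(lambdas, *example) for example in corpus)
```

Any gradient-based or derivative-free minimizer could then be applied to `corpus_objective` as a function of `lambdas`; which one to use is left open here, as it is in the claim.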