Abstract |
The present invention obtains a set of word pairs. Each word in the set is broken into its component characters, or into clusters of commonly co-occurring characters, and transliteration models are generated from these units using a conventional statistical machine translation algorithm. The transliteration models are then used to recover the correct spelling of an original-language source word from its transliterated form.
|
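As an illustration only, the character-level training described in the abstract might be sketched as follows. The abstract does not name a particular algorithm; this sketch assumes IBM Model 1 (a standard word-alignment model from statistical machine translation) applied to characters instead of words. The function names `split_chars` and `train_ibm1` and the toy word pairs are hypothetical, and clustering of co-occurring characters and decoding of full words are omitted.

```python
from collections import defaultdict


def split_chars(word):
    """Break a word into its component characters."""
    return list(word)


def train_ibm1(pairs, iterations=10):
    """Run EM for IBM Model 1 over character tokens (an assumed choice).

    pairs: list of (transliterated_chars, original_chars) tuples.
    Returns t, where t[(s, g)] estimates P(transliterated char s | original char g).
    """
    src_vocab = {c for src, _ in pairs for c in src}
    # Uniform initialisation over the transliterated-side alphabet.
    t = defaultdict(lambda: 1.0 / len(src_vocab))
    for _ in range(iterations):
        count = defaultdict(float)
        total = defaultdict(float)
        # E-step: collect expected alignment counts.
        for src, tgt in pairs:
            for s in src:
                norm = sum(t[(s, g)] for g in tgt)
                for g in tgt:
                    frac = t[(s, g)] / norm
                    count[(s, g)] += frac
                    total[g] += frac
        # M-step: renormalise counts into probabilities.
        for (s, g), c in count.items():
            t[(s, g)] = c / total[g]
    return t


# Hypothetical toy data: (transliterated form, original spelling).
word_pairs = [("ab", "xy"), ("a", "x"), ("b", "y")]
char_pairs = [(split_chars(s), split_chars(g)) for s, g in word_pairs]
model = train_ibm1(char_pairs)
```

On this toy data the model learns that `a` is the more likely rendering of `x`, and `b` of `y`. A practical system would add a decoding step that searches for the most probable original spelling of a whole transliterated word.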