摘要 |
<p>Methods, systems, and apparatus, including computer program products, for performing transliteration between text in different scripts. In one aspect, a method includes generating a transliteration model based on statistical information derived from parallel text having first text in an input script and corresponding second text in an output script; and using the transliteration model to transliterate input characters in the input script to output characters in the output script. In another aspect, a method includes performing word level transliterations. In another aspect, a method includes using an entry-aligned dictionary of source and target script pairs, in which, whenever a particular source word is mapped to multiple target words, the dictionary includes an entry for each target word including the same source word repeated in each entry. In another aspect, a method includes using phonetic scores of words in different scripts to identify corresponding parallel text.</p> |
申请人 |
GOOGLE INC.;KATRAGADDA, LALITESH;DESHPANDE, PAWAN;DUTTA, ANUPAMA;ARORA, NITIN |
发明人 |
KATRAGADDA, LALITESH;DESHPANDE, PAWAN;DUTTA, ANUPAMA;ARORA, NITIN |