发明名称 MACHINE LEARNING FOR TRANSLITERATION
摘要 <p>Methods, systems, and apparatus, including computer program products, for performing transliteration between text in different scripts. In one aspect, a method includes generating a transliteration model based on statistical information derived from parallel text having first text in an input script and corresponding second text in an output script; and using the transliteration model to transliterate input characters in the input script to output characters in the output script. In another aspect, a method includes performing word level transliterations. In another aspect, a method includes using an entry-aligned dictionary of source and target script pairs, in which, whenever a particular source word is mapped to multiple target words, the dictionary includes an entry for each target word including the same source word repeated in each entry. In another aspect, a method includes using phonetic scores of words in different scripts to identify corresponding parallel text.</p>
申请公布号 WO2008109769(A1) 申请公布日期 2008.09.12
申请号 WO2008US56087 申请日期 2008.03.06
申请人 GOOGLE INC.;KATRAGADDA, LALITESH;DESHPANDE, PAWAN;DUTTA, ANUPAMA;ARORA, NITIN 发明人 KATRAGADDA, LALITESH;DESHPANDE, PAWAN;DUTTA, ANUPAMA;ARORA, NITIN
分类号 G06F17/28 主分类号 G06F17/28
代理机构 代理人
主权项
地址