发明名称 LANGUAGE SEGMENTATION OF MULTILINGUAL TEXTS
摘要 The claimed subject matter provides a system and/or method for segmenting a multi-language text. An exemplary method comprises determining an initial probability distribution for sentences in the multi-language text, the initial probability distribution indicating the likelihood of each sentence being in each of a set of languages. A probability of language transitions across sentences may be learned based on the initial probability distribution. Additionally, a highest probability language sequence of sentences in the multi-language text may be determined based on a combination of the probability of language transitions and the prior probability distribution provided by an initial model.
申请公布号 US2012203540(A1) 申请公布日期 2012.08.09
申请号 US201113022630 申请日期 2011.02.08
申请人 AUE ANTHONY;MICROSOFT CORPORATION 发明人 AUE ANTHONY
分类号 G06F17/20 主分类号 G06F17/20
代理机构 代理人
主权项
地址