Title of invention: Method, apparatus and computer program product for providing flexible text based language identification
Abstract: An apparatus for providing flexible text based language identification includes an alphabet scoring element, an n-gram frequency element and a processing element. The alphabet scoring element may be configured to receive an entry in a computer readable text format and to calculate an alphabet score of the entry for each of a plurality of languages. The n-gram frequency element may be configured to calculate an n-gram frequency score of the entry for each of the plurality of languages. The processing element may be in communication with the n-gram frequency element and the alphabet scoring element. The processing element may also be configured to determine a language associated with the entry based on a combination of the alphabet score and the n-gram frequency score.
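Illustrative sketch (not part of the patent record): the abstract describes two per-language scores that are combined to pick a language. The Python below is one possible reading under stated assumptions. The toy `PROFILES` data, the function names, the use of bigrams, and the weighted-sum combination with `alphabet_weight` are all hypothetical; the abstract says only that the scores are combined, not how.

```python
from collections import Counter

# Hypothetical toy profiles: an alphabet and a tiny sample text per language.
# A real system would use far larger training data.
PROFILES = {
    "en": {
        "alphabet": set("abcdefghijklmnopqrstuvwxyz"),
        "sample": "the quick brown fox jumps over the lazy dog and then the other thing",
    },
    "fi": {
        "alphabet": set("abcdefghijklmnopqrstuvwxyzåäö"),
        "sample": "hyvää päivää tämä on suomen kielen näyte ja se sisältää äänteitä",
    },
}


def ngrams(text, n=2):
    """Return character n-grams of the lowercased text, padded with spaces."""
    padded = f" {text.lower()} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def alphabet_score(entry, alphabet):
    """Fraction of the entry's letters that belong to the given alphabet."""
    letters = [c for c in entry.lower() if c.isalpha()]
    if not letters:
        return 0.0
    return sum(c in alphabet for c in letters) / len(letters)


def ngram_frequency_score(entry, sample, n=2):
    """Mean relative frequency, in the sample, of the entry's n-grams."""
    counts = Counter(ngrams(sample, n))
    total = sum(counts.values())
    grams = ngrams(entry, n)
    if not grams or total == 0:
        return 0.0
    return sum(counts[g] / total for g in grams) / len(grams)


def identify_language(entry, profiles=PROFILES, alphabet_weight=0.5):
    """Combine both scores per language (assumed: weighted sum) and pick the best."""
    scores = {}
    for lang, profile in profiles.items():
        a = alphabet_score(entry, profile["alphabet"])
        f = ngram_frequency_score(entry, profile["sample"])
        scores[lang] = alphabet_weight * a + (1 - alphabet_weight) * f
    best = max(scores, key=scores.get)
    return best, scores


if __name__ == "__main__":
    for word in ("päivää", "the dog"):
        print(word, identify_language(word)[0])
```

In this sketch the alphabet score ranges from 0 to 1 while the n-gram score is much smaller, so the alphabet score dominates and the n-gram score mainly separates languages that share an alphabet. A different normalization or combination rule may be closer to what the claims actually specify.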
Publication number: US7552045(B2)  Publication date: 2009.06.23
Application number: US20060611964  Filing date: 2006.12.18
Applicant: NOKIA CORPORATION  Inventors: BARLIGA BOGDAN; HARJU MIKKO A.; ISO-SIPILA JUHA
Classification: G06F17/20; G06F17/28  Main classification: G06F17/20
Agency:  Agent:
Main claim:
Address: