Title of invention |
Method, apparatus and computer program product for providing flexible text based language identification |
Abstract |
An apparatus for providing flexible text based language identification includes an alphabet scoring element, an n-gram frequency element and a processing element. The alphabet scoring element may be configured to receive an entry in a computer readable text format and to calculate an alphabet score of the entry for each of a plurality of languages. The n-gram frequency element may be configured to calculate an n-gram frequency score of the entry for each of the plurality of languages. The processing element may be in communication with the n-gram frequency element and the alphabet scoring element. The processing element may also be configured to determine a language associated with the entry based on a combination of the alphabet score and the n-gram frequency score.
|
Publication number |
US7552045(B2) |
Publication date |
2009.06.23 |
Application number |
US20060611964 |
Filing date |
2006.12.18 |
Applicant |
NOKIA CORPORATION |
Inventors |
BARLIGA BOGDAN;HARJU MIKKO A.;ISO-SIPILA JUHA |
Classification codes |
G06F17/20;G06F17/28 |
Main classification code |
G06F17/20 |
Patent agency |
|
Patent agent |
|
Main claim |
|
Address |
|
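
The scoring scheme described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The toy alphabets and training text, the function names (`alphabet_score`, `ngram_score`, `identify`), the use of character bigrams with add-one smoothing, and the rule for combining the two scores (log of the alphabet score plus the mean bigram log-probability) are all assumptions made for illustration; the record above states only that the two scores are combined.

```python
import math
from collections import Counter

# Hypothetical toy data: per-language alphabets and tiny training texts.
ALPHABETS = {
    "en": set("abcdefghijklmnopqrstuvwxyz"),
    "fi": set("abcdefghijklmnopqrstuvwxyzäöå"),
}
TRAINING_TEXT = {
    "en": "the quick brown fox jumps over the lazy dog and then the other thing",
    "fi": "hyvää päivää tämä on suomenkielinen teksti jossa on pitkiä sanoja",
}


def ngrams(text, n=2):
    """Return character n-grams of the lowercased text, padded with spaces."""
    padded = f" {text.lower()} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def train(text, n=2):
    """Count n-grams in a training text; returns (counts, total)."""
    counts = Counter(ngrams(text, n))
    return counts, sum(counts.values())


MODELS = {lang: train(text) for lang, text in TRAINING_TEXT.items()}


def alphabet_score(entry, lang):
    """Fraction of non-space characters that belong to the language's alphabet."""
    chars = [c for c in entry.lower() if not c.isspace()]
    if not chars:
        return 0.0
    return sum(c in ALPHABETS[lang] for c in chars) / len(chars)


def ngram_score(entry, lang, n=2):
    """Mean log-probability of the entry's n-grams, with add-one smoothing."""
    counts, total = MODELS[lang]
    grams = ngrams(entry, n)
    vocab = len(counts) + 1  # one extra slot for unseen n-grams
    log_prob = sum(math.log((counts[g] + 1) / (total + vocab)) for g in grams)
    return log_prob / len(grams)


def identify(entry):
    """Combine both scores per language and return (best_language, scores)."""
    scores = {}
    for lang in ALPHABETS:
        a = alphabet_score(entry, lang)
        log_a = math.log(a) if a > 0 else float("-inf")
        scores[lang] = log_a + ngram_score(entry, lang)
    return max(scores, key=scores.get), scores


if __name__ == "__main__":
    for word in ("päivää", "the dog"):
        print(word, "->", identify(word)[0])
```

Under these assumptions, an entry containing characters outside a language's alphabet is penalized by the alphabet term, while the n-gram term separates languages that share an alphabet. A real system would train the n-gram counts on far larger corpora and would likely tune how the two scores are weighted.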