发明名称 |
METHOD, APPARATUS AND COMPUTER PROGRAM PRODUCT FOR PROVIDING FLEXIBLE TEXT BASED LANGUAGE IDENTIFICATION |
摘要 |
An apparatus for providing flexible text based language identification includes an alphabet scoring element, an n-gram frequency element and a processing element. The alphabet scoring element may be configured to receive an entry in a computer readable text format and to calculate an alphabet score of the entry for each of a plurality of languages. The n-gram frequency element may be configured to calculate an n-gram frequency score of the entry for each of the plurality of languages. The processing element may be in communication with the n-gram frequency element and the alphabet scoring element. The processing element may also be configured to determine a language associated with the entry based on a combination of the alphabet score and the n-gram frequency score.
|
申请公布号 |
KR20090099069(A) |
申请公布日期 |
2009.09.21 |
申请号 |
KR20097014832 |
申请日期 |
2007.12.12 |
申请人 |
NOKIA CORPORATION |
发明人 |
BARLIGA BOGDAN;HARJU MIKKO A.;ISO SIPILA JUHA |
分类号 |
G06F17/27;G06F17/28 |
主分类号 |
G06F17/27 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|