Title of invention: Automatic language identification by stroke geometry analysis
Abstract: A computer-implemented process identifies an unknown language used to create a document. A set of training documents is defined in a variety of known languages and formed from a variety of text styles. Black-and-white electronic pixel images are formed of the text material in the training documents and in the document in the unknown language. A plurality of line strokes is defined from the black pixels, and point features that are effective to characterize each of the languages are extracted from the strokes. Point features from the unknown language are compared with point features from the known languages to identify the known language that best represents the unknown language.
Publication number: US6064767(A)  Publication date: 2000.05.16
Application number: US19980008225  Filing date: 1998.01.16
Applicant: REGENTS OF THE UNIVERSITY OF CALIFORNIA  Inventors: MUIR, DOUGLAS W.; THOMAS, TIMOTHY R.
Classification: G06K9/68; (IPC1-7): G06K9/46  Main classification: G06K9/68
Agency:  Agent:
Principal claim:
Address:
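
The pipeline outlined in the abstract (binary pixel images, strokes built from black pixels, features compared against known languages) can be illustrated with a small sketch. This is not the patented method: the record does not say how strokes or point features are defined, so the sketch assumes that a "stroke" is a horizontal run of black pixels, that the feature is a normalized histogram of run lengths, and that the comparison is an L1 nearest-profile match. All function names and the `max_len` parameter are hypothetical.

```python
from collections import Counter


def horizontal_strokes(image):
    """Yield the length of each horizontal run of black pixels (value 1).

    Assumption: a "stroke" is approximated by a horizontal run; the patent's
    actual stroke definition is not given in this record.
    """
    for row in image:
        run = 0
        for pixel in row:
            if pixel:
                run += 1
            elif run:
                yield run
                run = 0
        if run:  # a run that reaches the right edge of the row
            yield run


def feature_vector(image, max_len=8):
    """Normalized histogram of run lengths, clipped at max_len (hypothetical feature)."""
    counts = Counter(min(length, max_len) for length in horizontal_strokes(image))
    total = sum(counts.values())
    if total == 0:
        return [0.0] * max_len
    return [counts.get(k, 0) / total for k in range(1, max_len + 1)]


def train(training_images):
    """Average the feature vectors of the training images for each known language.

    training_images maps a language name to a non-empty list of binary images.
    """
    profiles = {}
    for language, images in training_images.items():
        vectors = [feature_vector(img) for img in images]
        profiles[language] = [sum(col) / len(vectors) for col in zip(*vectors)]
    return profiles


def identify(image, profiles):
    """Return the known language whose profile is closest in L1 distance."""
    target = feature_vector(image)

    def distance(language):
        return sum(abs(a - b) for a, b in zip(target, profiles[language]))

    return min(profiles, key=distance)
```

To use the sketch, pass `train` a dictionary that maps each known language to a list of binary images (lists of rows of 0/1 values), then call `identify` with the image of the unknown document and the returned profiles. A realistic system would need features that hold up across fonts and scan quality, which is the role of the training set drawn from a variety of text styles.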