发明名称 System and methods for arabic text recognition and arabic corpus building
摘要 A method for automatically recognizing Arabic text includes building an Arabic dataset comprising Arabic text files written in different writing styles and actual meanings of the Arabic text corresponding to each of the Arabic text files (100), storing writing-style indices in association with the Arabic text files (105), digitizing a line of Arabic characters to form an array of pixels (321-323) (130), dividing the line of the Arabic characters into line images, (311-313) (120), forming a text feature vector from the line images (311-313) (140), training a Hidden Markov Model using the Arabic text files and ground truths in the Arabic dataset in accordance with the writing-style indices (160), and feeding the text feature vector into a Hidden Markov Model to recognize the line of Arabic characters (170).
申请公布号 EP2804131(A3) 申请公布日期 2016.09.07
申请号 EP20130184319 申请日期 2013.09.13
申请人 KING ABDULAZIZ CITY FOR SCIENCE AND TECHNOLOGY 发明人 KHORSHEED, MOHAMMAD S.;AL-OMARI, HUSSEIN K.;OSFOOR, MAJED IBRAHIM BIN;ALOBAID, ABDULAZIZ OBAID;ALFALEH, HUSSAM ABDULRAHMAN;ASFOUR, ARWA IBRAHEM BIN
分类号 G06K9/68;G06K9/62 主分类号 G06K9/68
代理机构 代理人
主权项
地址