发明名称 Language recognition method, system and software
摘要 A language such as English or a language group such as an Asian language group is recognized based upon document image data. The document image data is processed to determine a minimal circumscribing rectangle for each character. The layout characteristics of the minimal circumscribing rectangles are quantified in a discrete number of ranges. The layout characteristic information includes a certain ratio with respect to the minimal circumscribing rectangle height and width as well as a black pixel density in the minimal circumscribing rectangle. Based upon the quantified layout characteristic information, an occurrence probability of a predetermined number of characters is determined using training data for a predetermined number of languages. The occurrence probability is stored in a table for later reference for an unknown input language.
申请公布号 US2005027511(A1) 申请公布日期 2005.02.03
申请号 US20040903131 申请日期 2004.07.30
申请人 OHGURO YOSHIHISA 发明人 OHGURO YOSHIHISA
分类号 G06F17/27;G06K9/68;(IPC1-7):G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项
地址