METHOD OF RECOGNIZING TEXT INFORMATION FROM A VECTOR/RASTER IMAGE,申请号US20100816307-传众专利搜索

发明名称	METHOD OF RECOGNIZING TEXT INFORMATION FROM A VECTOR/RASTER IMAGE
摘要	A method is claimed for processing a vector-raster image file which contains a text image. The method comprises the steps of: fragmenting the image to obtain regions containing non-separable, logically connected fragments of text of the maximum possible size; processing text, vector, and raster objects; discarding excessive information; analyzing each object with the help of all available information. The step of processing text objects includes the steps of: dividing into separate characters and character groups according to supposed locations of blank spaces or other non-indicated symbols, and analyzing and assembling character groups into words and verifying and correcting characters encoding based on recognition of assembled words as raster objects. The step of processing vector objects includes the step of identifying separators, background, and substrates of blocks. The step of processing raster objects includes the steps of: analyzing non-text objects on order to detect text images within them, and/or detecting vector objects other than separators.
申请公布号	US2010254606(A1)	申请公布日期	2010.10.07
申请号	US20100816307	申请日期	2010.06.15
申请人	ABBYY SOFTWARE LTD	发明人	MASALOVITCH ANTON;KUZNETSOV SERGEY;DERIAGUINE DMITRI
分类号	G06K9/34	主分类号	G06K9/34
代理机构		代理人
主权项
地址