发明名称 IMAGE READING DEVICE, EXTRACTION METHOD FOR DICTIONARY REGISTRATION OBJECT WORD/PHRASE AND PROGRAM
摘要 PROBLEM TO BE SOLVED: To provide an image reading device for extracting not only words but also their attribute information from a document. SOLUTION: This image reading device is provided with an image reading means for reading the image of an original, and for generating input image data, a layout analyzing means for generating layout information from the input image data, an image dividing means for dividing the input image data into a plurality of small regions based on the layout information, a data base in which an identifier for specifying the small region having a title character string/image among the plurality of small regions and its title character string/image and an identifier for specifying the small region having an information character string/image and its information character string/image are stored so as to be associated with each other, an information extracting means for extracting registration object words and phrases from the small region having the title character string/image, and for extracting the attribute information from the small region having the information character string/image based on the information stored in the data base and an output means for outputting the dictionary registration object words and phrases. COPYRIGHT: (C)2007,JPO&INPIT
申请公布号 JP2006277104(A) 申请公布日期 2006.10.12
申请号 JP20050092626 申请日期 2005.03.28
申请人 FUJI XEROX CO LTD 发明人 TANAKA KEI;TATENO SHOICHI;KOYAMA TOSHIYA;NAGAO TAKASHI;SAKAKIBARA MASAYOSHI;SAITO TERUKA;HO SHINU;NAKAMURA KOTARO
分类号 G06F17/30;G06K9/20;G06K9/72 主分类号 G06F17/30
代理机构 代理人
主权项
地址