摘要 |
<P>PROBLEM TO BE SOLVED: To easily form an electronic document with link information from a general paper document. <P>SOLUTION: A TI separation processing part 210 analyzes image data obtained by scanning the paper document by a scanner device to separate them to a character part and an image part. An OCR processing part 220 performs character recognition processing to the character part separated by the processing part 210, and outputs text data. A link object detection part 230 analyzes the text data obtained by the processing part 220 and detects specified character strings that correspond to link objects (a link source and a link destination) (e.g., Fig. 1, etc.), and a link destination discrimination part 240 discriminates the link destination from the detected link objects. A link generation part 250 generates a link from link object character strings other than the link destination (or link source character strings) to the link destination character string, and buries the generated link information in the electronic document. <P>COPYRIGHT: (C)2006,JPO&NCIPI |