Abstract |
PROBLEM TO BE SOLVED: To associate the character strings in the data parts of a form with their corresponding headings, even in a form where those character strings are not arranged regularly in the vertical and horizontal directions. SOLUTION: Headings and character strings in itemization rows are associated with each other in order of how easily they can be specified, and the remaining items that are difficult to analyze are specified last by elimination. First, rows in which headings and character strings of itemization rows correspond 1:1 are specified (S12). Next, fixed-length character strings are specified (S14), followed by variable-length non-divided character strings (S15), variable-length divided character strings that correspond to headings 1:1 (S16), and variable-length divided character strings that correspond to headings 1:N (S17); each of these character strings is associated with its heading. An item in a plurality of itemization rows is then specified (S18), and the divided character strings are integrated into one character string (S19). COPYRIGHT: (C)2011,JPO&INPIT
|
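
The "easiest first, elimination last" ordering described in the abstract can be illustrated with a minimal Python sketch. It is not the patented procedure: the `Heading` type, the `fixed_length` field, the `associate` function, and the rule of matching fixed-length strings purely by character count are all assumptions made for illustration, and only simplified analogues of S12, S14, and S19 are shown.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Heading:
    """Hypothetical heading description (not taken from the source)."""
    name: str
    fixed_length: Optional[int] = None  # None is assumed to mean variable length


def associate(headings, tokens):
    """Associate the strings of one itemization row with headings, easiest first."""
    # Analogue of S12: same number of headings and strings, so map them 1:1 in order.
    if len(headings) == len(tokens):
        return {h.name: t for h, t in zip(headings, tokens)}

    result = {}
    remaining = list(tokens)
    unresolved = []

    # Analogue of S14: resolve fixed-length headings by character count (assumed rule).
    for h in headings:
        match = None
        if h.fixed_length is not None:
            match = next((t for t in remaining if len(t) == h.fixed_length), None)
        if match is None:
            unresolved.append(h)
        else:
            result[h.name] = match
            remaining.remove(match)

    # Elimination: a single unresolved heading takes every leftover string,
    # integrated into one character string (analogue of S19).
    if len(unresolved) == 1:
        result[unresolved[0].name] = " ".join(remaining)
    return result


# Example row in which the variable-length "Name" value was split into two strings.
headings = [Heading("Code", 4), Heading("Name"), Heading("Date", 10)]
row = ["A123", "Steel", "washer", "2011-01-05"]
record = associate(headings, row)
```

Resolving the unambiguous strings first shrinks the set of candidates, so the one heading that cannot be identified directly is determined by whatever is left over. A fuller treatment would also need the 1:N and multi-row cases (S17, S18), which this sketch leaves out.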