Abstract |
A plurality of source text files is read, each representing similar information in a different natural language. The files have correlated layouts, in that the same layout commands are employed at similar points in each file. Similar text from the respective files is aligned by identifying its position between equivalent word processing (WP) commands. Preferably, intermediate files are produced in which the WP commands are converted into an identifiable form. Because the text differs between the intermediate files whereas the WP commands do not, the aligned text is identified by a differential comparison operation, such as a call to DIFF within a UNIX environment.
|
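The alignment described in the abstract can be illustrated with a minimal sketch. It is not the claimed implementation: the backslash command syntax, the `@CMD:` marker and the function names `to_intermediate` and `align` are assumptions made for this example, and Python's `difflib.SequenceMatcher` stands in for the call to DIFF.

```python
import difflib
import re

# Assumed command syntax for illustration only: a backslash followed by
# lowercase letters (for example \par). Real WP formats differ.
COMMAND = re.compile(r"(\\[a-z]+)")


def to_intermediate(source):
    """Build an intermediate form: one item per WP command or text run.

    Commands are rewritten as '@CMD:name' (a hypothetical marker) so that
    they are identical across the language versions.
    """
    items = []
    for piece in COMMAND.split(source):
        piece = piece.strip()
        if not piece:
            continue
        if COMMAND.fullmatch(piece):
            items.append("@CMD:" + piece[1:])
        else:
            items.append(piece)
    return items


def align(items_a, items_b):
    """Pair the text runs that differ between two intermediate forms.

    Matching items (the commands) act as anchors; each differing region
    between two anchors is reported as one aligned text pair.
    """
    matcher = difflib.SequenceMatcher(None, items_a, items_b, autojunk=False)
    pairs = []
    for tag, a1, a2, b1, b2 in matcher.get_opcodes():
        if tag == "replace":
            pairs.append((" ".join(items_a[a1:a2]), " ".join(items_b[b1:b2])))
    return pairs


if __name__ == "__main__":
    english = r"\title Report \par The cat sleeps. \bold Warning \par Keep out."
    french = r"\title Rapport \par Le chat dort. \bold Attention \par Restez dehors."
    for pair in align(to_intermediate(english), to_intermediate(french)):
        print(pair)
```

In this sketch a text run that happens to be identical in both files (a number or a proper noun, say) is treated as an anchor and is not reported as a pair, so a fuller treatment would need to distinguish command markers from text before comparing.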