Abstract |
The present invention discloses a method and a device for automatically converting a PDF document file. Tables in the PDF document file for which conversion is requested are analyzed according to the standard operations of that file; the analyzed tables are converted into a standard document based on predefined reference data, with cell-level data extracted as images rather than as text; and the converted standard document is then converted into an XML document according to an XML letter conversion format, so that it is structured as XML. Because the tables inserted in the PDF document file are accurately converted into, and provided as, an XML document file, the quality of document format conversion is fundamentally improved. |
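
The final step described above, in which table cells captured as images are serialized into a structured XML document, might be sketched as follows. This is only an illustrative sketch, not the claimed implementation: the `Cell` type, the `table_to_xml` function, the element and attribute names (`table`, `row`, `cell`, `col`, `encoding`), and the choice of base64 for embedding image bytes are all assumptions that do not appear in the source.

```python
import base64
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass
class Cell:
    """Hypothetical cell record: grid position plus the cell rendered as image bytes."""
    row: int
    col: int
    image: bytes


def table_to_xml(cells, table_id="t1"):
    """Serialize cell images into an XML string, grouped by row (assumed schema)."""
    table = ET.Element("table", {"id": table_id})
    rows = {}
    # Sort so rows and cells are emitted in reading order.
    for cell in sorted(cells, key=lambda c: (c.row, c.col)):
        row_el = rows.get(cell.row)
        if row_el is None:
            row_el = ET.SubElement(table, "row", {"index": str(cell.row)})
            rows[cell.row] = row_el
        cell_el = ET.SubElement(
            row_el, "cell", {"col": str(cell.col), "encoding": "base64"}
        )
        # Embed the image bytes as base64 text instead of recognized characters.
        cell_el.text = base64.b64encode(cell.image).decode("ascii")
    return ET.tostring(table, encoding="unicode")


if __name__ == "__main__":
    # Placeholder byte strings stand in for real cell images.
    sample = [Cell(0, 1, b"B"), Cell(0, 0, b"A"), Cell(1, 0, b"C")]
    print(table_to_xml(sample))
```

Keeping each cell as an image avoids character recognition errors inside tables, at the cost of the cell contents not being searchable as text in the resulting XML.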