Title of invention: A METHOD FOR IDENTIFYING PDF DOCUMENT
Abstract: The present invention discloses a method for identifying a PDF document, comprising the following steps: S1: analyzing the path objects in the PDF document and identifying the tables in the PDF document; S2: analyzing the text objects outside the table regions in the PDF document and recognizing the text content in the PDF document; S3: writing the identified results into a temporary file, or writing them into the PDF document as an attachment. The method can identify tables, paragraphs, titles, tabulations and so on in the PDF document, so that the document can be edited paragraph by paragraph and conveniently tagged to establish the reading order, which helps readers with visual impairment; the identified results can also be used to export the document in other formats, which makes it considerably easier for users to read and edit the PDF document.
Publication No.: US2016247020(A1) Publication date: 2016.08.25
Application No.: US201414778155 Filing date: 2014.03.14
Applicant: Fujian Foxit Software Development Joint Stock Co., Ltd. Inventor: Fan Xiaolong
Classification No.: G06K9/00;G06F17/27;G06F17/24 Main classification No.: G06K9/00
Agency: Agent:
Main claim: 1. A method for identifying a PDF document, comprising the following steps: S1: analyzing the path objects in the PDF document and identifying the tables in the PDF document; S2: analyzing the text objects outside the table regions in the PDF document and recognizing the text content in the PDF document; S3: writing the identified results into a temporary file, or writing them into the PDF document as an attachment.
Address: Fuzhou CN
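Illustrative sketch: steps S1 to S3 of the claimed method could be pictured roughly as follows. This is not the patented implementation. It assumes path objects have already been reduced to straight ruling lines and text objects to positioned strings; the names `Segment`, `TextObject`, `find_table_regions`, `text_outside_tables` and `write_results` are hypothetical, and the bounding-box clustering and the tolerance values are simplifications chosen for the example. Only the temporary-file branch of S3 is shown.

```python
import json
import tempfile
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """A straight ruling line taken from a path object (hypothetical model)."""
    x0: float
    y0: float
    x1: float
    y1: float


@dataclass(frozen=True)
class TextObject:
    """A text run with its origin on the page (hypothetical model)."""
    x: float
    y: float
    text: str


def find_table_regions(segments, tol=1.0):
    """S1 (sketch): merge touching ruling lines into bounding boxes."""
    boxes = []
    for s in segments:
        box = [min(s.x0, s.x1), min(s.y0, s.y1), max(s.x0, s.x1), max(s.y0, s.y1)]
        merged = True
        while merged:  # keep absorbing boxes until nothing else touches this one
            merged = False
            for other in boxes:
                if (box[0] <= other[2] + tol and other[0] <= box[2] + tol
                        and box[1] <= other[3] + tol and other[1] <= box[3] + tol):
                    box = [min(box[0], other[0]), min(box[1], other[1]),
                           max(box[2], other[2]), max(box[3], other[3])]
                    boxes.remove(other)
                    merged = True
                    break
        boxes.append(box)
    # A lone line (an underline, say) has no area, so it is not treated as a table.
    return [tuple(b) for b in boxes if b[2] - b[0] > tol and b[3] - b[1] > tol]


def text_outside_tables(text_objects, regions, line_tol=2.0):
    """S2 (sketch): keep text outside every table region and group it into lines."""
    outside = [
        t for t in text_objects
        if not any(x0 <= t.x <= x1 and y0 <= t.y <= y1 for x0, y0, x1, y1 in regions)
    ]
    outside.sort(key=lambda t: -t.y)  # PDF y grows upward, so larger y is higher
    lines = []  # each entry: [reference_y, [(x, text), ...]]
    for t in outside:
        if lines and abs(lines[-1][0] - t.y) <= line_tol:
            lines[-1][1].append((t.x, t.text))
        else:
            lines.append([t.y, [(t.x, t.text)]])
    # Within a line, order the runs left to right before joining them.
    return [" ".join(text for _, text in sorted(runs)) for _, runs in lines]


def write_results(regions, lines):
    """S3 (sketch): write the results to a temporary JSON file and return its path."""
    result = {"tables": [list(r) for r in regions], "text_lines": lines}
    with tempfile.NamedTemporaryFile(
        "w", suffix=".json", delete=False, encoding="utf-8"
    ) as fh:
        json.dump(result, fh)
        return fh.name
```

In this sketch a caller would pass the ruling lines of a page to `find_table_regions`, pass the resulting regions together with the page's text runs to `text_outside_tables`, and hand both results to `write_results`. A real implementation would also need to parse the PDF content stream, detect cells inside each table, and classify the remaining text into paragraphs and titles, none of which is attempted here.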