发明名称 RESTORATION OF MODIFIED DOCUMENT TO ORIGINAL STATE
摘要 Techniques are disclosed for restoring a modified document to an original state. The modified document is scanned into a digital form using an optical scanning device. The content of the modified digital document including one or more annotations is then grouped into several components, including text, images, form fields and text boxes, and marked shapes, based on corresponding component specifications. Each component is then categorized as being structured or unstructured. Structured components that correspond with representative entries in a component repository, such as text in a standard font size, weight and style, are identified as core document content. Unstructured components are identified as annotated document content or highlighted document content, depending on certain characteristics of the components. The categorized and identified components can then be presented separately or in various combinations.
申请公布号 US2016124813(A1) 申请公布日期 2016.05.05
申请号 US201414529620 申请日期 2014.10.31
申请人 ADOBE SYSTEMS INCORPORATED 发明人 Jain Ajay
分类号 G06F11/14;G06F17/24;G06F17/30 主分类号 G06F11/14
代理机构 代理人
主权项 1. A computer-implemented data processing method comprising: receiving, by a processor, a document comprising content original to the document and content added to the document by a document user; comparing, by the processor, the document to representative document components in a component repository, wherein the original content has a characteristic that matches at least one of the representative document components, and wherein the added content has no characteristic that matches any of the representative document components; identifying, by the processor based on the comparison, a first portion of the document containing the original content and a second portion of the document containing the added content; and generating, by the processor, a copy of the document in which the first portion of the document containing the original content is included and the second portion of the document containing added content is excluded.
地址 San Jose CA US