发明名称 Document alignment systems for legacy document conversions
摘要 A method for aligning documents which may be in different XML formats includes inputting source and target leaves of a source and documents in first and second tree structured formats and assigning a cost to each of a plurality of matches. Each match may include a source leaf and a target leaf or be an unmatched source or target leaf. Matches are identified for which a total cost is minimal, wherein each of the leaves is in at least one of the identified matches. From the identified matches, groups of two or more matches are identified which have a leaf in common. From the groups, probable matches are identified in which more that one target leaf is matched with at least one source leaf or more than one source leaf is matched with a target leaf. An alignment between leaves of the target document and leaves of the source document is output which includes the probable matches.
申请公布号 US2007150443(A1) 申请公布日期 2007.06.28
申请号 US20050315458 申请日期 2005.12.22
申请人 XEROX CORPORATION. 发明人 BERGHOLZ ANDRE;CHIDLOVSKII BORIS
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址