发明名称 Methods and systems for generation of document structures based on sequential constraints
摘要 Disclosed is a method that structures a sequentially-ordered set of elements (S305), each being characterized by a set of features. N-grams (sequence of n features) are computed (S320) from a set for n contiguous elements, and n-grams which are repetitive (Kleene cross) are selected (S330). Elements matching the most frequent repetitive n-gram are grouped together under a new node, and a new sequence is created (S335). The method is iteratively applied (S340) to this new sequence. The output is an ordered set of trees.
申请公布号 EP2811425(A2) 申请公布日期 2014.12.10
申请号 EP20140171115 申请日期 2014.06.04
申请人 XEROX CORPORATION 发明人 DEJEAN, HERVE
分类号 G06K9/62;G06K9/00;G06K9/72 主分类号 G06K9/62
代理机构 代理人
主权项
地址