发明名称 METHOD AND SYSTEM FOR MOTIF EXTRACTION IN ELECTRONIC DOCUMENTS
摘要 A method, system, and computer program product for extracting text motifs from the electronic documents is disclosed. A user provides a largest-maximal repeat or a super-maximal repeat as a first text block. The occurrences of the first text block are detected to identify the second text blocks in the vicinity of the occurrences of the first text block on the basis of pre-defined parameters. The text motifs are determined by combining the first text block and the second text block. Finally, the text motifs are extracted from the electronic documents.
申请公布号 US2014074455(A1) 申请公布日期 2014.03.13
申请号 US201213608312 申请日期 2012.09.10
申请人 GALLE MATTHIAS;RENDERS JEAN-MICHEL;XEROX CORPORATION 发明人 GALLE MATTHIAS;RENDERS JEAN-MICHEL
分类号 G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项
地址